; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPIUnG00280 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPIUnG00280
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationScaffold000101:116943..118906
RNA-Seq ExpressionCSPIUnG00280
SyntenyCSPIUnG00280
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004601 - peroxidase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ97017.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.5e-25569.58Show/hide
Query:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----
        MLT+MY  RQRQ   SELTGVS  KRK+R  D+++GDEEGETSLSLE GAGQDRIKFKKLEMPVFNGEDP+GWIYRAEHYFQMHLLNEQEKLKIAI    
Subjt:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----

Query:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------
                            ELK+R+Y RF+ RE+GT CARFLAIKQE SV EYLQRFEELS PLPEMAEDVLVG FTNGLDP+IRT             
Subjt:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------

Query:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS
               EEKLE AR   GP                             +P   + + +SQ  ATG G RR+  FRRWTDSELQARR+K LCY+C+EPFS
Subjt:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS

Query:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV
        KGHRCKN+ELRL +VADDLED EM D   E   +EVSPVVELSLNSVVGLT  GTFK+KGTVE++EI+IM+DCGATHNFISLKLV+   LP  ETT+Y V
Subjt:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV

Query:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF
        IMGSGKAVQG+G+CK ITVGLPV++I+EDFLPLELGN+DMVLGMQWL+KQG MTVDWKALTMTF+VGDT VILKGDPSLTRMEISLK+LVKTWQ DDQ F
Subjt:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF

Query:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF
        LV+FRAMGIPK DR L+ T ++EE Q E  QLQ+EF DVF           IDH IQLKE TDPIN+RPYRY HAQKNEI+ LVN+ML SGIIRPS +PF
Subjt:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF

Query:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF
        SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDEL+G+SIFSKIDLKSGYHQIRVRDEDI KT F
Subjt:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF

TYK10423.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.2e-25669.73Show/hide
Query:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----
        MLT+MY  RQRQ   SELTGVS  KRK+R  D+++GDEEGETSLSLE GAGQDRIKFKKLEMPVFNGEDP+GWIYRAEHYFQMHLLNEQEKLKIAI    
Subjt:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----

Query:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------
                            ELK+R+Y RF+ RE+GT CARFLAIKQE SV EYLQRFEELS PLPEMAEDVLVG FTNGLDP+IRT             
Subjt:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------

Query:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS
               EEKLE AR   GP                             +P   + + +SQ  ATG G RR+  FRRWTDSELQARR+K LCY+C+EPFS
Subjt:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS

Query:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV
        KGHRCKNKELRL +VADDLED EM D   E   +EVSPVVELSLNSVVGLT  GTFK+KGTVE++EI+IM+DCGATHNFISLKLV+   LP  ETT+Y V
Subjt:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV

Query:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF
        IMGSGKAVQG+G+CK ITVGLPV++I+EDFLPLELGN+DMVLGMQWL+KQG MTVDWKALTMTF+VGDT VILKGDPSLTRMEISLK+LVKTWQ DDQ F
Subjt:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF

Query:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF
        LV+FRAMGIPK DR L+ T ++EE Q E  QLQ+EF DVF           IDH IQLKE TDPIN+RPYRY HAQKNEI+ LVN+ML SGIIRPS +PF
Subjt:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF

Query:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF
        SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDEL+G+SIFSKIDLKSGYHQIRVRDEDI KT F
Subjt:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF

TYK21209.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.2e-25669.73Show/hide
Query:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----
        MLT+MY  RQRQ   SELTGVS  KRK+R  D+++GDEEGETSLSLE GAGQDRIKFKKLEMPVFNGEDP+GWIYRAEHYFQMHLLNEQEKLKIAI    
Subjt:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----

Query:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------
                            ELK+R+Y RF+ RE+GT CARFLAIKQE SV EYLQRFEELS PLPEMAEDVLVG FTNGLDP+IRT             
Subjt:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------

Query:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS
               EEKLE AR   GP                             +P   + + +SQ  ATG G RR+  FRRWTDSELQARR+K LCY+C+EPFS
Subjt:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS

Query:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV
        KGHRCKNKELRL +VADDLED EM D   E   +EVSPVVELSLNSVVGLT  GTFK+KGTVE++EI+IM+DCGATHNFISLKLV+   LP  ETT+Y V
Subjt:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV

Query:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF
        IMGSGKAVQG+G+CK ITVGLPV++I+EDFLPLELGN+DMVLGMQWL+KQG MTVDWKALTMTF+VGDT VILKGDPSLTRMEISLK+LVKTWQ DDQ F
Subjt:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF

Query:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF
        LV+FRAMGIPK DR L+ T ++EE Q E  QLQ+EF DVF           IDH IQLKE TDPIN+RPYRY HAQKNEI+ LVN+ML SGIIRPS +PF
Subjt:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF

Query:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF
        SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDEL+G+SIFSKIDLKSGYHQIRVRDEDI KT F
Subjt:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF

TYK23724.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.0e-25669.88Show/hide
Query:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----
        MLT+MY  RQRQ   SELTGVS  KRK+R  D+++GDEEGETSLSLE GAGQDRIKFKKLEMPVFNGEDP+GWIYRAEHYFQMHLLNEQEKLKIAI    
Subjt:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----

Query:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------
                            ELK+R+YNRF+ RE+GT CARFLAIKQE SV EYLQRFEELS PLPEMAEDVLVG FTNGLDP+IRT             
Subjt:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------

Query:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS
               EEKLE AR   GP                             +P   + + +SQ  ATG G RR+  FRRWTDSELQARR+K LCY+C+EPFS
Subjt:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS

Query:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV
        KGHRCKNKELRL +VADDLED EM D   E   +EVSPVVELSLNSVVGLT  GTFK+KGTVE++EI+IM+DCGATHNFISLKLV+   LP  ETT+Y V
Subjt:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV

Query:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF
        IMGSGKAVQG+G+CK ITVGLPV++I+EDFLPLELGN+DMVLGMQWL+KQG MTVDWKALTMTF+VGDT VILKGDPSLTRMEISLK+LVKTWQ DDQ F
Subjt:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF

Query:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF
        LV+FRAMGIPK DR L+ T ++EE Q E  QLQ+EF DVF           IDH IQLKE TDPIN+RPYRY HAQKNEI+ LVN+ML SGIIRPS +PF
Subjt:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF

Query:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF
        SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDEL+G+SIFSKIDLKSGYHQIRVRDEDI KT F
Subjt:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF

TYK26407.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.2e-25669.73Show/hide
Query:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----
        MLT+MY  RQRQ   SELTGVS  KRK+R  D+++GDEEGETSLSLE GAGQDRIKFKKLEMPVFNGEDP+GWIYRAEHYFQMHLLNEQEKLKIAI    
Subjt:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----

Query:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------
                            ELK+R+Y RF+ RE+GT CARFLAIKQE SV EYLQRFEELS PLPEMAEDVLVG FTNGLDP+IRT             
Subjt:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------

Query:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS
               EEKLE AR   GP                             +P   + + +SQ  ATG G RR+  FRRWTDSELQARR+K LCY+C+EPFS
Subjt:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS

Query:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV
        KGHRCKNKELRL +VADDLED EM D   E   +EVSPVVELSLNSVVGLT  GTFK+KGTVE++EI+IM+DCGATHNFISLKLV+   LP  ETT+Y V
Subjt:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV

Query:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF
        IMGSGKAVQG+G+CK ITVGLPV++I+EDFLPLELGN+DMVLGMQWL+KQG MTVDWKALTMTF+VGDT VILKGDPSLTRMEISLK+LVKTWQ DDQ F
Subjt:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF

Query:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF
        LV+FRAMGIPK DR L+ T ++EE Q E  QLQ+EF DVF           IDH IQLKE TDPIN+RPYRY HAQKNEI+ LVN+ML SGIIRPS +PF
Subjt:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF

Query:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF
        SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDEL+G+SIFSKIDLKSGYHQIRVRDEDI KT F
Subjt:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF

TrEMBL top hitse value%identityAlignment
A0A5D3BD16 Ty3/gypsy retrotransposon protein7.3e-25669.58Show/hide
Query:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----
        MLT+MY  RQRQ   SELTGVS  KRK+R  D+++GDEEGETSLSLE GAGQDRIKFKKLEMPVFNGEDP+GWIYRAEHYFQMHLLNEQEKLKIAI    
Subjt:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----

Query:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------
                            ELK+R+Y RF+ RE+GT CARFLAIKQE SV EYLQRFEELS PLPEMAEDVLVG FTNGLDP+IRT             
Subjt:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------

Query:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS
               EEKLE AR   GP                             +P   + + +SQ  ATG G RR+  FRRWTDSELQARR+K LCY+C+EPFS
Subjt:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS

Query:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV
        KGHRCKN+ELRL +VADDLED EM D   E   +EVSPVVELSLNSVVGLT  GTFK+KGTVE++EI+IM+DCGATHNFISLKLV+   LP  ETT+Y V
Subjt:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV

Query:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF
        IMGSGKAVQG+G+CK ITVGLPV++I+EDFLPLELGN+DMVLGMQWL+KQG MTVDWKALTMTF+VGDT VILKGDPSLTRMEISLK+LVKTWQ DDQ F
Subjt:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF

Query:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF
        LV+FRAMGIPK DR L+ T ++EE Q E  QLQ+EF DVF           IDH IQLKE TDPIN+RPYRY HAQKNEI+ LVN+ML SGIIRPS +PF
Subjt:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF

Query:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF
        SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDEL+G+SIFSKIDLKSGYHQIRVRDEDI KT F
Subjt:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF

A0A5D3CEX8 Ty3/gypsy retrotransposon protein2.5e-25669.73Show/hide
Query:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----
        MLT+MY  RQRQ   SELTGVS  KRK+R  D+++GDEEGETSLSLE GAGQDRIKFKKLEMPVFNGEDP+GWIYRAEHYFQMHLLNEQEKLKIAI    
Subjt:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----

Query:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------
                            ELK+R+Y RF+ RE+GT CARFLAIKQE SV EYLQRFEELS PLPEMAEDVLVG FTNGLDP+IRT             
Subjt:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------

Query:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS
               EEKLE AR   GP                             +P   + + +SQ  ATG G RR+  FRRWTDSELQARR+K LCY+C+EPFS
Subjt:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS

Query:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV
        KGHRCKNKELRL +VADDLED EM D   E   +EVSPVVELSLNSVVGLT  GTFK+KGTVE++EI+IM+DCGATHNFISLKLV+   LP  ETT+Y V
Subjt:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV

Query:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF
        IMGSGKAVQG+G+CK ITVGLPV++I+EDFLPLELGN+DMVLGMQWL+KQG MTVDWKALTMTF+VGDT VILKGDPSLTRMEISLK+LVKTWQ DDQ F
Subjt:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF

Query:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF
        LV+FRAMGIPK DR L+ T ++EE Q E  QLQ+EF DVF           IDH IQLKE TDPIN+RPYRY HAQKNEI+ LVN+ML SGIIRPS +PF
Subjt:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF

Query:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF
        SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDEL+G+SIFSKIDLKSGYHQIRVRDEDI KT F
Subjt:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF

A0A5D3DD68 Ty3/gypsy retrotransposon protein2.5e-25669.73Show/hide
Query:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----
        MLT+MY  RQRQ   SELTGVS  KRK+R  D+++GDEEGETSLSLE GAGQDRIKFKKLEMPVFNGEDP+GWIYRAEHYFQMHLLNEQEKLKIAI    
Subjt:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----

Query:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------
                            ELK+R+Y RF+ RE+GT CARFLAIKQE SV EYLQRFEELS PLPEMAEDVLVG FTNGLDP+IRT             
Subjt:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------

Query:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS
               EEKLE AR   GP                             +P   + + +SQ  ATG G RR+  FRRWTDSELQARR+K LCY+C+EPFS
Subjt:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS

Query:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV
        KGHRCKNKELRL +VADDLED EM D   E   +EVSPVVELSLNSVVGLT  GTFK+KGTVE++EI+IM+DCGATHNFISLKLV+   LP  ETT+Y V
Subjt:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV

Query:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF
        IMGSGKAVQG+G+CK ITVGLPV++I+EDFLPLELGN+DMVLGMQWL+KQG MTVDWKALTMTF+VGDT VILKGDPSLTRMEISLK+LVKTWQ DDQ F
Subjt:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF

Query:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF
        LV+FRAMGIPK DR L+ T ++EE Q E  QLQ+EF DVF           IDH IQLKE TDPIN+RPYRY HAQKNEI+ LVN+ML SGIIRPS +PF
Subjt:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF

Query:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF
        SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDEL+G+SIFSKIDLKSGYHQIRVRDEDI KT F
Subjt:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF

A0A5D3DJA9 Ty3/gypsy retrotransposon protein5.1e-25769.88Show/hide
Query:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----
        MLT+MY  RQRQ   SELTGVS  KRK+R  D+++GDEEGETSLSLE GAGQDRIKFKKLEMPVFNGEDP+GWIYRAEHYFQMHLLNEQEKLKIAI    
Subjt:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----

Query:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------
                            ELK+R+YNRF+ RE+GT CARFLAIKQE SV EYLQRFEELS PLPEMAEDVLVG FTNGLDP+IRT             
Subjt:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------

Query:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS
               EEKLE AR   GP                             +P   + + +SQ  ATG G RR+  FRRWTDSELQARR+K LCY+C+EPFS
Subjt:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS

Query:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV
        KGHRCKNKELRL +VADDLED EM D   E   +EVSPVVELSLNSVVGLT  GTFK+KGTVE++EI+IM+DCGATHNFISLKLV+   LP  ETT+Y V
Subjt:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV

Query:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF
        IMGSGKAVQG+G+CK ITVGLPV++I+EDFLPLELGN+DMVLGMQWL+KQG MTVDWKALTMTF+VGDT VILKGDPSLTRMEISLK+LVKTWQ DDQ F
Subjt:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF

Query:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF
        LV+FRAMGIPK DR L+ T ++EE Q E  QLQ+EF DVF           IDH IQLKE TDPIN+RPYRY HAQKNEI+ LVN+ML SGIIRPS +PF
Subjt:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF

Query:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF
        SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDEL+G+SIFSKIDLKSGYHQIRVRDEDI KT F
Subjt:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF

A0A5D3DRT3 Ty3/gypsy retrotransposon protein2.5e-25669.73Show/hide
Query:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----
        MLT+MY  RQRQ   SELTGVS  KRK+R  D+++GDEEGETSLSLE GAGQDRIKFKKLEMPVFNGEDP+GWIYRAEHYFQMHLLNEQEKLKIAI    
Subjt:  MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAI----

Query:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------
                            ELK+R+Y RF+ RE+GT CARFLAIKQE SV EYLQRFEELS PLPEMAEDVLVG FTNGLDP+IRT             
Subjt:  --------------------ELKDRMYNRFQCREHGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRT-------------

Query:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS
               EEKLE AR   GP                             +P   + + +SQ  ATG G RR+  FRRWTDSELQARR+K LCY+C+EPFS
Subjt:  -------EEKLEAARNPQGP-----------------------------VPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCYQCDEPFS

Query:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV
        KGHRCKNKELRL +VADDLED EM D   E   +EVSPVVELSLNSVVGLT  GTFK+KGTVE++EI+IM+DCGATHNFISLKLV+   LP  ETT+Y V
Subjt:  KGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNV

Query:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF
        IMGSGKAVQG+G+CK ITVGLPV++I+EDFLPLELGN+DMVLGMQWL+KQG MTVDWKALTMTF+VGDT VILKGDPSLTRMEISLK+LVKTWQ DDQ F
Subjt:  IMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQF

Query:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF
        LV+FRAMGIPK DR L+ T ++EE Q E  QLQ+EF DVF           IDH IQLKE TDPIN+RPYRY HAQKNEI+ LVN+ML SGIIRPS +PF
Subjt:  LVDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVF-----------IDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPF

Query:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF
        SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDEL+G+SIFSKIDLKSGYHQIRVRDEDI KT F
Subjt:  SSPVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF

SwissProt top hitse value%identityAlignment
P10394 Retrovirus-related Pol polyprotein from transposon 4126.6e-2034.18Show/hide
Query:  EEIQLEMEQLQKEFEDVF--------IDHI----IQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPFSSPVILVKKKDG------GWR
        E  + ++E +  E+ D+F        ++++    ++LK+D +P+  + YR  H+Q  EI+  V +++   I+ PS++ ++SP++LV KK         WR
Subjt:  EEIQLEMEQLQKEFEDVF--------IDHI----IQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPFSSPVILVKKKDG------GWR

Query:  FCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTT
          +DYR +N+  + DKFP+P ID++LD+L  +  FS +DL SG+HQI + DE  R  T
Subjt:  FCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTT

P20825 Retrovirus-related Pol polyprotein from transposon 2976.6e-2026.56Show/hide
Query:  VKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNVIMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLEL----GNLDMVLGMQWLRKQGTM
        +K   + R    ++D G+T N I+  +     LP  + +   V+  +G       +  N  + LP  +I +   P  +     N DM++G + L K    
Subjt:  VKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNVIMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLEL----GNLDMVLGMQWLRKQGTM

Query:  TVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQFL-----VDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVFIDHIIQLK
         +++K  T+T    D    L    S     + ++   ++  S DQ+ +       FR   + + +   +     +   LE ++ +K      I H++   
Subjt:  TVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQFL-----VDFRAMGIPKTDRMLMATKSIEEIQLEMEQLQKEFEDVFIDHIIQLK

Query:  EDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPFSSPVILVKKKDGG-----WRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKID
         ++ PI  + Y      + E++  V EML  G+IR S +P++SP  +V KK        +R  +DYR LN  T+PD++PIP +DE+L +L     F+ ID
Subjt:  EDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPFSSPVILVKKKDGG-----WRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKID

Query:  LKSGYHQIRVRDEDIRKTTF
        L  G+HQI + +E I KT F
Subjt:  LKSGYHQIRVRDEDIRKTTF

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.8e-2541.13Show/hide
Query:  EEIQLEMEQLQKEFEDVFIDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPFSSPVILVKKKDGGWRFCVDYRALNRATVPDKFP
        E I+ ++     +  ++ + H I++K       L+PY      + EI  +V ++L +  I PS +P SSPV+LV KKDG +R CVDYR LN+AT+ D FP
Subjt:  EEIQLEMEQLQKEFEDVFIDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPFSSPVILVKKKDGGWRFCVDYRALNRATVPDKFP

Query:  IPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF
        +P ID LL  +  + IF+ +DL SGYHQI +  +D  KT F
Subjt:  IPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.5e-1933.54Show/hide
Query:  LMATKSIEEIQLEMEQLQKEFEDVFIDHIIQLKEDT-----------DPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPFSSPVILVKKK-----
        L+A +  +  Q  +  L  EF  +F   +  +  +T           DPI  + Y Y    + E++  ++E+L  GIIRPS +P++SP+ +V KK     
Subjt:  LMATKSIEEIQLEMEQLQKEFEDVFIDHIIQLKEDT-----------DPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPFSSPVILVKKK-----

Query:  DGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF
        +  +R  VD++ LN  T+PD +PIP I+  L  L  +  F+ +DL SG+HQI +++ DI KT F
Subjt:  DGGWRFCVDYRALNRATVPDKFPIPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.8e-2541.13Show/hide
Query:  EEIQLEMEQLQKEFEDVFIDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPFSSPVILVKKKDGGWRFCVDYRALNRATVPDKFP
        E I+ ++     +  ++ + H I++K       L+PY      + EI  +V ++L +  I PS +P SSPV+LV KKDG +R CVDYR LN+AT+ D FP
Subjt:  EEIQLEMEQLQKEFEDVFIDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPFSSPVILVKKKDGGWRFCVDYRALNRATVPDKFP

Query:  IPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF
        +P ID LL  +  + IF+ +DL SGYHQI +  +D  KT F
Subjt:  IPMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein4.7e-1324.44Show/hide
Query:  FLAIKQEQSVNEYLQRFEEL---STPLPEMAEDVLVGTFTNGLDPIIRTEEKLEAARNPQGPVPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARR
        +  I+QE SV +Y +RFE L   S  LP    + +   F  GL P ++T  +      P G     S  +    +T                 ++L   +
Subjt:  FLAIKQEQSVNEYLQRFEEL---STPLPEMAEDVLVGTFTNGLDPIIRTEEKLEAARNPQGPVPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARR

Query:  EKRLCYQCDEPFSKGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDE
        +K           KG            V ++LE+ E +      G  ++          V+ LT     +  G + D ++++ ID GAT NFI ++L   
Subjt:  EKRLCYQCDEPFSKGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDE

Query:  QSLPTTETTSYNVIMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELG--NLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGD-PSLTRMEI
          LPT+ T   +V++G  + +Q  G C  I + +  + I E+FL L+L   ++D++LG +WL K G   V+W+    +F      + L  +   L ++  
Subjt:  QSLPTTETTSYNVIMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLELG--NLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGD-PSLTRMEI

Query:  SLKMLVKTWQSDDQQ
         +KM  +  Q D ++
Subjt:  SLKMLVKTWQSDDQQ

AT3G30770.1 Eukaryotic aspartyl protease family protein1.3e-0727.89Show/hide
Query:  GTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNVIMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLEL--GNLDMVLGMQWLRKQGTMTVDW
        G +   +++++ID GAT+NFIS +L     LPT+ T   +V++G  + +Q  G C  I + +  + I E+FL L+L   ++D++LG    +      + W
Subjt:  GTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNVIMGSGKAVQGQGMCKNITVGLPVLTIIEDFLPLEL--GNLDMVLGMQWLRKQGTMTVDW

Query:  KALTMTFIVGDTNVIL-KGDPSLTRMEISLKMLVKTWQSDDQQFLVD
             +F      V L   D  L ++   +KM  +  Q     +L D
Subjt:  KALTMTFIVGDTNVIL-KGDPSLTRMEISLKMLVKTWQSDDQQFLVD

ATMG00850.1 DNA/RNA polymerases superfamily protein5.0e-0756.41Show/hide
Query:  QKNEIKTLVNEMLTSGIIRPSINPFSSPVILVKKKDGGW
        ++  +K  + EML + II+PSI+P+SSPV+LV+KKDGGW
Subjt:  QKNEIKTLVNEMLTSGIIRPSINPFSSPVILVKKKDGGW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGACCAAGATGTATGCGAAACGTCAACGGCAACAAGAAGGGTCAGAACTGACCGGAGTAAGCATCGACAAGAGGAAACTTCGACCAAACGACATCATGGAAGGAGA
CGAAGAAGGAGAAACGTCTCTGAGCTTGGAAATAGGGGCTGGGCAGGATCGAATCAAATTTAAAAAACTGGAAATGCCGGTTTTCAACGGAGAAGATCCAAATGGCTGGA
TCTATAGGGCGGAGCATTATTTTCAGATGCATTTACTGAATGAACAAGAGAAACTGAAGATTGCGATAGAGTTGAAAGATCGGATGTATAATCGGTTTCAATGTCGAGAA
CATGGAACCAGCTGCGCACGATTTTTAGCCATCAAACAAGAACAGTCGGTGAACGAATACCTGCAACGTTTCGAGGAACTATCGACGCCGTTACCTGAGATGGCCGAGGA
CGTGTTAGTGGGGACATTTACAAATGGGCTGGACCCTATTATTAGGACAGAGGAAAAATTGGAAGCGGCTCGGAATCCTCAAGGCCCGGTACCAACGCACTCCTCTACCT
CGATCTCCTCCCAGGTTACCGCAACGGGTAGTGGGGTTCGGCGCGAAAACAACTTCCGCAGGTGGACCGATTCTGAGCTCCAAGCACGACGAGAAAAGAGACTTTGCTAT
CAGTGTGACGAACCGTTTAGCAAGGGACATCGCTGCAAAAACAAAGAGCTCCGCCTTTACCTTGTAGCCGATGACTTAGAGGATACAGAGATGGAAGACGTGGAAAACGA
GAATGGCCCCATTGAAGTCAGTCCGGTGGTGGAATTATCCTTGAATTCGGTGGTGGGTCTAACAACCCTGGGTACATTCAAGGTTAAAGGCACAGTGGAAGACAGAGAGA
TAATTATCATGATAGATTGTGGAGCAACCCACAATTTCATCTCTCTCAAGTTGGTAGACGAACAAAGTTTACCAACGACAGAAACAACCAGCTACAACGTCATTATGGGA
TCTGGGAAGGCAGTGCAAGGTCAAGGGATGTGCAAGAACATCACGGTAGGGCTGCCGGTACTGACAATCATTGAGGATTTCTTACCGCTTGAATTAGGCAACTTAGATAT
GGTGCTGGGAATGCAATGGCTCCGAAAACAAGGGACGATGACTGTCGATTGGAAGGCGTTAACCATGACATTCATTGTCGGGGACACTAATGTCATTTTGAAAGGGGACC
CCTCGCTGACCAGAATGGAAATATCGTTGAAGATGCTGGTGAAAACATGGCAATCCGACGATCAACAATTCCTAGTTGACTTCAGAGCTATGGGAATCCCCAAGACTGAC
CGAATGTTAATGGCCACAAAATCAATCGAGGAAATACAACTCGAAATGGAACAACTTCAAAAGGAATTCGAAGATGTTTTCATTGATCACATAATCCAACTGAAGGAAGA
CACCGACCCCATTAATTTAAGACCATACCGTTACCTGCACGCACAGAAAAATGAGATCAAAACGCTGGTGAATGAGATGTTGACGTCGGGTATCATAAGACCGAGCATCA
ACCCATTTTCCAGTCCGGTAATTCTAGTAAAAAAGAAGGATGGCGGGTGGAGATTTTGTGTCGATTATAGAGCTTTGAATCGCGCGACAGTACCAGACAAATTCCCAATT
CCGATGATTGATGAGCTGTTGGATGAGTTGAATGGCTCGAGTATCTTCTCCAAGATAGACTTGAAATCAGGTTACCATCAAATCCGAGTGCGAGATGAAGACATAAGGAA
GACAACCTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGACCAAGATGTATGCGAAACGTCAACGGCAACAAGAAGGGTCAGAACTGACCGGAGTAAGCATCGACAAGAGGAAACTTCGACCAAACGACATCATGGAAGGAGA
CGAAGAAGGAGAAACGTCTCTGAGCTTGGAAATAGGGGCTGGGCAGGATCGAATCAAATTTAAAAAACTGGAAATGCCGGTTTTCAACGGAGAAGATCCAAATGGCTGGA
TCTATAGGGCGGAGCATTATTTTCAGATGCATTTACTGAATGAACAAGAGAAACTGAAGATTGCGATAGAGTTGAAAGATCGGATGTATAATCGGTTTCAATGTCGAGAA
CATGGAACCAGCTGCGCACGATTTTTAGCCATCAAACAAGAACAGTCGGTGAACGAATACCTGCAACGTTTCGAGGAACTATCGACGCCGTTACCTGAGATGGCCGAGGA
CGTGTTAGTGGGGACATTTACAAATGGGCTGGACCCTATTATTAGGACAGAGGAAAAATTGGAAGCGGCTCGGAATCCTCAAGGCCCGGTACCAACGCACTCCTCTACCT
CGATCTCCTCCCAGGTTACCGCAACGGGTAGTGGGGTTCGGCGCGAAAACAACTTCCGCAGGTGGACCGATTCTGAGCTCCAAGCACGACGAGAAAAGAGACTTTGCTAT
CAGTGTGACGAACCGTTTAGCAAGGGACATCGCTGCAAAAACAAAGAGCTCCGCCTTTACCTTGTAGCCGATGACTTAGAGGATACAGAGATGGAAGACGTGGAAAACGA
GAATGGCCCCATTGAAGTCAGTCCGGTGGTGGAATTATCCTTGAATTCGGTGGTGGGTCTAACAACCCTGGGTACATTCAAGGTTAAAGGCACAGTGGAAGACAGAGAGA
TAATTATCATGATAGATTGTGGAGCAACCCACAATTTCATCTCTCTCAAGTTGGTAGACGAACAAAGTTTACCAACGACAGAAACAACCAGCTACAACGTCATTATGGGA
TCTGGGAAGGCAGTGCAAGGTCAAGGGATGTGCAAGAACATCACGGTAGGGCTGCCGGTACTGACAATCATTGAGGATTTCTTACCGCTTGAATTAGGCAACTTAGATAT
GGTGCTGGGAATGCAATGGCTCCGAAAACAAGGGACGATGACTGTCGATTGGAAGGCGTTAACCATGACATTCATTGTCGGGGACACTAATGTCATTTTGAAAGGGGACC
CCTCGCTGACCAGAATGGAAATATCGTTGAAGATGCTGGTGAAAACATGGCAATCCGACGATCAACAATTCCTAGTTGACTTCAGAGCTATGGGAATCCCCAAGACTGAC
CGAATGTTAATGGCCACAAAATCAATCGAGGAAATACAACTCGAAATGGAACAACTTCAAAAGGAATTCGAAGATGTTTTCATTGATCACATAATCCAACTGAAGGAAGA
CACCGACCCCATTAATTTAAGACCATACCGTTACCTGCACGCACAGAAAAATGAGATCAAAACGCTGGTGAATGAGATGTTGACGTCGGGTATCATAAGACCGAGCATCA
ACCCATTTTCCAGTCCGGTAATTCTAGTAAAAAAGAAGGATGGCGGGTGGAGATTTTGTGTCGATTATAGAGCTTTGAATCGCGCGACAGTACCAGACAAATTCCCAATT
CCGATGATTGATGAGCTGTTGGATGAGTTGAATGGCTCGAGTATCTTCTCCAAGATAGACTTGAAATCAGGTTACCATCAAATCCGAGTGCGAGATGAAGACATAAGGAA
GACAACCTTCTGA
Protein sequenceShow/hide protein sequence
MLTKMYAKRQRQQEGSELTGVSIDKRKLRPNDIMEGDEEGETSLSLEIGAGQDRIKFKKLEMPVFNGEDPNGWIYRAEHYFQMHLLNEQEKLKIAIELKDRMYNRFQCRE
HGTSCARFLAIKQEQSVNEYLQRFEELSTPLPEMAEDVLVGTFTNGLDPIIRTEEKLEAARNPQGPVPTHSSTSISSQVTATGSGVRRENNFRRWTDSELQARREKRLCY
QCDEPFSKGHRCKNKELRLYLVADDLEDTEMEDVENENGPIEVSPVVELSLNSVVGLTTLGTFKVKGTVEDREIIIMIDCGATHNFISLKLVDEQSLPTTETTSYNVIMG
SGKAVQGQGMCKNITVGLPVLTIIEDFLPLELGNLDMVLGMQWLRKQGTMTVDWKALTMTFIVGDTNVILKGDPSLTRMEISLKMLVKTWQSDDQQFLVDFRAMGIPKTD
RMLMATKSIEEIQLEMEQLQKEFEDVFIDHIIQLKEDTDPINLRPYRYLHAQKNEIKTLVNEMLTSGIIRPSINPFSSPVILVKKKDGGWRFCVDYRALNRATVPDKFPI
PMIDELLDELNGSSIFSKIDLKSGYHQIRVRDEDIRKTTF