; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G09790 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G09790
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr4:7771377..7773806
RNA-Seq ExpressionCSPI04G09790
SyntenyCSPI04G09790
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035107.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.6e-30363.86Show/hide
Query:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML
        +MKTW   DQGFL+ECRT+E   + +E+D  ++ V  E  A  +L+ +  VF WP  LPP+R  EHHI+ K  T PVNVRPYRYA+ QK E+E+LVDEML
Subjt:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML

Query:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF
        +SG+IRP+ SPYSSPVLLVRKKDGSWR CVDY+ALNN TIPDK PIP+IEELFDEL  AS FSK+DLK+GYHQIRMC EDI+KT FRTHEGHYEF+VMPF
Subjt:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF

Query:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP
         LTNAP TFQ+LMN +FKPYLRRFVLVFFDD LIYS+ +  H+QHL + L +L+  EL+ N  KCSFA+ +I YLGH IS QG+E DPEK RA+ EWP P
Subjt:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP

Query:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL
         N+R+VRGFLGLTGYYR+FV+NYG+IAAPLTQLLKKG +KW  EA+ AF KL++AMMTL VL +P+FN PFEIE++ASG+G+G VLTQ ++P+AYFS TL
Subjt:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL

Query:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL
        + +DRA+PVYEREL+AVVLAVQRWRPYLLGR+F VKTDQ+SLKFLLEQR++QP+YQKW+AKLLGYSFEVVY+PGLENKAADALSR+P + HL+ +    +
Subjt:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL

Query:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK
        ID+++IK E + D  L++I  ++E+G E     Y+++Q VL+++ RLVI   ST+IPTILHTYHDSVFGG SGFLRTYKR+ GE+YW GMK  I +YC++
Subjt:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK

Query:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------
        C++CQR+K+ +L P GL +PLEIP  IW+DISMDFIEGLPKSKG +VIFVVVDRLSKYGHFL +KHP++AK +AE F+KEVVRLHGYP SI         
Subjt:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------

Query:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD
             EL R+ GT+LNRS++YHPQSDGQTEVVN+ +ETYLRCFCGEKP EW                                      GD  T NSTLD
Subjt:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD

Query:  ELLKERDI
        + LK+RDI
Subjt:  ELLKERDI

KAA0037196.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.6e-30363.49Show/hide
Query:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML
        +MK+W   DQGFL+ECRT+E   + E   D +Q  +  E    +L+ +  VF WP  LPP+R  +HHI+ K    PVNVRPYRYA+ QK E+E+LVDEML
Subjt:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML

Query:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF
        TSG+IRP+ SPYSSPVLLVRKKDGSWR CVDY+ALNN TIPDK PIP+IEELFDEL  AS FSKIDLK+GYHQIRMC EDI+KT FRTHEGHYEF+VMPF
Subjt:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF

Query:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP
         LTNAP TFQ+LMN +FKPYLRRFVLVFFDD L+YS+ ++ H+QHL + L +L++ ELY N  KCSFA+ +I YLGH IS QG+E DPEK RA+ EWP P
Subjt:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP

Query:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL
         N+R+VRGFLGLTGYYR+FV+NYG+IAAPLTQLLKKG +KWGEE + AF KL++AMMTL VL +P+F+ PFEIE++ASG G+G VLTQ ++P+AYFS TL
Subjt:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL

Query:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL
        + +DR++PVYEREL+AVVLAVQRWRPYLLGR+F VKTDQ+SLKFLLEQR++QP+YQKW+AKLLGYSFEVVY+PGLENKAADALSRI  +  LN +    +
Subjt:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL

Query:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK
        ID+++IK E   D  L++II LIE+ +  E   Y+++Q VL+++ RLV+S  ST++PTILHTYHDSVFGG SGFLRTYKR+ GE+YW GMK  + +YC++
Subjt:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK

Query:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------
        C++CQR+K+ +L P GL +PLEIP  IW+DISMDFIEGLPKSKG +VI VVVDRLSKYGHFL +KHP+TAK +AE F+KEVVRLHGYP SI         
Subjt:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------

Query:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD
             EL R+ GT+LNRS++YHPQSDGQTEVVN+ +ETYLRCFCGEKP+EW                                      GD  T NSTLD
Subjt:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD

Query:  ELLKERDI
        + LK+RDI
Subjt:  ELLKERDI

KAA0038753.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.2e-30263.86Show/hide
Query:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML
        +MKTW   DQGFL+ECRT+E   + +E+D  +  V  E  A  +L+ +  VF WP  LPP+R  EHHI+ K  T PVNVRPYRYA+ QK E+E+LVDEML
Subjt:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML

Query:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF
        +SG+IRP+ SPYSSPVLLVRKKDGSWR CVDY+ALNN TIPDK PIP+IEELFDEL  AS FSK+DLK+GYHQIRMC EDI+KT FRTHEGHYEF+VMPF
Subjt:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF

Query:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP
         LTNAP TFQ+LMN +FKPYLRRFVLVFFDD LIYS+ +  H+QHL + L +L+  EL+ N  KCSFA+ +I YLGH IS QG+E DPEK RA+ EWP P
Subjt:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP

Query:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL
         N+R+VRGFLGLTGYYR+FV+NYG+IAAPLTQLLKKG +KW  EA+ AF KL++AMMTL VL +P+FN PFEIE++ASG+G+G VLTQ ++P+AYFS TL
Subjt:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL

Query:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL
        + +DRA+PVYEREL+AVVLAVQRWRPYLLGR+F VKTDQ+SLKFLLEQR++QP+YQKW+AKLLGYSFEVVY+PGLENKAADALSR+P + HL+ +    +
Subjt:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL

Query:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK
        ID+++IK E + D  L++I  ++E+G E     Y+++Q VL+++ RLVI   ST+IPTILHTYHDSVFGG SGFLRTYKR+ GE+YW GMK  I +YC++
Subjt:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK

Query:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------
        C++CQR+K+ +L P GL +PLEIP  IW+DISMDFIEGLPKSKG +VIFVVVDRLSKYGHFL +KHP++AK +AE F+KEVVRLHGYP SI         
Subjt:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------

Query:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD
             EL R+ GT+LNRS++YHPQSDGQTEVVN+ +ETYLRCFCGEKP EW                                      GD  T NSTLD
Subjt:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD

Query:  ELLKERDI
        + LK+RDI
Subjt:  ELLKERDI

TYK02563.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.3e-30363.99Show/hide
Query:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML
        +MKTW   DQGFL+ECRT+E   + +E+D  +  V  E  A  +L+ +  VF WP  LPP+R  EHHI+ K  T PVNVRPYRYA+ QK E+E+LVDEML
Subjt:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML

Query:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF
        +SG+IRP+ SPYSSPVLLVRKKDGSWR CVDY+ALNN TIPDK PIP+IEELFDEL  AS FSK+DLK+GYHQIRMC EDI+KT FRTHEGHYEF+VMPF
Subjt:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF

Query:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP
         LTNAP TFQ+LMN +FKPYLRRFVLVFFDD LIYS+ +  H+QHL + L +L+  EL+ N  KCSFA+ +I YLGH IS QG+E DPEK RA+ EWP P
Subjt:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP

Query:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL
         N+R+VRGFLGLTGYYR+FV+NYG+IAAPLTQLLKKG +KW  EA+ AF KL++AMMTL VL +P+FN PFEIE++ASG+G+G VLTQ ++P+AYFS TL
Subjt:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL

Query:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL
        + +DRA+PVYEREL+AVVLAVQRWRPYLLGR+F VKTDQ+SLKFLLEQRM+QP+YQKW+AKLLGYSFEVVY+PGLENKAADALSR+P + HL+ +    +
Subjt:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL

Query:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK
        ID+++IK E + D  L++I  ++E+G E     Y+++Q VL+++ RLVI   ST+IPTILHTYHDSVFGG SGFLRTYKR+ GE+YW GMK  I +YC++
Subjt:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK

Query:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------
        C++CQR+K+ +L P GL +PLEIP  IW+DISMDFIEGLPKSKG +VIFVVVDRLSKYGHFL +KHP++AK +AE F+KEVVRLHGYP SI         
Subjt:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------

Query:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD
             EL R+ GT+LNRS++YHPQSDGQTEVVN+ +ETYLRCFCGEKP EW                                      GD  T NSTLD
Subjt:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD

Query:  ELLKERDI
        + LK+RDI
Subjt:  ELLKERDI

TYK13876.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.6e-30363.49Show/hide
Query:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML
        +MK+W   DQGFL+ECRT+E   + E   D +Q  +  E    +L+ +  VF WP  LPP+R  +HHI+ K    PVNVRPYRYA+ QK E+E+LVDEML
Subjt:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML

Query:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF
        TSG+IRP+ SPYSSPVLLVRKKDGSWR CVDY+ALNN TIPDK PIP+IEELFDEL  AS FSKIDLK+GYHQIRMC EDI+KT FRTHEGHYEF+VMPF
Subjt:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF

Query:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP
         LTNAP TFQ+LMN +FKPYLRRFVLVFFDD L+YS+ ++ H+QHL + L +L++ ELY N  KCSFA+ +I YLGH IS QG+E DPEK RA+ EWP P
Subjt:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP

Query:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL
         N+R+VRGFLGLTGYYR+FV+NYG+IAAPLTQLLKKG +KWGEE + AF KL++AMMTL VL +P+F+ PFEIE++ASG G+G VLTQ ++P+AYFS TL
Subjt:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL

Query:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL
        + +DR++PVYEREL+AVVLAVQRWRPYLLGR+F VKTDQ+SLKFLLEQR++QP+YQKW+AKLLGYSFEVVY+PGLENKAADALSRI  +  LN +    +
Subjt:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL

Query:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK
        ID+++IK E   D  L++II LIE+ +  E   Y+++Q VL+++ RLV+S  ST++PTILHTYHDSVFGG SGFLRTYKR+ GE+YW GMK  + +YC++
Subjt:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK

Query:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------
        C++CQR+K+ +L P GL +PLEIP  IW+DISMDFIEGLPKSKG +VI VVVDRLSKYGHFL +KHP+TAK +AE F+KEVVRLHGYP SI         
Subjt:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------

Query:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD
             EL R+ GT+LNRS++YHPQSDGQTEVVN+ +ETYLRCFCGEKP+EW                                      GD  T NSTLD
Subjt:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD

Query:  ELLKERDI
        + LK+RDI
Subjt:  ELLKERDI

TrEMBL top hitse value%identityAlignment
A0A5A7SS61 Ty3/gypsy retrotransposon protein6.0e-30363.86Show/hide
Query:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML
        +MKTW   DQGFL+ECRT+E   + +E+D  +  V  E  A  +L+ +  VF WP  LPP+R  EHHI+ K  T PVNVRPYRYA+ QK E+E+LVDEML
Subjt:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML

Query:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF
        +SG+IRP+ SPYSSPVLLVRKKDGSWR CVDY+ALNN TIPDK PIP+IEELFDEL  AS FSK+DLK+GYHQIRMC EDI+KT FRTHEGHYEF+VMPF
Subjt:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF

Query:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP
         LTNAP TFQ+LMN +FKPYLRRFVLVFFDD LIYS+ +  H+QHL + L +L+  EL+ N  KCSFA+ +I YLGH IS QG+E DPEK RA+ EWP P
Subjt:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP

Query:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL
         N+R+VRGFLGLTGYYR+FV+NYG+IAAPLTQLLKKG +KW  EA+ AF KL++AMMTL VL +P+FN PFEIE++ASG+G+G VLTQ ++P+AYFS TL
Subjt:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL

Query:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL
        + +DRA+PVYEREL+AVVLAVQRWRPYLLGR+F VKTDQ+SLKFLLEQR++QP+YQKW+AKLLGYSFEVVY+PGLENKAADALSR+P + HL+ +    +
Subjt:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL

Query:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK
        ID+++IK E + D  L++I  ++E+G E     Y+++Q VL+++ RLVI   ST+IPTILHTYHDSVFGG SGFLRTYKR+ GE+YW GMK  I +YC++
Subjt:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK

Query:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------
        C++CQR+K+ +L P GL +PLEIP  IW+DISMDFIEGLPKSKG +VIFVVVDRLSKYGHFL +KHP++AK +AE F+KEVVRLHGYP SI         
Subjt:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------

Query:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD
             EL R+ GT+LNRS++YHPQSDGQTEVVN+ +ETYLRCFCGEKP EW                                      GD  T NSTLD
Subjt:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD

Query:  ELLKERDI
        + LK+RDI
Subjt:  ELLKERDI

A0A5A7T0J9 Ty3/gypsy retrotransposon protein2.7e-30363.86Show/hide
Query:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML
        +MKTW   DQGFL+ECRT+E   + +E+D  ++ V  E  A  +L+ +  VF WP  LPP+R  EHHI+ K  T PVNVRPYRYA+ QK E+E+LVDEML
Subjt:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML

Query:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF
        +SG+IRP+ SPYSSPVLLVRKKDGSWR CVDY+ALNN TIPDK PIP+IEELFDEL  AS FSK+DLK+GYHQIRMC EDI+KT FRTHEGHYEF+VMPF
Subjt:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF

Query:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP
         LTNAP TFQ+LMN +FKPYLRRFVLVFFDD LIYS+ +  H+QHL + L +L+  EL+ N  KCSFA+ +I YLGH IS QG+E DPEK RA+ EWP P
Subjt:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP

Query:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL
         N+R+VRGFLGLTGYYR+FV+NYG+IAAPLTQLLKKG +KW  EA+ AF KL++AMMTL VL +P+FN PFEIE++ASG+G+G VLTQ ++P+AYFS TL
Subjt:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL

Query:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL
        + +DRA+PVYEREL+AVVLAVQRWRPYLLGR+F VKTDQ+SLKFLLEQR++QP+YQKW+AKLLGYSFEVVY+PGLENKAADALSR+P + HL+ +    +
Subjt:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL

Query:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK
        ID+++IK E + D  L++I  ++E+G E     Y+++Q VL+++ RLVI   ST+IPTILHTYHDSVFGG SGFLRTYKR+ GE+YW GMK  I +YC++
Subjt:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK

Query:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------
        C++CQR+K+ +L P GL +PLEIP  IW+DISMDFIEGLPKSKG +VIFVVVDRLSKYGHFL +KHP++AK +AE F+KEVVRLHGYP SI         
Subjt:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------

Query:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD
             EL R+ GT+LNRS++YHPQSDGQTEVVN+ +ETYLRCFCGEKP EW                                      GD  T NSTLD
Subjt:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD

Query:  ELLKERDI
        + LK+RDI
Subjt:  ELLKERDI

A0A5A7T4Y0 Ty3/gypsy retrotransposon protein2.7e-30363.49Show/hide
Query:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML
        +MK+W   DQGFL+ECRT+E   + E   D +Q  +  E    +L+ +  VF WP  LPP+R  +HHI+ K    PVNVRPYRYA+ QK E+E+LVDEML
Subjt:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML

Query:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF
        TSG+IRP+ SPYSSPVLLVRKKDGSWR CVDY+ALNN TIPDK PIP+IEELFDEL  AS FSKIDLK+GYHQIRMC EDI+KT FRTHEGHYEF+VMPF
Subjt:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF

Query:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP
         LTNAP TFQ+LMN +FKPYLRRFVLVFFDD L+YS+ ++ H+QHL + L +L++ ELY N  KCSFA+ +I YLGH IS QG+E DPEK RA+ EWP P
Subjt:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP

Query:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL
         N+R+VRGFLGLTGYYR+FV+NYG+IAAPLTQLLKKG +KWGEE + AF KL++AMMTL VL +P+F+ PFEIE++ASG G+G VLTQ ++P+AYFS TL
Subjt:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL

Query:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL
        + +DR++PVYEREL+AVVLAVQRWRPYLLGR+F VKTDQ+SLKFLLEQR++QP+YQKW+AKLLGYSFEVVY+PGLENKAADALSRI  +  LN +    +
Subjt:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL

Query:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK
        ID+++IK E   D  L++II LIE+ +  E   Y+++Q VL+++ RLV+S  ST++PTILHTYHDSVFGG SGFLRTYKR+ GE+YW GMK  + +YC++
Subjt:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK

Query:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------
        C++CQR+K+ +L P GL +PLEIP  IW+DISMDFIEGLPKSKG +VI VVVDRLSKYGHFL +KHP+TAK +AE F+KEVVRLHGYP SI         
Subjt:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------

Query:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD
             EL R+ GT+LNRS++YHPQSDGQTEVVN+ +ETYLRCFCGEKP+EW                                      GD  T NSTLD
Subjt:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD

Query:  ELLKERDI
        + LK+RDI
Subjt:  ELLKERDI

A0A5D3BSA8 Ty3/gypsy retrotransposon protein2.1e-30363.99Show/hide
Query:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML
        +MKTW   DQGFL+ECRT+E   + +E+D  +  V  E  A  +L+ +  VF WP  LPP+R  EHHI+ K  T PVNVRPYRYA+ QK E+E+LVDEML
Subjt:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML

Query:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF
        +SG+IRP+ SPYSSPVLLVRKKDGSWR CVDY+ALNN TIPDK PIP+IEELFDEL  AS FSK+DLK+GYHQIRMC EDI+KT FRTHEGHYEF+VMPF
Subjt:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF

Query:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP
         LTNAP TFQ+LMN +FKPYLRRFVLVFFDD LIYS+ +  H+QHL + L +L+  EL+ N  KCSFA+ +I YLGH IS QG+E DPEK RA+ EWP P
Subjt:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP

Query:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL
         N+R+VRGFLGLTGYYR+FV+NYG+IAAPLTQLLKKG +KW  EA+ AF KL++AMMTL VL +P+FN PFEIE++ASG+G+G VLTQ ++P+AYFS TL
Subjt:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL

Query:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL
        + +DRA+PVYEREL+AVVLAVQRWRPYLLGR+F VKTDQ+SLKFLLEQRM+QP+YQKW+AKLLGYSFEVVY+PGLENKAADALSR+P + HL+ +    +
Subjt:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL

Query:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK
        ID+++IK E + D  L++I  ++E+G E     Y+++Q VL+++ RLVI   ST+IPTILHTYHDSVFGG SGFLRTYKR+ GE+YW GMK  I +YC++
Subjt:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK

Query:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------
        C++CQR+K+ +L P GL +PLEIP  IW+DISMDFIEGLPKSKG +VIFVVVDRLSKYGHFL +KHP++AK +AE F+KEVVRLHGYP SI         
Subjt:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------

Query:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD
             EL R+ GT+LNRS++YHPQSDGQTEVVN+ +ETYLRCFCGEKP EW                                      GD  T NSTLD
Subjt:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD

Query:  ELLKERDI
        + LK+RDI
Subjt:  ELLKERDI

A0A5D3CU05 Ty3/gypsy retrotransposon protein2.7e-30363.49Show/hide
Query:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML
        +MK+W   DQGFL+ECRT+E   + E   D +Q  +  E    +L+ +  VF WP  LPP+R  +HHI+ K    PVNVRPYRYA+ QK E+E+LVDEML
Subjt:  MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEML

Query:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF
        TSG+IRP+ SPYSSPVLLVRKKDGSWR CVDY+ALNN TIPDK PIP+IEELFDEL  AS FSKIDLK+GYHQIRMC EDI+KT FRTHEGHYEF+VMPF
Subjt:  TSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPF

Query:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP
         LTNAP TFQ+LMN +FKPYLRRFVLVFFDD L+YS+ ++ H+QHL + L +L++ ELY N  KCSFA+ +I YLGH IS QG+E DPEK RA+ EWP P
Subjt:  KLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYAN-RKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIP

Query:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL
         N+R+VRGFLGLTGYYR+FV+NYG+IAAPLTQLLKKG +KWGEE + AF KL++AMMTL VL +P+F+ PFEIE++ASG G+G VLTQ ++P+AYFS TL
Subjt:  INIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTL

Query:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL
        + +DR++PVYEREL+AVVLAVQRWRPYLLGR+F VKTDQ+SLKFLLEQR++QP+YQKW+AKLLGYSFEVVY+PGLENKAADALSRI  +  LN +    +
Subjt:  ATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTL

Query:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK
        ID+++IK E   D  L++II LIE+ +  E   Y+++Q VL+++ RLV+S  ST++PTILHTYHDSVFGG SGFLRTYKR+ GE+YW GMK  + +YC++
Subjt:  IDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDK

Query:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------
        C++CQR+K+ +L P GL +PLEIP  IW+DISMDFIEGLPKSKG +VI VVVDRLSKYGHFL +KHP+TAK +AE F+KEVVRLHGYP SI         
Subjt:  CSVCQRSKTLSLLP-GLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI---------

Query:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD
             EL R+ GT+LNRS++YHPQSDGQTEVVN+ +ETYLRCFCGEKP+EW                                      GD  T NSTLD
Subjt:  -----ELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEW--------------------------------------GDHSTSNSTLD

Query:  ELLKERDI
        + LK+RDI
Subjt:  ELLKERDI

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein8.3e-9230Show/hide
Query:  EKLP-PRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEMLTSGVIRPNSPYSS-PVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFD
        EKLP P +  E  +   ++   + +R Y     +   +   +++ L SG+IR +   ++ PV+ V KK+G+ R+ VDYK LN +  P+  P+P+IE+L  
Subjt:  EKLP-PRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEMLTSGVIRPNSPYSS-PVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFD

Query:  ELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPFKLTNAPT-FQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLR
        ++  ++ F+K+DLKS YH IR+ + D  K  FR   G +E++VMP+ ++ AP  FQ  +N+I        V+ + DD LI+SK    H++H+   LQ L+
Subjt:  ELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPFKLTNAPT-FQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLR

Query:  KNELYANR-KCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIPINIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKG-GFKWGEEAQEAFLKLQ
           L  N+ KC F Q ++ ++G+ IS +G     E    + +W  P N +++R FLG   Y RKF+     +  PL  LLKK   +KW     +A   ++
Subjt:  KNELYANR-KCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIPINIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKG-GFKWGEEAQEAFLKLQ

Query:  QAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSK-----RPIAYFSHTLATKDRAKPVYERELMAVVLAVQRWRPYLLG--RRFLVKTDQQSL--KF
        Q +++  VL   +F+    +ET+AS + +G VL+Q        P+ Y+S  ++       V ++E++A++ +++ WR YL      F + TD ++L  + 
Subjt:  QAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSK-----RPIAYFSHTLATKDRAKPVYERELMAVVLAVQRWRPYLLG--RRFLVKTDQQSL--KF

Query:  LLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSR-------IPTSTHLNSL--MVQTLIDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYS
          E         +W   L  ++FE+ Y+PG  N  ADALSR       IP  +  NS+  + Q  I      + V E  +  K++ L+   ++  E+   
Subjt:  LLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSR-------IPTSTHLNSL--MVQTLIDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYS

Query:  IRQ-VVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDKCSVCQRSKTLSLLPGLPV-PLEIPSKIWNDISMD
        ++  +++  +D++++  ++ +  TI+  YH+       G       I     W G++  I++Y   C  CQ +K+ +  P  P+ P+    + W  +SMD
Subjt:  IRQ-VVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDKCSVCQRSKTLSLLPGLPV-PLEIPSKIWNDISMD

Query:  FIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI--------------ELSRMVGTRLNRSTAYHPQSDGQTEVVNR
        FI  LP+S G   +FVVVDR SK    +P     TA+  A +F + V+   G P  I              + +      +  S  Y PQ+DGQTE  N+
Subjt:  FIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI--------------ELSRMVGTRLNRSTAYHPQSDGQTEVVNR

Query:  GLETYLRCFCGEKPKEWGDH
         +E  LRC C   P  W DH
Subjt:  GLETYLRCFCGEKPKEWGDH

P0CT35 Transposon Tf2-2 polyprotein8.3e-9230Show/hide
Query:  EKLP-PRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEMLTSGVIRPNSPYSS-PVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFD
        EKLP P +  E  +   ++   + +R Y     +   +   +++ L SG+IR +   ++ PV+ V KK+G+ R+ VDYK LN +  P+  P+P+IE+L  
Subjt:  EKLP-PRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEMLTSGVIRPNSPYSS-PVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFD

Query:  ELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPFKLTNAPT-FQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLR
        ++  ++ F+K+DLKS YH IR+ + D  K  FR   G +E++VMP+ ++ AP  FQ  +N+I        V+ + DD LI+SK    H++H+   LQ L+
Subjt:  ELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPFKLTNAPT-FQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLR

Query:  KNELYANR-KCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIPINIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKG-GFKWGEEAQEAFLKLQ
           L  N+ KC F Q ++ ++G+ IS +G     E    + +W  P N +++R FLG   Y RKF+     +  PL  LLKK   +KW     +A   ++
Subjt:  KNELYANR-KCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIPINIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKG-GFKWGEEAQEAFLKLQ

Query:  QAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSK-----RPIAYFSHTLATKDRAKPVYERELMAVVLAVQRWRPYLLG--RRFLVKTDQQSL--KF
        Q +++  VL   +F+    +ET+AS + +G VL+Q        P+ Y+S  ++       V ++E++A++ +++ WR YL      F + TD ++L  + 
Subjt:  QAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSK-----RPIAYFSHTLATKDRAKPVYERELMAVVLAVQRWRPYLLG--RRFLVKTDQQSL--KF

Query:  LLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSR-------IPTSTHLNSL--MVQTLIDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYS
          E         +W   L  ++FE+ Y+PG  N  ADALSR       IP  +  NS+  + Q  I      + V E  +  K++ L+   ++  E+   
Subjt:  LLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSR-------IPTSTHLNSL--MVQTLIDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYS

Query:  IRQ-VVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDKCSVCQRSKTLSLLPGLPV-PLEIPSKIWNDISMD
        ++  +++  +D++++  ++ +  TI+  YH+       G       I     W G++  I++Y   C  CQ +K+ +  P  P+ P+    + W  +SMD
Subjt:  IRQ-VVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDKCSVCQRSKTLSLLPGLPV-PLEIPSKIWNDISMD

Query:  FIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI--------------ELSRMVGTRLNRSTAYHPQSDGQTEVVNR
        FI  LP+S G   +FVVVDR SK    +P     TA+  A +F + V+   G P  I              + +      +  S  Y PQ+DGQTE  N+
Subjt:  FIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI--------------ELSRMVGTRLNRSTAYHPQSDGQTEVVNR

Query:  GLETYLRCFCGEKPKEWGDH
         +E  LRC C   P  W DH
Subjt:  GLETYLRCFCGEKPKEWGDH

P0CT41 Transposon Tf2-12 polyprotein8.3e-9230Show/hide
Query:  EKLP-PRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEMLTSGVIRPNSPYSS-PVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFD
        EKLP P +  E  +   ++   + +R Y     +   +   +++ L SG+IR +   ++ PV+ V KK+G+ R+ VDYK LN +  P+  P+P+IE+L  
Subjt:  EKLP-PRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEMLTSGVIRPNSPYSS-PVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFD

Query:  ELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPFKLTNAPT-FQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLR
        ++  ++ F+K+DLKS YH IR+ + D  K  FR   G +E++VMP+ ++ AP  FQ  +N+I        V+ + DD LI+SK    H++H+   LQ L+
Subjt:  ELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPFKLTNAPT-FQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLR

Query:  KNELYANR-KCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIPINIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKG-GFKWGEEAQEAFLKLQ
           L  N+ KC F Q ++ ++G+ IS +G     E    + +W  P N +++R FLG   Y RKF+     +  PL  LLKK   +KW     +A   ++
Subjt:  KNELYANR-KCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIPINIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKG-GFKWGEEAQEAFLKLQ

Query:  QAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSK-----RPIAYFSHTLATKDRAKPVYERELMAVVLAVQRWRPYLLG--RRFLVKTDQQSL--KF
        Q +++  VL   +F+    +ET+AS + +G VL+Q        P+ Y+S  ++       V ++E++A++ +++ WR YL      F + TD ++L  + 
Subjt:  QAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSK-----RPIAYFSHTLATKDRAKPVYERELMAVVLAVQRWRPYLLG--RRFLVKTDQQSL--KF

Query:  LLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSR-------IPTSTHLNSL--MVQTLIDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYS
          E         +W   L  ++FE+ Y+PG  N  ADALSR       IP  +  NS+  + Q  I      + V E  +  K++ L+   ++  E+   
Subjt:  LLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSR-------IPTSTHLNSL--MVQTLIDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYS

Query:  IRQ-VVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDKCSVCQRSKTLSLLPGLPV-PLEIPSKIWNDISMD
        ++  +++  +D++++  ++ +  TI+  YH+       G       I     W G++  I++Y   C  CQ +K+ +  P  P+ P+    + W  +SMD
Subjt:  IRQ-VVLRYEDRLVISKNSTIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDKCSVCQRSKTLSLLPGLPV-PLEIPSKIWNDISMD

Query:  FIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI--------------ELSRMVGTRLNRSTAYHPQSDGQTEVVNR
        FI  LP+S G   +FVVVDR SK    +P     TA+  A +F + V+   G P  I              + +      +  S  Y PQ+DGQTE  N+
Subjt:  FIEGLPKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI--------------ELSRMVGTRLNRSTAYHPQSDGQTEVVNR

Query:  GLETYLRCFCGEKPKEWGDH
         +E  LRC C   P  W DH
Subjt:  GLETYLRCFCGEKPKEWGDH

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein3.0e-10234.39Show/hide
Query:  LPPRR------DTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEMLTSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEE
        LPPR         +H I  K   +   ++PY    + + EI K+V ++L +  I P+ SP SSPV+LV KKDG++R+CVDY+ LN  TI D  P+P I+ 
Subjt:  LPPRR------DTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEMLTSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEE

Query:  LFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPFKLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQ
        L   +  A  F+ +DL SGYHQI M  +D  KT F T  G YE+ VMPF L NAP TF   M   F+    RFV V+ DD LI+S+  + H +HL   L+
Subjt:  LFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPFKLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQ

Query:  VLR-KNELYANRKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIPINIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLK
         L+ +N +   +KC FA  + ++LG+ I  Q +     K  AIR++P P  ++Q + FLG+  YYR+F+ N   IA P+ QL      +W E+  +A  K
Subjt:  VLR-KNELYANRKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIPINIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLK

Query:  LQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRP------IAYFSHTLATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFL
        L+ A+    VL   N  A + + T+AS  GIG VL +          + YFS +L +  +  P  E EL+ ++ A+  +R  L G+ F ++TD  SL  L
Subjt:  LQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRP------IAYFSHTLATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFL

Query:  LEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSR-IPTSTHLNSLMVQT----------------LIDLQVIKREVEEDDHLKKIITLIEKGE-
          +       Q+W+  L  Y F + Y  G +N  ADA+SR I T T   S  + T                LI ++ + +     + +    +  +K E 
Subjt:  LEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSR-IPTSTHLNSLMVQT----------------LIDLQVIKREVEEDDHLKKIITLIEKGE-

Query:  -ETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHD-SVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDKCSVCQRSKT-LSLLPGLPVPLEIPS
         ET  + YS+   ++ Y+DRLV+         ++  YHD ++FGG  G   T  +I+   YW  ++H I +Y   C  CQ  K+    L GL  PL I  
Subjt:  -ETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHD-SVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDKCSVCQRSKT-LSLLPGLPVPLEIPS

Query:  KIWNDISMDFIEGL-PKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI--------------ELSRMVGTRLNRSTAYHPQ
          W DISMDF+ GL P S    +I VVVDR SK  HF+  +    A  + +L  + +   HG+P +I              EL++ +G +   S+A HPQ
Subjt:  KIWNDISMDFIEGL-PKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI--------------ELSRMVGTRLNRSTAYHPQ

Query:  SDGQTEVVNRGLETYLRCFCGEKPKEW
        +DGQ+E   + L   LR +     + W
Subjt:  SDGQTEVVNRGLETYLRCFCGEKPKEW

Q99315 Transposon Ty3-G Gag-Pol polyprotein3.0e-10234.25Show/hide
Query:  LPPRR------DTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEMLTSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEE
        LPPR         +H I  K   +   ++PY    + + EI K+V ++L +  I P+ SP SSPV+LV KKDG++R+CVDY+ LN  TI D  P+P I+ 
Subjt:  LPPRR------DTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEMLTSGVIRPN-SPYSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEE

Query:  LFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPFKLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQ
        L   +  A  F+ +DL SGYHQI M  +D  KT F T  G YE+ VMPF L NAP TF   M   F+    RFV V+ DD LI+S+  + H +HL   L+
Subjt:  LFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPFKLTNAP-TFQSLMNSIFKPYLRRFVLVFFDDTLIYSKDLKTHLQHLGLTLQ

Query:  VLR-KNELYANRKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIPINIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLK
         L+ +N +   +KC FA  + ++LG+ I  Q +     K  AIR++P P  ++Q + FLG+  YYR+F+ N   IA P+ QL      +W E+  +A  K
Subjt:  VLR-KNELYANRKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIPINIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFKWGEEAQEAFLK

Query:  LQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRP------IAYFSHTLATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFL
        L+ A+    VL   N  A + + T+AS  GIG VL +          + YFS +L +  +  P  E EL+ ++ A+  +R  L G+ F ++TD  SL  L
Subjt:  LQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRP------IAYFSHTLATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLKFL

Query:  LEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSR-IPTSTHLNSLMVQT----------------LIDLQVIKREVEEDDHLKKIITLIEKGE-
          +       Q+W+  L  Y F + Y  G +N  ADA+SR + T T   S  + T                LI ++ + +     + +    +  +K E 
Subjt:  LEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSR-IPTSTHLNSLMVQT----------------LIDLQVIKREVEEDDHLKKIITLIEKGE-

Query:  -ETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHD-SVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDKCSVCQRSKT-LSLLPGLPVPLEIPS
         ET  + YS+   ++ Y+DRLV+         ++  YHD ++FGG  G   T  +I+   YW  ++H I +Y   C  CQ  K+    L GL  PL I  
Subjt:  -ETEEQKYSIRQVVLRYEDRLVISKNSTIIPTILHTYHD-SVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDKCSVCQRSKT-LSLLPGLPVPLEIPS

Query:  KIWNDISMDFIEGL-PKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI--------------ELSRMVGTRLNRSTAYHPQ
          W DISMDF+ GL P S    +I VVVDR SK  HF+  +    A  + +L  + +   HG+P +I              EL++ +G +   S+A HPQ
Subjt:  KIWNDISMDFIEGL-PKSKGCEVIFVVVDRLSKYGHFLPVKHPYTAKSIAELFIKEVVRLHGYPSSI--------------ELSRMVGTRLNRSTAYHPQ

Query:  SDGQTEVVNRGLETYLRCFCGEKPKEW
        +DGQ+E   + L   LR +     + W
Subjt:  SDGQTEVVNRGLETYLRCFCGEKPKEW

Arabidopsis top hitse value%identityAlignment
ATMG00850.1 DNA/RNA polymerases superfamily protein2.3e-0451.28Show/hide
Query:  QKAEIEKLVDEMLTSGVIRPN-SPYSSPVLLVRKKDGSW
        ++  ++  + EML + +I+P+ SPYSSPVLLV+KKDG W
Subjt:  QKAEIEKLVDEMLTSGVIRPN-SPYSSPVLLVRKKDGSW

ATMG00860.1 DNA/RNA polymerases superfamily protein1.6e-3454.96Show/hide
Query:  LQHLGLTLQVLRKNELYANR-KCSFAQGKIDYLG--HIISSQGVEVDPEKFRAIREWPIPINIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFK
        + HLG+ LQ+  +++ YANR KC+F Q +I YLG  HIIS +GV  DP K  A+  WP P N  ++RGFLGLTGYYR+FV+NYG I  PLT+LLKK   K
Subjt:  LQHLGLTLQVLRKNELYANR-KCSFAQGKIDYLG--HIISSQGVEVDPEKFRAIREWPIPINIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQLLKKGGFK

Query:  WGEEAQEAFLKLQQAMMTLLVLALPNFNAPF
        W E A  AF  L+ A+ TL VLALP+   PF
Subjt:  WGEEAQEAFLKLQQAMMTLLVLALPNFNAPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAAGACATGGGATAACTCCGATCAAGGATTCTTGATCGAATGCCGTACCATGGAAGGGAGAATAATAACAGAAGAAAGAGATGACCTTCAACAAACCGTGGTTGG
AGAAGAAGCAGCAGTCGTAGTCCTAAGGGATTACGAGAACGTGTTCATGTGGCCTGAGAAACTACCTCCAAGGAGAGATACTGAGCATCATATACATCAGAAGAAGGATA
CCAAACCAGTTAATGTCAGACCCTATCGCTATGCTTACCAACAAAAGGCCGAGATAGAAAAGCTAGTGGACGAAATGTTAACATCCGGAGTAATACGGCCCAACAGTCCA
TATTCCAGCCCAGTATTGTTAGTGAGGAAGAAGGATGGGAGCTGGCGTATTTGTGTAGACTACAAAGCACTGAATAATTTTACCATCCCCGACAAGCTCCCAATCCCCAT
CATTGAAGAACTGTTTGACGAGCTTAATGAAGCCTCATATTTCTCAAAGATTGATCTCAAGTCAGGGTATCATCAGATTCGTATGTGCGAAGAAGACATCAAGAAAACAG
TCTTCCGAACCCATGAGGGCCATTATGAATTTATGGTAATGCCCTTCAAGCTAACCAATGCACCAACCTTCCAATCGTTGATGAATTCCATATTCAAACCATACCTAAGG
AGATTTGTGTTAGTATTCTTTGACGACACCCTAATTTACAGCAAGGACCTCAAAACACACCTCCAACATCTCGGTTTAACTCTACAAGTACTTCGGAAGAATGAACTATA
CGCTAACCGGAAATGCAGTTTTGCCCAAGGAAAAATTGATTATTTGGGGCATATTATATCAAGTCAAGGAGTAGAAGTGGATCCCGAGAAGTTTCGAGCTATCAGGGAGT
GGCCAATCCCCATTAACATACGGCAAGTCAGAGGATTTTTGGGTTTGACCGGCTATTATCGAAAGTTTGTACAAAACTATGGTTCAATAGCTGCACCCTTGACTCAACTG
TTGAAGAAAGGTGGATTTAAATGGGGAGAAGAAGCGCAAGAGGCCTTTCTAAAGCTACAGCAAGCGATGATGACACTCCTTGTGCTAGCATTACCTAATTTTAATGCCCC
TTTTGAGATAGAAACGAACGCATCAGGAATTGGAATAGGTGTTGTACTCACGCAATCCAAACGACCCATCGCTTATTTCAGCCATACGTTAGCAACCAAAGACCGAGCTA
AACCTGTATACGAAAGAGAGTTAATGGCAGTAGTACTAGCAGTCCAAAGGTGGAGGCCATACCTGTTGGGTAGAAGGTTTCTGGTAAAAACAGACCAGCAATCCTTGAAG
TTTCTCTTGGAGCAGAGGATGATTCAACCAGAGTACCAAAAATGGATAGCTAAACTTTTAGGCTATTCATTCGAAGTTGTGTATAAGCCCGGTCTGGAAAACAAGGCAGC
AGATGCTCTGTCCCGAATACCAACTTCCACTCACTTGAACAGTCTAATGGTCCAAACCCTAATAGACTTACAAGTCATCAAAAGGGAAGTAGAGGAGGATGACCACCTGA
AGAAAATTATAACTCTAATAGAAAAAGGAGAGGAAACAGAGGAGCAAAAGTATTCCATCAGACAAGTAGTGCTTAGATATGAAGATCGACTCGTCATCTCAAAAAACTCC
ACAATAATCCCCACTATTCTCCACACCTATCATGACTCTGTGTTTGGAGGACAATCGGGATTCCTACGCACTTATAAAAGAATAGCAGGAGAATTGTATTGGTTAGGGAT
GAAACATATGATAAAGAAATACTGTGACAAATGTTCCGTATGCCAAAGGAGTAAGACGTTATCATTATTACCCGGATTGCCAGTACCATTGGAGATTCCCAGCAAGATAT
GGAATGATATTTCGATGGACTTCATCGAAGGTTTACCTAAATCAAAAGGATGTGAAGTAATTTTTGTGGTAGTGGACCGATTGAGCAAATACGGACACTTCTTACCAGTC
AAGCATCCTTACACAGCCAAGAGCATTGCAGAATTATTCATCAAAGAAGTAGTTCGGCTCCACGGATATCCAAGCTCTATAGAACTATCCAGAATGGTTGGTACGAGATT
AAACCGAAGCACAGCATATCACCCTCAATCTGATGGCCAAACCGAGGTGGTCAACAGAGGATTAGAAACGTATTTACGCTGTTTTTGCGGAGAGAAACCAAAGGAATGGG
GAGATCATAGCACTTCAAACTCCACCCTGGATGAGCTATTAAAAGAAAGAGATATTGCTCCCTTGGAGTACTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGAAGACATGGGATAACTCCGATCAAGGATTCTTGATCGAATGCCGTACCATGGAAGGGAGAATAATAACAGAAGAAAGAGATGACCTTCAACAAACCGTGGTTGG
AGAAGAAGCAGCAGTCGTAGTCCTAAGGGATTACGAGAACGTGTTCATGTGGCCTGAGAAACTACCTCCAAGGAGAGATACTGAGCATCATATACATCAGAAGAAGGATA
CCAAACCAGTTAATGTCAGACCCTATCGCTATGCTTACCAACAAAAGGCCGAGATAGAAAAGCTAGTGGACGAAATGTTAACATCCGGAGTAATACGGCCCAACAGTCCA
TATTCCAGCCCAGTATTGTTAGTGAGGAAGAAGGATGGGAGCTGGCGTATTTGTGTAGACTACAAAGCACTGAATAATTTTACCATCCCCGACAAGCTCCCAATCCCCAT
CATTGAAGAACTGTTTGACGAGCTTAATGAAGCCTCATATTTCTCAAAGATTGATCTCAAGTCAGGGTATCATCAGATTCGTATGTGCGAAGAAGACATCAAGAAAACAG
TCTTCCGAACCCATGAGGGCCATTATGAATTTATGGTAATGCCCTTCAAGCTAACCAATGCACCAACCTTCCAATCGTTGATGAATTCCATATTCAAACCATACCTAAGG
AGATTTGTGTTAGTATTCTTTGACGACACCCTAATTTACAGCAAGGACCTCAAAACACACCTCCAACATCTCGGTTTAACTCTACAAGTACTTCGGAAGAATGAACTATA
CGCTAACCGGAAATGCAGTTTTGCCCAAGGAAAAATTGATTATTTGGGGCATATTATATCAAGTCAAGGAGTAGAAGTGGATCCCGAGAAGTTTCGAGCTATCAGGGAGT
GGCCAATCCCCATTAACATACGGCAAGTCAGAGGATTTTTGGGTTTGACCGGCTATTATCGAAAGTTTGTACAAAACTATGGTTCAATAGCTGCACCCTTGACTCAACTG
TTGAAGAAAGGTGGATTTAAATGGGGAGAAGAAGCGCAAGAGGCCTTTCTAAAGCTACAGCAAGCGATGATGACACTCCTTGTGCTAGCATTACCTAATTTTAATGCCCC
TTTTGAGATAGAAACGAACGCATCAGGAATTGGAATAGGTGTTGTACTCACGCAATCCAAACGACCCATCGCTTATTTCAGCCATACGTTAGCAACCAAAGACCGAGCTA
AACCTGTATACGAAAGAGAGTTAATGGCAGTAGTACTAGCAGTCCAAAGGTGGAGGCCATACCTGTTGGGTAGAAGGTTTCTGGTAAAAACAGACCAGCAATCCTTGAAG
TTTCTCTTGGAGCAGAGGATGATTCAACCAGAGTACCAAAAATGGATAGCTAAACTTTTAGGCTATTCATTCGAAGTTGTGTATAAGCCCGGTCTGGAAAACAAGGCAGC
AGATGCTCTGTCCCGAATACCAACTTCCACTCACTTGAACAGTCTAATGGTCCAAACCCTAATAGACTTACAAGTCATCAAAAGGGAAGTAGAGGAGGATGACCACCTGA
AGAAAATTATAACTCTAATAGAAAAAGGAGAGGAAACAGAGGAGCAAAAGTATTCCATCAGACAAGTAGTGCTTAGATATGAAGATCGACTCGTCATCTCAAAAAACTCC
ACAATAATCCCCACTATTCTCCACACCTATCATGACTCTGTGTTTGGAGGACAATCGGGATTCCTACGCACTTATAAAAGAATAGCAGGAGAATTGTATTGGTTAGGGAT
GAAACATATGATAAAGAAATACTGTGACAAATGTTCCGTATGCCAAAGGAGTAAGACGTTATCATTATTACCCGGATTGCCAGTACCATTGGAGATTCCCAGCAAGATAT
GGAATGATATTTCGATGGACTTCATCGAAGGTTTACCTAAATCAAAAGGATGTGAAGTAATTTTTGTGGTAGTGGACCGATTGAGCAAATACGGACACTTCTTACCAGTC
AAGCATCCTTACACAGCCAAGAGCATTGCAGAATTATTCATCAAAGAAGTAGTTCGGCTCCACGGATATCCAAGCTCTATAGAACTATCCAGAATGGTTGGTACGAGATT
AAACCGAAGCACAGCATATCACCCTCAATCTGATGGCCAAACCGAGGTGGTCAACAGAGGATTAGAAACGTATTTACGCTGTTTTTGCGGAGAGAAACCAAAGGAATGGG
GAGATCATAGCACTTCAAACTCCACCCTGGATGAGCTATTAAAAGAAAGAGATATTGCTCCCTTGGAGTACTAA
Protein sequenceShow/hide protein sequence
MMKTWDNSDQGFLIECRTMEGRIITEERDDLQQTVVGEEAAVVVLRDYENVFMWPEKLPPRRDTEHHIHQKKDTKPVNVRPYRYAYQQKAEIEKLVDEMLTSGVIRPNSP
YSSPVLLVRKKDGSWRICVDYKALNNFTIPDKLPIPIIEELFDELNEASYFSKIDLKSGYHQIRMCEEDIKKTVFRTHEGHYEFMVMPFKLTNAPTFQSLMNSIFKPYLR
RFVLVFFDDTLIYSKDLKTHLQHLGLTLQVLRKNELYANRKCSFAQGKIDYLGHIISSQGVEVDPEKFRAIREWPIPINIRQVRGFLGLTGYYRKFVQNYGSIAAPLTQL
LKKGGFKWGEEAQEAFLKLQQAMMTLLVLALPNFNAPFEIETNASGIGIGVVLTQSKRPIAYFSHTLATKDRAKPVYERELMAVVLAVQRWRPYLLGRRFLVKTDQQSLK
FLLEQRMIQPEYQKWIAKLLGYSFEVVYKPGLENKAADALSRIPTSTHLNSLMVQTLIDLQVIKREVEEDDHLKKIITLIEKGEETEEQKYSIRQVVLRYEDRLVISKNS
TIIPTILHTYHDSVFGGQSGFLRTYKRIAGELYWLGMKHMIKKYCDKCSVCQRSKTLSLLPGLPVPLEIPSKIWNDISMDFIEGLPKSKGCEVIFVVVDRLSKYGHFLPV
KHPYTAKSIAELFIKEVVRLHGYPSSIELSRMVGTRLNRSTAYHPQSDGQTEVVNRGLETYLRCFCGEKPKEWGDHSTSNSTLDELLKERDIAPLEY