CuGenDBv2

Cucsat.G7675 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G7675
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Ty3/gypsy retrotransposon protein
Genome location: ctg1544:700635..705440
RNA-Seq Expression: Cucsat.G7675
Synteny: Cucsat.G7675
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits
KAE8637561.1 hypothetical protein CSA_017659 [Cucumis sativus] (e value: 4.65e-306; % identity: 74.55)
Query:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKV
        MKLKG++ GK V++LIDS AT+NFI    V E  L ++ G  F VTIG+G +C G GICKRV+++L+ +++VADFLA+ELG VD++LGMQWLD+TGTMK+
Subjt:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKV

Query:  HWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAID
        HWPSLTM+FW   ++IILKGDPSL ++EC L+T+EKTW++ DQGFLLEFQ YE++ +   E +  +KG EE +PM++  L+QYA++F  P  LPP+R ID
Subjt:  HWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAID

Query:  HRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKL
        HRIL + DQ+PINVRPYKYG+VQKEEIEKLV+EMLQAGVIRPS SPYSSP+LLVKKKDGGWRFCVDYRKLNQVT++DKFPIP+IEELLDELHGATVFSKL
Subjt:  HRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKL

Query:  DLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKC
        D+KS YHQIRM+EEDVEKT FRTHEGHYEFLVMPFGLTNAPATFQSLMN+VFKPFLRRCVLVFF DILVYS DI EH KHLGMVFA+LRD+ LFAN+ KC
Subjt:  DLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKC

Query:  VIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALP
        VIAHS++QYLGH+ISS+GV+ADE KI+ MV WP+PKD+TGLRGFLGL+GYYRRFVK YGEIAAPLT+LLQKN+F W+E+AT+AF++LK AMTT+PVLALP
Subjt:  VIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALP

Query:  DWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK
        +W  PF IETDASG GLGAVLSQ+GHPIAFFSQKLS RAQ KSIYERELM VVLSVQK
Subjt:  DWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK

KAE8637598.1 hypothetical protein CSA_022681 [Cucumis sativus] (e value: 0.0; % identity: 98.3)
Query:  MQWLDSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFR
        MQWLDSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQR LEQYADVFR
Subjt:  MQWLDSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFR

Query:  LPTGLPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELL
        LPTGLPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSP+LLVKKKDGGWRFCVDYRKLNQVTVADKFPIP+IEELL
Subjt:  LPTGLPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELL

Query:  DELHGATVFSKLDLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAIL
        DELHGAT FSKLDLKSGYHQIRMREEDVEKT F THEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAIL
Subjt:  DELHGATVFSKLDLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAIL

Query:  RDHELFANRSKCVIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLK
        RDHELFANRSKCVIAHSQVQYLGHLISSRGVEADE KIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLK
Subjt:  RDHELFANRSKCVIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLK

Query:  LAMTTLPVLALPDWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK
        LAMTTLPVLALPDWS+PFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK
Subjt:  LAMTTLPVLALPDWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK

TYK14439.1 uncharacterized protein E5676_scaffold186G00980 [Cucumis melo var. makuwa] (e value: 1.89e-312; % identity: 76.34)
Query:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKV
        MKLKG V  KE+V+LIDS AT+NFI Q L +ELQ+ ++  T+FG TIGNG +C+G G+C+RV++KLKE+ I+ADFLAVELGTVD VLGMQWLD+TGTM++
Subjt:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKV

Query:  HWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAID
        HWPSLTM FW +GR+I+LKGDPSL K+ECSL+TLEKTWQ  DQGFLLE+ N E++ E + ET+ E+KG E  +PM++  L+QYAD+F  P GLPP+R ID
Subjt:  HWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAID

Query:  HRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKL
        HRILT+ DQ+PINVRPYKYGHVQKEEIE LV EMLQ G+IRPS SPYSSP+LLVK+KDGGWRFCVDYRKLNQ TV+DKFPIP+IEELLDELHGA VFSKL
Subjt:  HRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKL

Query:  DLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKC
        DLKSGYHQIRM+EED+EKT FRTHEGHYEFLVMPFGLTNA ATFQSLMN+VFKPFLRRCVLVF YDILVYS DI EH KHLGMVFA+LRD++L+AN+ KC
Subjt:  DLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKC

Query:  VIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALP
        V AHS++QYLGH IS +GVEADE KI+SMV+WPRP D++ LRGFLGLTGYYRRFVK Y +IA PLTKLLQKNAF W EEA  AF +LK+AMTT+PVLALP
Subjt:  VIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALP

Query:  DWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK
        DW+ PFTIETDASG GLGAVLSQ GHPIAFFSQKLS RAQ KSIYERELM VVLSVQK
Subjt:  DWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK

TYK14624.1 uncharacterized protein E5676_scaffold1275G00160 [Cucumis melo var. makuwa] (e value: 2.63e-309; % identity: 75.45)
Query:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKV
        MKLKG +  KE+VILIDS AT+NFI Q L  +L+L ++  T+FG TIGNG +C+G GIC+RV+VKL+E+ I+ADFLAVELG+VD VLGMQWLD+ GTMK+
Subjt:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKV

Query:  HWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAID
        HWPSLTM+FW +GR+IILKGDP L K+ECSLRTLEKTWQ  DQGFLLE+ N EV  E   +T+ + KG E  +PM++  L+QY D+F  P GLPP+R ID
Subjt:  HWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAID

Query:  HRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKL
        HRILT+ +Q+PINVRPYKYGHVQK EIE LV EMLQ G+IRPSRSPYSS +LLVKKKDGGWRFCVDYRKLNQ TV+DKFPIP+IEELLDEL+GA VFSKL
Subjt:  HRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKL

Query:  DLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKC
        DLKSGYHQIRM+EED+EKT FRTHEGHYEFLVMPFGLTNAPATFQSLMN+VFKPFLRRCVLVFF DILVYS D+ EH KHLGM+FA+LRD++L+AN  KC
Subjt:  DLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKC

Query:  VIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALP
        V AHS++QYLGH IS  GVEAD+ KIRSMVNWPRP D+T LRGFLGLTGYYRRFVK Y  I  PLTKLLQKNAF WNEEA   F +LK+AMTT+PVLALP
Subjt:  VIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALP

Query:  DWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK
        DWS PFTIETDASG GLGAVLSQ GHPIAF+S+KLS RAQ KSIYERELMAVVLSVQ+
Subjt:  DWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK

XP_031742100.1 uncharacterized protein LOC116404055 [Cucumis sativus] (e value: 0.0; % identity: 97.68)
Query:  LSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTL
        LSIDPGTRFGVTIGN NQCEGSGICKRVKVKLKEL IVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTL
Subjt:  LSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTL

Query:  EKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEM
        EKTWQSGDQGFLLEFQNYEVNYEGESETEAELKG+EEGLPMVQR LEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEM
Subjt:  EKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEM

Query:  LQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMP
        LQAGVIRPSRSPYSSP+LLVK KDGGWRFCVDYRKLNQVTVADKFPIP+IEELLDELHGAT FSKLDLKSGYHQIRMREEDVEKT F THEGHYEFLVMP
Subjt:  LQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMP

Query:  FGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPR
        FGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISSRGVEADE KIRSMVNWPR
Subjt:  FGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPR

Query:  PKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQK
        PKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWS+PFTIETDASGVGLGAVLSQDGHPIAFFSQK
Subjt:  PKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQK

Query:  LSPRAQGKSIYERELMA
        LSPRAQGKSIYERELMA
Subjt:  LSPRAQGKSIYERELMA

TrEMBL top hits
A0A5A7UM77 Ty3/gypsy retrotransposon protein (e value: 9.76e-304; % identity: 76.34)
Query:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKV
        MKLKG +  KEVV+LIDS AT+NFI   L  +L+L +DP T FG TIGNG +C G GIC+RV+VKL E+ I+ADFLAVELG+VD VLGMQWLD+TGTMK+
Subjt:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKV

Query:  HWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAID
        HWPSLTM+FW  GR+I+LKGDPSL ++ECSLRTLEKTWQ  DQGFLLE+ N +V  +   +T+ + +G E  +PM++  L+QY D+F  P GLPP+R ID
Subjt:  HWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAID

Query:  HRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKL
        HRILT+ DQKPINVRPYKYGH+QK EIEKLV EMLQ GVIRPSRSPYSSP+LLVKKK+GGWRFCVDYRKLNQ T++DKFPIP+IEELLDEL+GA VFSKL
Subjt:  HRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKL

Query:  DLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKC
        DLKSGYHQIRM+EED+EKT FRTHEGHYEFLVMPFGLTNAPATFQSLMN+VFKPFLRRCVLVFFYDILVYS DI EH KHLGMVFA+LRD++L+AN  KC
Subjt:  DLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKC

Query:  VIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALP
        V AHS++QYLGH IS  GVEADE KIRSMVNWPRP D+T LRGFL LTGYYRRFVK Y  IA PLTKLLQKNAF WNE+A  AF +LK+AMTT+PVLALP
Subjt:  VIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALP

Query:  DWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK
        DWS PFTIETDASG GLGAVLSQ GHPIAF+SQKLS RAQ KSIYERELMAVVLSVQ+
Subjt:  DWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK

A0A5A7VK94 Ty3/gypsy retrotransposon protein (e value: 4.02e-304; % identity: 74.19)
Query:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKV
        MKL+G V GKEV++LIDS AT+NFI   LV E ++ I+  T+FG+TIG+G  C+G GIC +V+++L+ L +V D L V LGT+D+VLGMQWLD+TGTMK+
Subjt:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKV

Query:  HWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAID
        HWPSLTM FW +G +++LKGDP+L ++ECSL+TLEKTW++ DQGFLL++Q YE+  E          G EEGLPM+Q  L QY+DVF  PT LPP+R ID
Subjt:  HWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAID

Query:  HRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKL
        HRILT+  QKPINVRPYKYGH QKEEIEKLV+EMLQ G+IRPS SP+SSP+LLVKKKDGGWRFCVDYRKLN++T+ADKFPIP+IEELLDELHGAT+FSKL
Subjt:  HRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKL

Query:  DLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKC
        DLKSGYHQIRM+EED+EKT FRTHEGHYEF+VMPFGLTNAPATFQSLMN+VFKPFLRRCVLVFF DILVYS DI EH KHLGMVFA LRD++L+ANR KC
Subjt:  DLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKC

Query:  VIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALP
        V AHSQ+ YLGH+IS  GVEAD+ K++ M+ WP+PKD+TGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAF W+E AT+AF+ LK AM+T+PVLALP
Subjt:  VIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALP

Query:  DWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK
        DWS PF IETDASG GLGAVLSQ+ HPIAFFSQKLS RAQ KSIYERELMAVVLSVQK
Subjt:  DWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK

A0A5D3BJ50 Ty3/gypsy retrotransposon protein (e value: 9.76e-304; % identity: 76.34)
Query:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKV
        MKLKG +  KEVV+LIDS AT+NFI   L  +L+L +DP T FG TIGNG +C G GIC+RV+VKL E+ I+ADFLAVELG+VD VLGMQWLD+TGTMK+
Subjt:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKV

Query:  HWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAID
        HWPSLTM+FW  GR+I+LKGDPSL ++ECSLRTLEKTWQ  DQGFLLE+ N +V  +   +T+ + +G E  +PM++  L+QY D+F  P GLPP+R ID
Subjt:  HWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAID

Query:  HRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKL
        HRILT+ DQKPINVRPYKYGH+QK EIEKLV EMLQ GVIRPSRSPYSSP+LLVKKK+GGWRFCVDYRKLNQ T++DKFPIP+IEELLDEL+GA VFSKL
Subjt:  HRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKL

Query:  DLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKC
        DLKSGYHQIRM+EED+EKT FRTHEGHYEFLVMPFGLTNAPATFQSLMN+VFKPFLRRCVLVFFYDILVYS DI EH KHLGMVFA+LRD++L+AN  KC
Subjt:  DLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKC

Query:  VIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALP
        V AHS++QYLGH IS  GVEADE KIRSMVNWPRP D+T LRGFL LTGYYRRFVK Y  IA PLTKLLQKNAF WNE+A  AF +LK+AMTT+PVLALP
Subjt:  VIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALP

Query:  DWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK
        DWS PFTIETDASG GLGAVLSQ GHPIAF+SQKLS RAQ KSIYERELMAVVLSVQ+
Subjt:  DWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK

A0A5D3CUL0 Reverse transcriptase domain-containing protein (e value: 1.27e-309; % identity: 75.45)
Query:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKV
        MKLKG +  KE+VILIDS AT+NFI Q L  +L+L ++  T+FG TIGNG +C+G GIC+RV+VKL+E+ I+ADFLAVELG+VD VLGMQWLD+ GTMK+
Subjt:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKV

Query:  HWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAID
        HWPSLTM+FW +GR+IILKGDP L K+ECSLRTLEKTWQ  DQGFLLE+ N EV  E   +T+ + KG E  +PM++  L+QY D+F  P GLPP+R ID
Subjt:  HWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAID

Query:  HRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKL
        HRILT+ +Q+PINVRPYKYGHVQK EIE LV EMLQ G+IRPSRSPYSS +LLVKKKDGGWRFCVDYRKLNQ TV+DKFPIP+IEELLDEL+GA VFSKL
Subjt:  HRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKL

Query:  DLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKC
        DLKSGYHQIRM+EED+EKT FRTHEGHYEFLVMPFGLTNAPATFQSLMN+VFKPFLRRCVLVFF DILVYS D+ EH KHLGM+FA+LRD++L+AN  KC
Subjt:  DLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKC

Query:  VIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALP
        V AHS++QYLGH IS  GVEAD+ KIRSMVNWPRP D+T LRGFLGLTGYYRRFVK Y  I  PLTKLLQKNAF WNEEA   F +LK+AMTT+PVLALP
Subjt:  VIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALP

Query:  DWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK
        DWS PFTIETDASG GLGAVLSQ GHPIAF+S+KLS RAQ KSIYERELMAVVLSVQ+
Subjt:  DWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK

A0A5D3CW02 Uncharacterized protein (e value: 9.14e-313; % identity: 76.34)
Query:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKV
        MKLKG V  KE+V+LIDS AT+NFI Q L +ELQ+ ++  T+FG TIGNG +C+G G+C+RV++KLKE+ I+ADFLAVELGTVD VLGMQWLD+TGTM++
Subjt:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKV

Query:  HWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAID
        HWPSLTM FW +GR+I+LKGDPSL K+ECSL+TLEKTWQ  DQGFLLE+ N E++ E + ET+ E+KG E  +PM++  L+QYAD+F  P GLPP+R ID
Subjt:  HWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAID

Query:  HRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKL
        HRILT+ DQ+PINVRPYKYGHVQKEEIE LV EMLQ G+IRPS SPYSSP+LLVK+KDGGWRFCVDYRKLNQ TV+DKFPIP+IEELLDELHGA VFSKL
Subjt:  HRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKL

Query:  DLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKC
        DLKSGYHQIRM+EED+EKT FRTHEGHYEFLVMPFGLTNA ATFQSLMN+VFKPFLRRCVLVF YDILVYS DI EH KHLGMVFA+LRD++L+AN+ KC
Subjt:  DLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKC

Query:  VIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALP
        V AHS++QYLGH IS +GVEADE KI+SMV+WPRP D++ LRGFLGLTGYYRRFVK Y +IA PLTKLLQKNAF W EEA  AF +LK+AMTT+PVLALP
Subjt:  VIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALP

Query:  DWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK
        DW+ PFTIETDASG GLGAVLSQ GHPIAFFSQKLS RAQ KSIYERELM VVLSVQK
Subjt:  DWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK

SwissProt top hits
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 5.7e-82; % identity: 43.6)
Query:  YKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGG-----WRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKLDLKSGYHQIRM
        Y Y    ++E+E  + +ML  G+IR S SPY+SPI +V KK        +R  +DYRKLN++TV D+ PIP ++E+L +L     F+ +DL  G+HQI M
Subjt:  YKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGG-----WRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKLDLKSGYHQIRM

Query:  REEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLG
          E V KT F T  GHYE+L MPFGL NAPATFQ  MN++ +P L +  LV+  DI+V+S  +DEH++ LG+VF  L    L     KC     +  +LG
Subjt:  REEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLG

Query:  HLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKN--AFHWNEEATIAFDQLKLAMTTLPVLALPDWSKPFTIE
        H+++  G++ +  KI ++  +P P     ++ FLGLTGYYR+F+ ++ +IA P+TK L+KN      N E   AF +LK  ++  P+L +PD++K FT+ 
Subjt:  HLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKN--AFHWNEEATIAFDQLKLAMTTLPVLALPDWSKPFTIE

Query:  TDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVV
        TDAS V LGAVLSQDGHP+++ S+ L+      S  E+EL+A+V
Subjt:  TDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVV

P10401 Retrovirus-related Pol polyprotein from transposon gypsy (e value: 2.0e-66; % identity: 36.62)
Query:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKK------DGGWRFCVDYRKLNQVTVADKFPIPMIEE
        LP   A+   I TV D +P+  R Y       + +   V ++L+ G+IRPSRSPY+SP  +V KK      +   R  +D+RKLN+ T+ D++P+P I  
Subjt:  LPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKK------DGGWRFCVDYRKLNQVTVADKFPIPMIEE

Query:  LLDELHGATVFSKLDLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFA
        +L  L  A  F+ LDLKSGYHQI + E D EKT+F  + G YEF  +PFGL NA + FQ  +++V +  + +   V+  D++++S +  +H++H+  V  
Subjt:  LLDELHGATVFSKLDLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFA

Query:  ILRDHELFANRSKCVIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLL------------QKNAF
         L D  +  ++ K       V+YLG ++S  G ++D  K++++  +P P  +  +R FLGL  YYR F+K +  IA P+T +L            +K   
Subjt:  ILRDHELFANRSKCVIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLL------------QKNAF

Query:  HWNEEATIAFDQLKLAMTTLPV-LALPDWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK
         +NE    AF +L+  + +  V L  PD+ KPF + TDAS  G+GAVLSQ+G PI   S+ L    Q  +  EREL+A+V ++ K
Subjt:  HWNEEATIAFDQLKLAMTTLPV-LALPDWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQK

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 6.3e-81; % identity: 35.7)
Query:  DLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFL--LEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLE
        D+++G + L +  ++ +++ + T+T + +  ++I     S       ++   ++  S DQ  +  L+F  + +++  + ET      K +GL    R LE
Subjt:  DLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFL--LEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLE

Query:  QYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGG-----WRFCVDYRKLNQVTVA
             ++    L     I H +L      PI  + Y      + E+E  V EML  G+IR S SPY+SP  +V KK        +R  +DYRKLN++T+ 
Subjt:  QYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGG-----WRFCVDYRKLNQVTVA

Query:  DKFPIPMIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDE
        D++PIP ++E+L +L     F+ +DL  G+HQI M EE + KT F T  GHYE+L MPFGL NAPATFQ  MN + +P L +  LV+  DI+++S  + E
Subjt:  DKFPIPMIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDE

Query:  HMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHW
        H+  + +VF  L D  L     KC     +  +LGH+++  G++ +  K++++V++P P     +R FLGLTGYYR+F+ +Y +IA P+T  L+K     
Subjt:  HMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHW

Query:  NE--EATIAFDQLKLAMTTLPVLALPDWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVV
         +  E   AF++LK  +   P+L LPD+ K F + TDAS + LGAVLSQ+GHPI+F S+ L+      S  E+EL+A+V
Subjt:  NE--EATIAFDQLKLAMTTLPVLALPDWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVV

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus (e value: 4.4e-74; % identity: 37.38)
Query:  MVQRFLEQYADVFRLP-TGLPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKK-----DGGWRFCVDYR
        ++   L ++  +F  P +G+    A+   I T   Q PI  + Y Y    + E+E+ + E+LQ G+IRPS SPY+SPI +V KK     +  +R  VD++
Subjt:  MVQRFLEQYADVFRLP-TGLPPRRAIDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKK-----DGGWRFCVDYR

Query:  KLNQVTVADKFPIPMIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDIL
        +LN VT+ D +PIP I   L  L  A  F+ LDL SG+HQI M+E D+ KT F T  G YEFL +PFGL NAPA FQ +++++ +  + +   V+  DI+
Subjt:  KLNQVTVADKFPIPMIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDIL

Query:  VYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKL
        V+S D D H K+L +V A L    L  N  K     +QV++LG+++++ G++AD  K+R++   P P  +  L+ FLG+T YYR+F++ Y ++A PLT L
Subjt:  VYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKL

Query:  LQ------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSKPFTIETDASGVGLGAVLSQD----GHPIAFFSQKLSPRAQGKSIYERELMAV
         +            K     +E A  +F+ LK  + +  +LA P ++KPF + TDAS   +GAVLSQD      PIA+ S+ L+   +  +  E+E++A+
Subjt:  LQ------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSKPFTIETDASGVGLGAVLSQD----GHPIAFFSQKLSPRAQGKSIYERELMAV

Query:  VLSV
        + S+
Subjt:  VLSV

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e value: 2.9e-65; % identity: 36.62)
Query:  EQYADVFRLPTGLPPRRA------IDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVT
        ++Y ++ R    LPPR A      + H I      +   ++PY      ++EI K+V ++L    I PS+SP SSP++LV KKDG +R CVDYR LN+ T
Subjt:  EQYADVFRLPTGLPPRRA------IDHRILTVADQKPINVRPYKYGHVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVT

Query:  VADKFPIPMIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDI
        ++D FP+P I+ LL  +  A +F+ LDL SGYHQI M  +D  KT F T  G YE+ VMPFGL NAP+TF   M + F+    R V V+  DIL++S   
Subjt:  VADKFPIPMIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTTFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDI

Query:  DEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAF
        +EH KHL  V   L++  L   + KC  A  + ++LG+ I  + +   + K  ++ ++P PK +   + FLG+  YYRRF+ +  +IA P+ +L   +  
Subjt:  DEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAF

Query:  HWNEEATIAFDQLKLAMTTLPVLALPDWSKPFTIETDASGVGLGAVLSQDGHP------IAFFSQKLSPRAQGKSIYERELMAVV
         W E+   A D+LK A+   PVL   +    + + TDAS  G+GAVL +  +       + +FS+ L    +     E EL+ ++
Subjt:  HWNEEATIAFDQLKLAMTTLPVLALPDWSKPFTIETDASGVGLGAVLSQDGHP------IAFFSQKLSPRAQGKSIYERELMAVV

Arabidopsis top hits
AT3G29750.1 Eukaryotic aspartyl protease family protein (e value: 5.1e-09; % identity: 25.58)
Query:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELG--TVDLVLGMQWLDSTGTM
        M+  G +   +VV+ IDS AT+NFI   L   L+L      +  V +G     +  G C  +++ ++E+ I  +FL ++L    VD++LG +WL   G  
Subjt:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELG--TVDLVLGMQWLDSTGTM

Query:  KVHWPSLTMTFWMKGRRIILKGDPS---------LTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESE
         V+W +   +F    + I L  +             KSE     +E+   +  +  ++ +   +V  +GES+
Subjt:  KVHWPSLTMTFWMKGRRIILKGDPS---------LTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESE

AT3G30770.1 Eukaryotic aspartyl protease family protein (e value: 2.6e-05; % identity: 33.33)
Query:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVEL--GTVDLVLG
        M+  G ++  +VV++IDS ATNNFIS  L   L+L      +  V +G     +  G C  + + ++E+ I  +FL ++L    VD++LG
Subjt:  MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVEL--GTVDLVLG

ATMG00850.1 DNA/RNA polymerases superfamily protein (e value: 1.1e-06; % identity: 55)
Query:  VQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGW
        +++  ++  + EML+A +I+PS SPYSSP+LLV+KKDGGW
Subjt:  VQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGW

ATMG00860.1 DNA/RNA polymerases superfamily protein (e value: 5.0e-41; % identity: 60.31)
Query:  MKHLGMVFAILRDHELFANRSKCVIAHSQVQYLG--HLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFH
        M HLGMV  I   H+ +ANR KC     Q+ YLG  H+IS  GV AD AK+ +MV WP PK+ T LRGFLGLTGYYRRFVK+YG+I  PLT+LL+KN+  
Subjt:  MKHLGMVFAILRDHELFANRSKCVIAHSQVQYLG--HLISSRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFH

Query:  WNEEATIAFDQLKLAMTTLPVLALPDWSKPF
        W E A +AF  LK A+TTLPVLALPD   PF
Subjt:  WNEEATIAFDQLKLAMTTLPVLALPDWSKPF


Sequences
CDS sequence
ATGAAACTAAAAGGACATGTGAACGGAAAGGAAGTAGTTATTCTGATTGACAGCAGGGCGACCAATAACTTCATCAGCCAAGTGTTGGTAGATGAACTACAGTTGAGCAT
CGATCCAGGAACTCGTTTTGGAGTTACCATTGGGAATGGCAACCAATGTGAAGGAAGTGGGATTTGCAAGAGGGTGAAGGTGAAGTTAAAAGAGTTAATAATCGTAGCAG
ATTTCCTGGCGGTAGAGTTAGGAACGGTAGACTTGGTGCTTGGGATGCAATGGCTAGATTCGACAGGAACCATGAAGGTTCACTGGCCATCCCTAACCATGACGTTTTGG
ATGAAGGGTAGAAGAATAATCCTAAAAGGTGACCCTTCTCTAACGAAGTCAGAATGTTCATTGAGAACCTTAGAGAAAACGTGGCAATCCGGGGACCAAGGATTCCTCTT
GGAATTCCAAAACTATGAAGTAAACTATGAAGGAGAATCGGAAACAGAAGCAGAATTGAAGGGAAAGGAAGAAGGATTACCCATGGTTCAGCGATTTCTCGAGCAATATG
CAGATGTCTTTAGGTTGCCCACGGGTTTACCGCCAAGGAGAGCCATAGACCATCGCATTCTGACCGTGGCCGATCAGAAACCAATTAATGTAAGACCATATAAGTATGGC
CATGTACAAAAGGAAGAGATTGAAAAATTGGTGTTAGAAATGTTACAAGCTGGGGTGATTCGTCCAAGCCGCAGCCCATATTCGAGCCCGATCCTCTTAGTGAAGAAAAA
AGATGGAGGGTGGAGATTTTGTGTAGATTACAGGAAACTCAATCAAGTAACGGTGGCTGACAAATTTCCAATTCCCATGATCGAAGAACTCTTAGATGAACTTCATGGTG
CGACAGTTTTCTCAAAGTTAGACCTTAAATCTGGATACCACCAGATTAGGATGAGGGAGGAAGATGTGGAGAAAACAACTTTCCGCACGCATGAAGGACATTATGAGTTC
TTGGTGATGCCTTTCGGCCTTACGAATGCTCCTGCCACCTTCCAATCTCTCATGAACGAGGTGTTCAAACCATTCCTTCGAAGGTGTGTCCTGGTTTTTTTTTATGACAT
TCTAGTTTATAGTGTGGACATAGATGAGCACATGAAACATTTAGGAATGGTTTTTGCGATCTTGAGGGACCATGAATTGTTTGCAAATAGGTCTAAATGTGTCATTGCTC
ATTCCCAAGTTCAATATTTGGGTCATCTGATTTCCAGCAGAGGAGTGGAGGCTGATGAGGCCAAGATTCGCAGTATGGTAAATTGGCCACGGCCGAAGGATATAACTGGG
CTGAGGGGATTCCTTGGACTGACTGGGTATTATAGAAGATTTGTGAAAAGCTATGGAGAAATAGCTGCACCCTTAACCAAATTACTTCAGAAAAATGCATTCCATTGGAA
TGAGGAAGCCACAATAGCGTTTGACCAGCTGAAGCTAGCAATGACAACCTTACCGGTATTAGCATTGCCGGATTGGTCTAAGCCCTTCACAATCGAAACTGATGCTTCAG
GAGTAGGTTTAGGCGCAGTTTTATCACAGGATGGTCATCCCATCGCATTCTTCAGCCAGAAACTGTCCCCAAGAGCCCAGGGTAAGTCGATCTATGAAAGGGAATTGATG
GCGGTTGTCCTTTCAGTGCAAAAATGA
mRNA sequence
ATGAAACTAAAAGGACATGTGAACGGAAAGGAAGTAGTTATTCTGATTGACAGCAGGGCGACCAATAACTTCATCAGCCAAGTGTTGGTAGATGAACTACAGTTGAGCAT
CGATCCAGGAACTCGTTTTGGAGTTACCATTGGGAATGGCAACCAATGTGAAGGAAGTGGGATTTGCAAGAGGGTGAAGGTGAAGTTAAAAGAGTTAATAATCGTAGCAG
ATTTCCTGGCGGTAGAGTTAGGAACGGTAGACTTGGTGCTTGGGATGCAATGGCTAGATTCGACAGGAACCATGAAGGTTCACTGGCCATCCCTAACCATGACGTTTTGG
ATGAAGGGTAGAAGAATAATCCTAAAAGGTGACCCTTCTCTAACGAAGTCAGAATGTTCATTGAGAACCTTAGAGAAAACGTGGCAATCCGGGGACCAAGGATTCCTCTT
GGAATTCCAAAACTATGAAGTAAACTATGAAGGAGAATCGGAAACAGAAGCAGAATTGAAGGGAAAGGAAGAAGGATTACCCATGGTTCAGCGATTTCTCGAGCAATATG
CAGATGTCTTTAGGTTGCCCACGGGTTTACCGCCAAGGAGAGCCATAGACCATCGCATTCTGACCGTGGCCGATCAGAAACCAATTAATGTAAGACCATATAAGTATGGC
CATGTACAAAAGGAAGAGATTGAAAAATTGGTGTTAGAAATGTTACAAGCTGGGGTGATTCGTCCAAGCCGCAGCCCATATTCGAGCCCGATCCTCTTAGTGAAGAAAAA
AGATGGAGGGTGGAGATTTTGTGTAGATTACAGGAAACTCAATCAAGTAACGGTGGCTGACAAATTTCCAATTCCCATGATCGAAGAACTCTTAGATGAACTTCATGGTG
CGACAGTTTTCTCAAAGTTAGACCTTAAATCTGGATACCACCAGATTAGGATGAGGGAGGAAGATGTGGAGAAAACAACTTTCCGCACGCATGAAGGACATTATGAGTTC
TTGGTGATGCCTTTCGGCCTTACGAATGCTCCTGCCACCTTCCAATCTCTCATGAACGAGGTGTTCAAACCATTCCTTCGAAGGTGTGTCCTGGTTTTTTTTTATGACAT
TCTAGTTTATAGTGTGGACATAGATGAGCACATGAAACATTTAGGAATGGTTTTTGCGATCTTGAGGGACCATGAATTGTTTGCAAATAGGTCTAAATGTGTCATTGCTC
ATTCCCAAGTTCAATATTTGGGTCATCTGATTTCCAGCAGAGGAGTGGAGGCTGATGAGGCCAAGATTCGCAGTATGGTAAATTGGCCACGGCCGAAGGATATAACTGGG
CTGAGGGGATTCCTTGGACTGACTGGGTATTATAGAAGATTTGTGAAAAGCTATGGAGAAATAGCTGCACCCTTAACCAAATTACTTCAGAAAAATGCATTCCATTGGAA
TGAGGAAGCCACAATAGCGTTTGACCAGCTGAAGCTAGCAATGACAACCTTACCGGTATTAGCATTGCCGGATTGGTCTAAGCCCTTCACAATCGAAACTGATGCTTCAG
GAGTAGGTTTAGGCGCAGTTTTATCACAGGATGGTCATCCCATCGCATTCTTCAGCCAGAAACTGTCCCCAAGAGCCCAGGGTAAGTCGATCTATGAAAGGGAATTGATG
GCGGTTGTCCTTTCAGTGCAAAAATGA
Protein sequence
MKLKGHVNGKEVVILIDSRATNNFISQVLVDELQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELIIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFW
MKGRRIILKGDPSLTKSECSLRTLEKTWQSGDQGFLLEFQNYEVNYEGESETEAELKGKEEGLPMVQRFLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYG
HVQKEEIEKLVLEMLQAGVIRPSRSPYSSPILLVKKKDGGWRFCVDYRKLNQVTVADKFPIPMIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTTFRTHEGHYEF
LVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISSRGVEADEAKIRSMVNWPRPKDITG
LRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSKPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELM
AVVLSVQK