; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G00110 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G00110
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr7:209132..213094
RNA-Seq ExpressionCSPI07G00110
SyntenyCSPI07G00110
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]6.7e-17554.89Show/hide
Query:  IINENVNSTYIALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAF
        IIN+ VN T IALIAKKE C+ P++YRPISLTTS+YKLIAKVIAERLK   P T++ENQ+AFVKGR I+DAIL+ANEA+++W+ KKI+GFV+KLDIEKAF
Subjt:  IINENVNSTYIALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAF

Query:  DKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFA
        DK+NW FIDFML+KKG+P  W  WI ACI+SVQYSI+INGRPRGKI+P RGIRQGDPISPFIFVL MDY+S LLN + +   IKGV      NLTHLLFA
Subjt:  DKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFA

Query:  DDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYAS
        DDILLF+EDD+ +I N++  + LF+LASGL+INLNKSTISP N+D  RT  + S+WG S  FLPI YLGVPLGGK I++ FW  +  K+ +K+ +WKY+ 
Subjt:  DDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYAS

Query:  ISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKR
        +S+G K+TLIK++LAS+P Y LS+FK P S C +IEK W NFLW+N  E   ++L+ W+ + S   +GGL I+ ++ TNFALL+KW+WR+  E +PLWK+
Subjt:  ISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKR

Query:  IIIAKYEQTYLGELPTKSKYSSSKALWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWH----------------------------------DLSPR--
        II AKY     G++P    +SSS++ W SI KG+ W    + W I+ G + SFWHS WH                                  DL+PR  
Subjt:  IIIAKYEQTYLGELPTKSKYSSSKALWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWH----------------------------------DLSPR--

Query:  LRNVEEALWDDMKASL-PPLPDTGLDRPIWNLNSNDIFSMASTKIARTQNNQ
        LR  E  LW ++K SL     + G D P+W LNSN ++++AS K    Q  Q
Subjt:  LRNVEEALWDDMKASL-PPLPDTGLDRPIWNLNSNDIFSMASTKIARTQNNQ

KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.2e-0863.83Show/hide
Query:  SRLDRFLYTPNWESIFEPHFSRLLPRETSDHFSITLESNSLKWGPSP
        SR+DRFLYT NWE++F  H+S+ L R TSDHF I LES+ + WGPSP
Subjt:  SRLDRFLYTPNWESIFEPHFSRLLPRETSDHFSITLESNSLKWGPSP

KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]8.8e-17554.89Show/hide
Query:  IINENVNSTYIALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAF
        IIN+ VN T IALIAKKE C+ P++YRPISLTTS+YKLIAKVIAERLK   P T++ENQ+AFVKGR I+DAIL+ANEA+++W+ KKI+GFV+KLDIEKAF
Subjt:  IINENVNSTYIALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAF

Query:  DKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFA
        DK+NW FIDFML+KKG+P  W  WI ACI+SVQYSI+INGRPRGKI+P RGIRQGDPISPFIFVL MDY+S LLN + +   IKGV      NLTHLLFA
Subjt:  DKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFA

Query:  DDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYAS
        DDILLF+EDD+ +I N++  + LF+LASGL+INLNKSTISP N+D  RT  + S+WG S  FLPI YLGVPLGGK I++ FW  +  K+ +K+ +WKY+ 
Subjt:  DDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYAS

Query:  ISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKR
        +S+G K+TLIK++LAS+P Y LS+FK P S C +IEK W NFLW+N  E   ++L+ W+ + S   +GGL I+ ++ TNFALL+KW+WR+  E +PLWK+
Subjt:  ISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKR

Query:  IIIAKYEQTYLGELPTKSKYSSSKALWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWH----------------------------------DLSPR--
        II AKY     G++P    +SSS++ W SI KG+ W    + W I+ G + SFWHS WH                                  DL+PR  
Subjt:  IIIAKYEQTYLGELPTKSKYSSSKALWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWH----------------------------------DLSPR--

Query:  LRNVEEALWDDMKASL-PPLPDTGLDRPIWNLNSNDIFSMASTKIARTQNNQ
        LR  E  LW ++K SL     + G D P+W LNSN ++++AS K    Q  Q
Subjt:  LRNVEEALWDDMKASL-PPLPDTGLDRPIWNLNSNDIFSMASTKIARTQNNQ

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]6.9e-18038.06Show/hide
Query:  AVYGPSKRENRGDFWMELEEIKSTCLPRWLMGGDFNIVRWQSKTTAKNIAVPCMNYSILSFRDAISMIPQFQMP-----------NALGQTSRLDRFLYT
        A+YGP+KR+NR  FW ELE +KS CLP W++GGDFN++RW+ +TT KN A+     S+  F   IS       P            A    SRLDRFL+T
Subjt:  AVYGPSKRENRGDFWMELEEIKSTCLPRWLMGGDFNIVRWQSKTTAKNIAVPCMNYSILSFRDAISMIPQFQMP-----------NALGQTSRLDRFLYT

Query:  PNWESIFEPHFSRLLPRETSDHFSITLESNSLKWGPSP--------------------------------------------------------------
          WE+IF  H S++L R TSDHF I LES+++ WGPSP                                                              
Subjt:  PNWESIFEPHFSRLLPRETSDHFSITLESNSLKWGPSP--------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------GIINENVNSTYIALIAK
                                                                                            IIN+ VN T I LIAK
Subjt:  -----------------------------------------------------------------------------------GIINENVNSTYIALIAK

Query:  KETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAFDKINWTFIDFMLLKKG
        KE C   +++RPISLTT++YKLIAK +A+RLK   PDTISE+Q+AFVKGR I +AILIANEA++FW+ KK +GFV+KLDIEKAFDK+NW FIDF+L+KK 
Subjt:  KETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAFDKINWTFIDFMLLKKG

Query:  FPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFADDILLFMEDDDETIDN
        + Q W + I++CI+SVQYSILINGRPRG+IKP RGIRQGDP+SPFIFVL MDYLS LLN L     I GV F+   NLTH+LFADDIL+F+ED D+ + N
Subjt:  FPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFADDILLFMEDDDETIDN

Query:  MRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYASISRGDKVTLIKATLAS
        ++  L LFE ASGLNINL+KSTI P N+   R   +   WG S   LP  YLG+PLGG+P S +FW  +  K+Q+K++NWKY+ +S+G ++TLI +TL S
Subjt:  MRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYASISRGDKVTLIKATLAS

Query:  IPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKRIIIAKYEQTYLGELPT
        +P Y +SVFK PK +   IE +W NFLW       NI+LI+W+ ++SP  +GGL I++V STNFALL KW+W+F  EK+PLWKR+II+KY++  +G  P+
Subjt:  IPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKRIIIAKYEQTYLGELPT

Query:  KSKYSSSKALWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWHDLSP------------------------------------RLRNVEEALWDDMKASL
          K+SS+ + W ++ + + W    I W +  G+ +SFW   W+  +P                                     LR+ EE LW ++KASL
Subjt:  KSKYSSSKALWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWHDLSP------------------------------------RLRNVEEALWDDMKASL

Query:  P-PLPDTGLDRPIWNLNSNDIFSMASTKIA
        P PLP+ G  +P+WNLNSN+IF  AS K A
Subjt:  P-PLPDTGLDRPIWNLNSNDIFSMASTKIA

KAA0041367.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.2e-17655.43Show/hide
Query:  IINENVNSTYIALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAF
        IIN+ VN T IALIAKKE C+ P++YRPISLTTS+YKLIAKVIAERLK   P T++ENQ+AFVKGR I+DAIL+ANEA+++W+ KKI+GFV+KLDIEKAF
Subjt:  IINENVNSTYIALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAF

Query:  DKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFA
        DK+NW FIDFML+KKG+P  W +WI ACI+SVQYSI+INGRPRGKI+P RGIRQGDPISPFIFVL MDY+S LLN + +   IKGV      NLTHLLFA
Subjt:  DKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFA

Query:  DDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYAS
        DDILLF+EDD+ +I N++  + LF+LASGL+INLNKSTISP N+D  RT  + S+WG S  FLPI YLGVPLGGK  ++ FW  +  K+ +K+ +WKY+ 
Subjt:  DDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYAS

Query:  ISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKR
        +S+G K+TLIK++LAS+P Y LS+FK P S C +IEK W NFLW+N  E   ++L+ W+ + SP  RGGL I+ ++ TNFALL+KW+WR+  E +PLWK+
Subjt:  ISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKR

Query:  IIIAKYEQTYLGELPTKSKYSSSKALWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWH----------------------------------DLSPR--
        II AKY     G++P    +SSS++ W SI KGV W    + W I+ G + SFWHS WH                                  DL+PR  
Subjt:  IIIAKYEQTYLGELPTKSKYSSSKALWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWH----------------------------------DLSPR--

Query:  LRNVEEALWDDMKASL-PPLPDTGLDRPIWNLNSNDIFSMASTKIARTQNNQ
        LR  E  LW ++K SL     + G D P W LNSN ++++AS K A  Q +Q
Subjt:  LRNVEEALWDDMKASL-PPLPDTGLDRPIWNLNSNDIFSMASTKIARTQNNQ

TYK06777.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.8e-18039.62Show/hide
Query:  AVYGPSKRENRGDFWMELEEIKSTCLPRWLMGGDFNIVRWQSKTTAKN---IAVPCMNYSIL--SFRDAISMIPQFQMPNALGQ--TSRLDRFLYTPNWE
        A+YGPS   NR  FW EL ++K+ C P WL+ GDFN+VR+ S+T+A+N    ++ C N  I   +  D      +F   N       SR+DRFLYT NWE
Subjt:  AVYGPSKRENRGDFWMELEEIKSTCLPRWLMGGDFNIVRWQSKTTAKN---IAVPCMNYSIL--SFRDAISMIPQFQMPNALGQ--TSRLDRFLYTPNWE

Query:  SIFEPHFSRLLPRETSDHFSITLESNSLKWGPSP------------------------------------------------------------------
        ++F  H+S+ L R TSDHF I LES+ + WGPSP                                                                  
Subjt:  SIFEPHFSRLLPRETSDHFSITLESNSLKWGPSP------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------GI--------------------
                                                                                      G+                    
Subjt:  ------------------------------------------------------------------------------GI--------------------

Query:  -------------------------------------------------------INENVNSTYIALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAER
                                                               ++  +N T IALIAKKE C+ P++YRPISLTTS+YKLIAKVIAER
Subjt:  -------------------------------------------------------INENVNSTYIALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAER

Query:  LKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAFDKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKI
        LK   P T++ENQ+AFVK R I+DAIL+ANEA+++W+ KKI+GFV+KLDIEKAFDK+NW FIDFML+KKG+P  W  WI ACI+SVQYSI+INGRPRGKI
Subjt:  LKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAFDKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKI

Query:  KPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFADDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDL
        +P RGIRQGDPISPFIFVL MDY+S LLN + +   IKGV      NLTHLLFADDILLF+EDD+ +I N++  + LF+LASGL+INLNKSTISP N+D 
Subjt:  KPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFADDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDL

Query:  QRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYASISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRN
         RT  + S+WG S  FLPI YLGVPLGGK  ++ FW  +  K+ +K+ +WKY+ +S+G K+TLIK++LAS+P Y LS+FK P S C +IEK W NFLW+N
Subjt:  QRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYASISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRN

Query:  TYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKRIIIAKYEQTYLGELPTKSKYSSSKALWMSIIKGVGWVLPQIKWSIR
          E   ++L+ W+ + S   +GGL I+ ++ TNFALL+KW+WR+  E +PLWK+II AKY     G++P    +SSS++ W SI KG+ W    + W I+
Subjt:  TYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKRIIIAKYEQTYLGELPTKSKYSSSKALWMSIIKGVGWVLPQIKWSIR

Query:  RGDTLSFWHSKWH----------------------------------DLSPR--LRNVEEALWDDMKASL-PPLPDTGLDRPIWNLNSNDIFSMASTKIA
         G + SFWH  WH                                  DL+PR  LR  E  LW ++K S+     + G D P+W LNSN ++++AS K A
Subjt:  RGDTLSFWHSKWH----------------------------------DLSPR--LRNVEEALWDDMKASL-PPLPDTGLDRPIWNLNSNDIFSMASTKIA

Query:  RTQNNQ
          Q  Q
Subjt:  RTQNNQ

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo]2.2e-0863.83Show/hide
Query:  SRLDRFLYTPNWESIFEPHFSRLLPRETSDHFSITLESNSLKWGPSP
        SR+DRFLYT NWE++F  H+S+ L R TSDHF I LES+ + WGPSP
Subjt:  SRLDRFLYTPNWESIFEPHFSRLLPRETSDHFSITLESNSLKWGPSP

TrEMBL top hitse value%identityAlignment
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein1.1e-0863.83Show/hide
Query:  SRLDRFLYTPNWESIFEPHFSRLLPRETSDHFSITLESNSLKWGPSP
        SR+DRFLYT NWE++F  H+S+ L R TSDHF I LES+ + WGPSP
Subjt:  SRLDRFLYTPNWESIFEPHFSRLLPRETSDHFSITLESNSLKWGPSP

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein3.4e-18038.06Show/hide
Query:  AVYGPSKRENRGDFWMELEEIKSTCLPRWLMGGDFNIVRWQSKTTAKNIAVPCMNYSILSFRDAISMIPQFQMP-----------NALGQTSRLDRFLYT
        A+YGP+KR+NR  FW ELE +KS CLP W++GGDFN++RW+ +TT KN A+     S+  F   IS       P            A    SRLDRFL+T
Subjt:  AVYGPSKRENRGDFWMELEEIKSTCLPRWLMGGDFNIVRWQSKTTAKNIAVPCMNYSILSFRDAISMIPQFQMP-----------NALGQTSRLDRFLYT

Query:  PNWESIFEPHFSRLLPRETSDHFSITLESNSLKWGPSP--------------------------------------------------------------
          WE+IF  H S++L R TSDHF I LES+++ WGPSP                                                              
Subjt:  PNWESIFEPHFSRLLPRETSDHFSITLESNSLKWGPSP--------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------GIINENVNSTYIALIAK
                                                                                            IIN+ VN T I LIAK
Subjt:  -----------------------------------------------------------------------------------GIINENVNSTYIALIAK

Query:  KETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAFDKINWTFIDFMLLKKG
        KE C   +++RPISLTT++YKLIAK +A+RLK   PDTISE+Q+AFVKGR I +AILIANEA++FW+ KK +GFV+KLDIEKAFDK+NW FIDF+L+KK 
Subjt:  KETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAFDKINWTFIDFMLLKKG

Query:  FPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFADDILLFMEDDDETIDN
        + Q W + I++CI+SVQYSILINGRPRG+IKP RGIRQGDP+SPFIFVL MDYLS LLN L     I GV F+   NLTH+LFADDIL+F+ED D+ + N
Subjt:  FPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFADDILLFMEDDDETIDN

Query:  MRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYASISRGDKVTLIKATLAS
        ++  L LFE ASGLNINL+KSTI P N+   R   +   WG S   LP  YLG+PLGG+P S +FW  +  K+Q+K++NWKY+ +S+G ++TLI +TL S
Subjt:  MRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYASISRGDKVTLIKATLAS

Query:  IPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKRIIIAKYEQTYLGELPT
        +P Y +SVFK PK +   IE +W NFLW       NI+LI+W+ ++SP  +GGL I++V STNFALL KW+W+F  EK+PLWKR+II+KY++  +G  P+
Subjt:  IPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKRIIIAKYEQTYLGELPT

Query:  KSKYSSSKALWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWHDLSP------------------------------------RLRNVEEALWDDMKASL
          K+SS+ + W ++ + + W    I W +  G+ +SFW   W+  +P                                     LR+ EE LW ++KASL
Subjt:  KSKYSSSKALWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWHDLSP------------------------------------RLRNVEEALWDDMKASL

Query:  P-PLPDTGLDRPIWNLNSNDIFSMASTKIA
        P PLP+ G  +P+WNLNSN+IF  AS K A
Subjt:  P-PLPDTGLDRPIWNLNSNDIFSMASTKIA

A0A5A7TI93 LINE-1 retrotransposable element ORF2 protein5.9e-17755.43Show/hide
Query:  IINENVNSTYIALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAF
        IIN+ VN T IALIAKKE C+ P++YRPISLTTS+YKLIAKVIAERLK   P T++ENQ+AFVKGR I+DAIL+ANEA+++W+ KKI+GFV+KLDIEKAF
Subjt:  IINENVNSTYIALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAF

Query:  DKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFA
        DK+NW FIDFML+KKG+P  W +WI ACI+SVQYSI+INGRPRGKI+P RGIRQGDPISPFIFVL MDY+S LLN + +   IKGV      NLTHLLFA
Subjt:  DKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFA

Query:  DDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYAS
        DDILLF+EDD+ +I N++  + LF+LASGL+INLNKSTISP N+D  RT  + S+WG S  FLPI YLGVPLGGK  ++ FW  +  K+ +K+ +WKY+ 
Subjt:  DDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYAS

Query:  ISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKR
        +S+G K+TLIK++LAS+P Y LS+FK P S C +IEK W NFLW+N  E   ++L+ W+ + SP  RGGL I+ ++ TNFALL+KW+WR+  E +PLWK+
Subjt:  ISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKR

Query:  IIIAKYEQTYLGELPTKSKYSSSKALWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWH----------------------------------DLSPR--
        II AKY     G++P    +SSS++ W SI KGV W    + W I+ G + SFWHS WH                                  DL+PR  
Subjt:  IIIAKYEQTYLGELPTKSKYSSSKALWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWH----------------------------------DLSPR--

Query:  LRNVEEALWDDMKASL-PPLPDTGLDRPIWNLNSNDIFSMASTKIARTQNNQ
        LR  E  LW ++K SL     + G D P W LNSN ++++AS K A  Q +Q
Subjt:  LRNVEEALWDDMKASL-PPLPDTGLDRPIWNLNSNDIFSMASTKIARTQNNQ

A0A5D3C4J1 LINE-1 retrotransposable element ORF2 protein8.8e-18139.62Show/hide
Query:  AVYGPSKRENRGDFWMELEEIKSTCLPRWLMGGDFNIVRWQSKTTAKN---IAVPCMNYSIL--SFRDAISMIPQFQMPNALGQ--TSRLDRFLYTPNWE
        A+YGPS   NR  FW EL ++K+ C P WL+ GDFN+VR+ S+T+A+N    ++ C N  I   +  D      +F   N       SR+DRFLYT NWE
Subjt:  AVYGPSKRENRGDFWMELEEIKSTCLPRWLMGGDFNIVRWQSKTTAKN---IAVPCMNYSIL--SFRDAISMIPQFQMPNALGQ--TSRLDRFLYTPNWE

Query:  SIFEPHFSRLLPRETSDHFSITLESNSLKWGPSP------------------------------------------------------------------
        ++F  H+S+ L R TSDHF I LES+ + WGPSP                                                                  
Subjt:  SIFEPHFSRLLPRETSDHFSITLESNSLKWGPSP------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------GI--------------------
                                                                                      G+                    
Subjt:  ------------------------------------------------------------------------------GI--------------------

Query:  -------------------------------------------------------INENVNSTYIALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAER
                                                               ++  +N T IALIAKKE C+ P++YRPISLTTS+YKLIAKVIAER
Subjt:  -------------------------------------------------------INENVNSTYIALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAER

Query:  LKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAFDKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKI
        LK   P T++ENQ+AFVK R I+DAIL+ANEA+++W+ KKI+GFV+KLDIEKAFDK+NW FIDFML+KKG+P  W  WI ACI+SVQYSI+INGRPRGKI
Subjt:  LKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAFDKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKI

Query:  KPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFADDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDL
        +P RGIRQGDPISPFIFVL MDY+S LLN + +   IKGV      NLTHLLFADDILLF+EDD+ +I N++  + LF+LASGL+INLNKSTISP N+D 
Subjt:  KPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFADDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDL

Query:  QRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYASISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRN
         RT  + S+WG S  FLPI YLGVPLGGK  ++ FW  +  K+ +K+ +WKY+ +S+G K+TLIK++LAS+P Y LS+FK P S C +IEK W NFLW+N
Subjt:  QRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYASISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRN

Query:  TYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKRIIIAKYEQTYLGELPTKSKYSSSKALWMSIIKGVGWVLPQIKWSIR
          E   ++L+ W+ + S   +GGL I+ ++ TNFALL+KW+WR+  E +PLWK+II AKY     G++P    +SSS++ W SI KG+ W    + W I+
Subjt:  TYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKRIIIAKYEQTYLGELPTKSKYSSSKALWMSIIKGVGWVLPQIKWSIR

Query:  RGDTLSFWHSKWH----------------------------------DLSPR--LRNVEEALWDDMKASL-PPLPDTGLDRPIWNLNSNDIFSMASTKIA
         G + SFWH  WH                                  DL+PR  LR  E  LW ++K S+     + G D P+W LNSN ++++AS K A
Subjt:  RGDTLSFWHSKWH----------------------------------DLSPR--LRNVEEALWDDMKASL-PPLPDTGLDRPIWNLNSNDIFSMASTKIA

Query:  RTQNNQ
          Q  Q
Subjt:  RTQNNQ

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein3.2e-17554.89Show/hide
Query:  IINENVNSTYIALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAF
        IIN+ VN T IALIAKKE C+ P++YRPISLTTS+YKLIAKVIAERLK   P T++ENQ+AFVKGR I+DAIL+ANEA+++W+ KKI+GFV+KLDIEKAF
Subjt:  IINENVNSTYIALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAF

Query:  DKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFA
        DK+NW FIDFML+KKG+P  W  WI ACI+SVQYSI+INGRPRGKI+P RGIRQGDPISPFIFVL MDY+S LLN + +   IKGV      NLTHLLFA
Subjt:  DKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFA

Query:  DDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYAS
        DDILLF+EDD+ +I N++  + LF+LASGL+INLNKSTISP N+D  RT  + S+WG S  FLPI YLGVPLGGK I++ FW  +  K+ +K+ +WKY+ 
Subjt:  DDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYAS

Query:  ISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKR
        +S+G K+TLIK++LAS+P Y LS+FK P S C +IEK W NFLW+N  E   ++L+ W+ + S   +GGL I+ ++ TNFALL+KW+WR+  E +PLWK+
Subjt:  ISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKR

Query:  IIIAKYEQTYLGELPTKSKYSSSKALWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWH----------------------------------DLSPR--
        II AKY     G++P    +SSS++ W SI KG+ W    + W I+ G + SFWHS WH                                  DL+PR  
Subjt:  IIIAKYEQTYLGELPTKSKYSSSKALWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWH----------------------------------DLSPR--

Query:  LRNVEEALWDDMKASL-PPLPDTGLDRPIWNLNSNDIFSMASTKIARTQNNQ
        LR  E  LW ++K SL     + G D P+W LNSN ++++AS K    Q  Q
Subjt:  LRNVEEALWDDMKASL-PPLPDTGLDRPIWNLNSNDIFSMASTKIARTQNNQ

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein1.1e-0863.83Show/hide
Query:  SRLDRFLYTPNWESIFEPHFSRLLPRETSDHFSITLESNSLKWGPSP
        SR+DRFLYT NWE++F  H+S+ L R TSDHF I LES+ + WGPSP
Subjt:  SRLDRFLYTPNWESIFEPHFSRLLPRETSDHFSITLESNSLKWGPSP

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein4.2e-17554.89Show/hide
Query:  IINENVNSTYIALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAF
        IIN+ VN T IALIAKKE C+ P++YRPISLTTS+YKLIAKVIAERLK   P T++ENQ+AFVKGR I+DAIL+ANEA+++W+ KKI+GFV+KLDIEKAF
Subjt:  IINENVNSTYIALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAF

Query:  DKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFA
        DK+NW FIDFML+KKG+P  W  WI ACI+SVQYSI+INGRPRGKI+P RGIRQGDPISPFIFVL MDY+S LLN + +   IKGV      NLTHLLFA
Subjt:  DKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFA

Query:  DDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYAS
        DDILLF+EDD+ +I N++  + LF+LASGL+INLNKSTISP N+D  RT  + S+WG S  FLPI YLGVPLGGK I++ FW  +  K+ +K+ +WKY+ 
Subjt:  DDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYAS

Query:  ISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKR
        +S+G K+TLIK++LAS+P Y LS+FK P S C +IEK W NFLW+N  E   ++L+ W+ + S   +GGL I+ ++ TNFALL+KW+WR+  E +PLWK+
Subjt:  ISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKR

Query:  IIIAKYEQTYLGELPTKSKYSSSKALWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWH----------------------------------DLSPR--
        II AKY     G++P    +SSS++ W SI KG+ W    + W I+ G + SFWHS WH                                  DL+PR  
Subjt:  IIIAKYEQTYLGELPTKSKYSSSKALWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWH----------------------------------DLSPR--

Query:  LRNVEEALWDDMKASL-PPLPDTGLDRPIWNLNSNDIFSMASTKIARTQNNQ
        LR  E  LW ++K SL     + G D P+W LNSN ++++AS K    Q  Q
Subjt:  LRNVEEALWDDMKASL-PPLPDTGLDRPIWNLNSNDIFSMASTKIARTQNNQ

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein9.8e-2825.67Show/hide
Query:  GIINENVNSTYIALIAKK-ETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFV-VKLDIE
        GI+  +     I LI K     +   N+RPISL     K++ K++A R++      I  +Q+ F+ G      I  +   +    + K K  V + +D E
Subjt:  GIINENVNSTYIALIAKK-ETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFV-VKLDIE

Query:  KAFDKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHL
        KAFDKI   F+   L K G    + + I A       +I++NG+         G RQG P+SP +F + ++ L+  + Q EK+  IKG+    K  +   
Subjt:  KAFDKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHL

Query:  LFADDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPL--GGKPISRHFWSEITGKLQRKINN
        LFADD+++++E+   +  N+   +  F   SG  IN+ KS     N + Q  + +  +  F+I    I+YLG+ L    K + +  +  +  +++   N 
Subjt:  LFADDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPL--GGKPISRHFWSEITGKLQRKINN

Query:  WKYASISRGDKVTLIKATLAS--IPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSR-GGLSINNVESTNFALLSKWIWRFFE
        WK    S   ++ ++K  +    I  ++    K P +   ++EK  + F+W     +K   + K  S+LS  ++ GG+++ + +    A ++K  W +++
Subjt:  WKYASISRGDKVTLIKATLAS--IPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSR-GGLSINNVESTNFALLSKWIWRFFE

Query:  EKN-PLWKR
         ++   W R
Subjt:  EKN-PLWKR

P08548 LINE-1 reverse transcriptase homolog7.0e-2624.27Show/hide
Query:  GIINENVNSTYIALIAKK-ETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGR----HILDAILIANEAVNFWKQKKIKGFVVKL
        GI+        I LI K  +  +   NYRPISL     K++ K++  R++      I  +Q+ F+ G     +I  +I   N   +  K K     ++ +
Subjt:  GIINENVNSTYIALIAKK-ETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGR----HILDAILIANEAVNFWKQKKIKGFVVKL

Query:  DIEKAFDKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNL
        D EKAFD I   F+   L K G    + + I A  +    +I++NG          G RQG P+SP +F + M+ L+I + +   +  IKG+    +  +
Subjt:  DIEKAFDKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNL

Query:  THLLFADDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPL--GGKPISRHFWSEITGKLQRK
           LFADD+++++E+  ++   +   ++ +   SG  IN +KS       + Q    V     F++    ++YLGV L    K + +  +  +  ++   
Subjt:  THLLFADDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPL--GGKPISRHFWSEITGKLQRK

Query:  INNWKYASISRGDKVTLIKATL--ASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRF
        +N WK    S   ++ ++K ++   +I N++    K P S   D+EK  ++F+W     +K   + K + + +    GG+++ ++     +++ K  W +
Subjt:  INNWKYASISRGDKVTLIKATL--ASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRF

Query:  FEEKN-PLWKRI
         + +   +W RI
Subjt:  FEEKN-PLWKRI

P0C2F6 Putative ribonuclease H protein At1g657502.7e-2533.5Show/hide
Query:  VPLGGKPISRHFWSEITGKLQRKINNWKYASISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGG
        +P+  K I++  + EI  ++  +++ W+  ++S   ++TL KA L+S+P + +S    P+S+ N +++    FLW +T EKK  +L+KWS V SP   GG
Subjt:  VPLGGKPISRHFWSEITGKLQRKINNWKYASISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGG

Query:  LSINNVESTNFALLSKWIWRFFEEKNPLWKRIIIAKYEQTYLGE------LPTKSKYSSSKALWMSIIKGVGWVLPQ-IKWSIRRGDTLSFWHSKWHDLS
        L +   +S N AL+SK  WR  +EKN LW  ++  KY   ++GE      L  K  +SS+   W SI  G+  V+   + W    G  + FW  +W    
Subjt:  LSINNVESTNFALLSKWIWRFFEEKNPLWKRIIIAKYEQTYLGE------LPTKSKYSSSKALWMSIIKGVGWVLPQ-IKWSIRRGDTLSFWHSKWHDLS

Query:  PRL
        P L
Subjt:  PRL

P11369 LINE-1 retrotransposable element ORF2 protein1.7e-2424.54Show/hide
Query:  NYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKG-FVVKLDIEKAFDKINWTFIDFMLLKKGFPQNWGQ
        N+RPISL     K++ K++A R++      I  +Q+ F+ G      I  +   +++  + K K   ++ LD EKAFDKI   F+  +L + G    +  
Subjt:  NYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKG-FVVKLDIEKAFDKINWTFIDFMLLKKGFPQNWGQ

Query:  WISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFADDILLFMEDDDETIDNMRFALRL
         I A  +    +I +NG     I    G RQG P+SP++F + ++ L+  + Q ++   IKG+    K  +   L ADD+++++ D   +   +   +  
Subjt:  WISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHLLFADDILLFMEDDDETIDNMRFALRL

Query:  FELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGG--KPISRHFWSEITGKLQRKINNWKYASISRGDKVTLIKATL--ASIPN
        F    G  IN NKS       + Q    +     FSI    I+YLGV L    K +    +  +  +++  +  WK    S   ++ ++K  +   +I  
Subjt:  FELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGG--KPISRHFWSEITGKLQRKINNWKYASISRGDKVTLIKATL--ASIPN

Query:  YHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKN-PLWKRI
        ++    K P    N++E     F+W N   +   +L+K        + GG+++ +++    A++ K  W ++ ++    W RI
Subjt:  YHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKN-PLWKRI

P14381 Transposon TX1 uncharacterized 149 kDa protein6.2e-2224.39Show/hide
Query:  IALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAFDKINWTFIDF
        ++L+ KK    +  N+RP+SL ++ YK++AK I+ RLK V  + I  +Q   V GR I D + +  + ++F ++  +    + LD EKAFD+++  ++  
Subjt:  IALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKKIKGFVVKLDIEKAFDKINWTFIDF

Query:  MLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHL-LFADDILLFMED
         L    F   +  ++     S +  + IN      +   RG+RQG P+S  ++ L ++    LL +      + G+   E      L  +ADD++L  +D
Subjt:  MLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTHL-LFADDILLFMED

Query:  DDETIDNMRFALRLFELASGLNINLNKST---ISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGK--PISRHFWSEITGKLQRKINNWK-YASI-S
          + ++  +    ++  AS   IN +KS+        +D          W   I    I+YLGV L  +  P+S++F  E+   +  ++  WK +A + S
Subjt:  DDETIDNMRFALRLFELASGLNINLNKST---ISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGK--PISRHFWSEITGKLQRKINNWK-YASI-S

Query:  RGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSV---LSPTSRGGLSINNVESTNFALLSKWIWRF-FEEKNPLW
           +  +I   +AS   Y L    P +     I++  ++FLW   +         W S      P   GG  +  + S       + I R+ + + +P W
Subjt:  RGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSV---LSPTSRGGLSINNVESTNFALLSKWIWRF-FEEKNPLW

Query:  KRIIIAKYEQ
          +  + Y Q
Subjt:  KRIIIAKYEQ

Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein9.2e-1325.53Show/hide
Query:  FSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYASISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIK
        F+   LP++YLG+PL  K ++   +  +  K++ +I  W    +S   ++ LI + + S+ N+ +S F+ P +   +I+    +FLW           + 
Subjt:  FSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYASISRGDKVTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIK

Query:  WSSVLSPTSRGGLSINNVESTN---------FALLSKWIWR
        WS V +P   GGL I +++  N            L  W+W+
Subjt:  WSSVLSPTSRGGLSINNVESTN---------FALLSKWIWR

AT4G20520.1 RNA binding;RNA-directed DNA polymerases4.3e-1039.51Show/hide
Query:  IAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKK-IKGF-VVKLDIEKAFDKINWTFIDFMLLKKGFPQNW
        + ERLK +  + I   Q +F+ GR   D I+   EAV+  ++KK +KG+ ++KLD+EKA+D+I W +++  L+  GFP+ W
Subjt:  IAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKK-IKGF-VVKLDIEKAFDKINWTFIDFMLLKKGFPQNW

AT4G29090.1 Ribonuclease H-like superfamily protein2.3e-1125.5Show/hide
Query:  SIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKRIIIAKYEQTYLGELP
        ++P Y ++ F  PK+VC  I     +F WRN  E K ++   W  +    + GG+   ++E+ N ALL K +WR       L  ++  ++Y   +    P
Subjt:  SIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKRIIIAKYEQTYLGELP

Query:  TKSKYSSSKA-LWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWHDLSP
          +   S  + +W SI      +    +  +  G+ +  W  KW D  P
Subjt:  TKSKYSSSKA-LWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWHDLSP

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.1e-1043.28Show/hide
Query:  LINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSF-NEKHNLTHLLFADD
        +ING P+G + P RG+RQGDP+SP++F+L  + LS L  + ++   + G+   N    + HLLFADD
Subjt:  LINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSF-NEKHNLTHLLFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGGATAAACCTCACGGTAGACTTGGGCCATCTTTCCCCCATCACTGATGTTGCTATCTCAAGCCCAGAAAACATCACTCCAAATACTCATCTTTCTATAGCTCA
ATCGGAGATTGGCAATGACAGCATTGGTCCCATAAACAACAACCTGAAGCTCACTACAAGCTCCTCTGCTGGTATTGTTAATGAAGATAGTGAGACTATTGGTGATGAGA
ATGTAATGATTCTATTTTGTGATTCTTGTGGAAACGAAACTGCAACATATCAACAAAAAATTATTAAATCTCTTTGGAGCCCTATAAGTATTAAATGGCAATTCTCCCCT
GCTGAAAATAGTTCGGGAGCAGTCTATGGGCCATCGAAACGGGAAAATAGAGGAGATTTCTGGATGGAGCTTGAAGAGATTAAATCAACTTGCCTTCCAAGATGGCTTAT
GGGTGGGGACTTCAATATTGTTAGATGGCAATCGAAAACTACAGCAAAAAATATTGCAGTTCCCTGTATGAATTATTCAATTCTTTCATTTCGGGATGCGATTTCTATGA
TCCCCCAATTTCAAATGCCAAATGCACTTGGTCAAACCTCAAGGCTTGACAGATTCCTTTATACTCCAAATTGGGAATCTATTTTTGAACCCCACTTTTCAAGACTGCTT
CCCAGAGAAACATCAGACCATTTTTCCATTACACTTGAGTCAAACAGCCTTAAGTGGGGCCCATCACCTGGCATTATCAACGAGAATGTGAACTCCACATATATTGCTCT
TATTGCCAAAAAGGAGACATGCTCAGTTCCTTCGAACTACAGACCCATAAGCTTAACGACTAGTCTATACAAGCTCATTGCTAAAGTTATTGCTGAAAGACTTAAGCTCG
TTCAACCTGATACAATCTCAGAGAATCAATTAGCTTTCGTCAAAGGCAGACATATTCTTGATGCCATTCTGATTGCAAATGAAGCGGTGAACTTCTGGAAACAGAAAAAA
ATCAAAGGCTTTGTGGTCAAGCTTGACATTGAAAAGGCTTTCGATAAAATAAATTGGACCTTCATTGACTTCATGCTCCTTAAGAAAGGCTTCCCCCAAAATTGGGGCCA
ATGGATTAGTGCATGTATTACTAGTGTCCAATACTCCATCCTCATCAATGGTAGACCCAGAGGAAAAATCAAACCAATGAGAGGCATCCGACAAGGAGATCCTATCTCTC
CTTTTATCTTTGTACTCACTATGGATTATCTTAGTATACTCCTCAATCAATTGGAAAAAGATAACTTGATTAAAGGTGTAAGTTTCAACGAGAAACACAACCTCACTCAC
CTCCTGTTTGCTGATGATATCTTACTTTTTATGGAGGATGATGACGAAACCATTGATAACATGAGATTTGCCCTTCGGCTTTTTGAATTGGCCTCAGGTCTCAACATCAA
TCTCAATAAATCTACGATTTCACCTACCAACATTGATTTGCAGAGAACAAATTGCGTGACATCAAAATGGGGTTTCTCCATAAACTTTCTTCCCATCCAATATTTGGGAG
TGCCTTTAGGAGGTAAACCAATTTCTAGACACTTCTGGTCTGAAATTACTGGGAAACTCCAGAGGAAAATCAATAATTGGAAATACGCTTCTATTTCCAGAGGTGACAAA
GTTACTCTCATTAAAGCTACTTTAGCTAGCATCCCAAATTACCATCTTTCGGTTTTCAAGCCTCCCAAATCTGTCTGCAATGATATTGAGAAAAATTGGATGAACTTTCT
ATGGAGAAACACTTATGAAAAGAAAAATATCAACCTCATTAAATGGTCATCGGTTCTGTCTCCTACCAGCAGAGGTGGCCTGAGCATCAACAACGTTGAGAGTACAAATT
TTGCTCTTTTGAGCAAATGGATTTGGAGATTCTTTGAAGAGAAAAATCCTCTATGGAAACGTATTATCATTGCAAAATACGAGCAAACATACTTGGGTGAGCTTCCAACT
AAGAGCAAATACAGCAGCTCAAAAGCTCTTTGGATGTCTATCATTAAAGGTGTTGGCTGGGTTCTCCCTCAGATTAAATGGAGCATTAGAAGGGGTGACACACTATCATT
TTGGCACAGTAAATGGCACGACCTTAGTCCGCGATTAAGAAATGTTGAGGAAGCCCTCTGGGATGACATGAAAGCTTCCCTTCCTCCCTTGCCTGACACTGGACTCGACA
GACCTATCTGGAATTTAAACAGCAATGACATTTTCTCGATGGCTTCCACGAAAATTGCAAGAACACAAAACAACCAAGTTGAATATGACGACGATGGGCTGCTGAGGAAG
ACAGACACCATTTATTCTCCCTCTGCCCATTGTCAAAAAACCTCTGGAAAAAAGTTGAAGAGGTACTGGACAAACCCCTCCCCCATACGAATCCCTCTGTTTTATGCAAA
GAACCTTTTAAAGCAAAAGGAAAATAAAAATAACAAACCATCAAACAATATCTGGTGGCTGCTACCCTGTGGAATATCTGGAATGAAAGAAACAGAAGGACTTTTAAGGG
AGAAGAAAAATCAGCTGATTCAGTTTGGGAAGACATTCAAGCTACAACTGGTCTATGGACTAGTCGTTCTTCCCTTTTCAAAAATTATTCGCCCAGCTCTATTGCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGGATAAACCTCACGGTAGACTTGGGCCATCTTTCCCCCATCACTGATGTTGCTATCTCAAGCCCAGAAAACATCACTCCAAATACTCATCTTTCTATAGCTCA
ATCGGAGATTGGCAATGACAGCATTGGTCCCATAAACAACAACCTGAAGCTCACTACAAGCTCCTCTGCTGGTATTGTTAATGAAGATAGTGAGACTATTGGTGATGAGA
ATGTAATGATTCTATTTTGTGATTCTTGTGGAAACGAAACTGCAACATATCAACAAAAAATTATTAAATCTCTTTGGAGCCCTATAAGTATTAAATGGCAATTCTCCCCT
GCTGAAAATAGTTCGGGAGCAGTCTATGGGCCATCGAAACGGGAAAATAGAGGAGATTTCTGGATGGAGCTTGAAGAGATTAAATCAACTTGCCTTCCAAGATGGCTTAT
GGGTGGGGACTTCAATATTGTTAGATGGCAATCGAAAACTACAGCAAAAAATATTGCAGTTCCCTGTATGAATTATTCAATTCTTTCATTTCGGGATGCGATTTCTATGA
TCCCCCAATTTCAAATGCCAAATGCACTTGGTCAAACCTCAAGGCTTGACAGATTCCTTTATACTCCAAATTGGGAATCTATTTTTGAACCCCACTTTTCAAGACTGCTT
CCCAGAGAAACATCAGACCATTTTTCCATTACACTTGAGTCAAACAGCCTTAAGTGGGGCCCATCACCTGGCATTATCAACGAGAATGTGAACTCCACATATATTGCTCT
TATTGCCAAAAAGGAGACATGCTCAGTTCCTTCGAACTACAGACCCATAAGCTTAACGACTAGTCTATACAAGCTCATTGCTAAAGTTATTGCTGAAAGACTTAAGCTCG
TTCAACCTGATACAATCTCAGAGAATCAATTAGCTTTCGTCAAAGGCAGACATATTCTTGATGCCATTCTGATTGCAAATGAAGCGGTGAACTTCTGGAAACAGAAAAAA
ATCAAAGGCTTTGTGGTCAAGCTTGACATTGAAAAGGCTTTCGATAAAATAAATTGGACCTTCATTGACTTCATGCTCCTTAAGAAAGGCTTCCCCCAAAATTGGGGCCA
ATGGATTAGTGCATGTATTACTAGTGTCCAATACTCCATCCTCATCAATGGTAGACCCAGAGGAAAAATCAAACCAATGAGAGGCATCCGACAAGGAGATCCTATCTCTC
CTTTTATCTTTGTACTCACTATGGATTATCTTAGTATACTCCTCAATCAATTGGAAAAAGATAACTTGATTAAAGGTGTAAGTTTCAACGAGAAACACAACCTCACTCAC
CTCCTGTTTGCTGATGATATCTTACTTTTTATGGAGGATGATGACGAAACCATTGATAACATGAGATTTGCCCTTCGGCTTTTTGAATTGGCCTCAGGTCTCAACATCAA
TCTCAATAAATCTACGATTTCACCTACCAACATTGATTTGCAGAGAACAAATTGCGTGACATCAAAATGGGGTTTCTCCATAAACTTTCTTCCCATCCAATATTTGGGAG
TGCCTTTAGGAGGTAAACCAATTTCTAGACACTTCTGGTCTGAAATTACTGGGAAACTCCAGAGGAAAATCAATAATTGGAAATACGCTTCTATTTCCAGAGGTGACAAA
GTTACTCTCATTAAAGCTACTTTAGCTAGCATCCCAAATTACCATCTTTCGGTTTTCAAGCCTCCCAAATCTGTCTGCAATGATATTGAGAAAAATTGGATGAACTTTCT
ATGGAGAAACACTTATGAAAAGAAAAATATCAACCTCATTAAATGGTCATCGGTTCTGTCTCCTACCAGCAGAGGTGGCCTGAGCATCAACAACGTTGAGAGTACAAATT
TTGCTCTTTTGAGCAAATGGATTTGGAGATTCTTTGAAGAGAAAAATCCTCTATGGAAACGTATTATCATTGCAAAATACGAGCAAACATACTTGGGTGAGCTTCCAACT
AAGAGCAAATACAGCAGCTCAAAAGCTCTTTGGATGTCTATCATTAAAGGTGTTGGCTGGGTTCTCCCTCAGATTAAATGGAGCATTAGAAGGGGTGACACACTATCATT
TTGGCACAGTAAATGGCACGACCTTAGTCCGCGATTAAGAAATGTTGAGGAAGCCCTCTGGGATGACATGAAAGCTTCCCTTCCTCCCTTGCCTGACACTGGACTCGACA
GACCTATCTGGAATTTAAACAGCAATGACATTTTCTCGATGGCTTCCACGAAAATTGCAAGAACACAAAACAACCAAGTTGAATATGACGACGATGGGCTGCTGAGGAAG
ACAGACACCATTTATTCTCCCTCTGCCCATTGTCAAAAAACCTCTGGAAAAAAGTTGAAGAGGTACTGGACAAACCCCTCCCCCATACGAATCCCTCTGTTTTATGCAAA
GAACCTTTTAAAGCAAAAGGAAAATAAAAATAACAAACCATCAAACAATATCTGGTGGCTGCTACCCTGTGGAATATCTGGAATGAAAGAAACAGAAGGACTTTTAAGGG
AGAAGAAAAATCAGCTGATTCAGTTTGGGAAGACATTCAAGCTACAACTGGTCTATGGACTAGTCGTTCTTCCCTTTTCAAAAATTATTCGCCCAGCTCTATTGCTTTAA
Protein sequenceShow/hide protein sequence
MEGINLTVDLGHLSPITDVAISSPENITPNTHLSIAQSEIGNDSIGPINNNLKLTTSSSAGIVNEDSETIGDENVMILFCDSCGNETATYQQKIIKSLWSPISIKWQFSP
AENSSGAVYGPSKRENRGDFWMELEEIKSTCLPRWLMGGDFNIVRWQSKTTAKNIAVPCMNYSILSFRDAISMIPQFQMPNALGQTSRLDRFLYTPNWESIFEPHFSRLL
PRETSDHFSITLESNSLKWGPSPGIINENVNSTYIALIAKKETCSVPSNYRPISLTTSLYKLIAKVIAERLKLVQPDTISENQLAFVKGRHILDAILIANEAVNFWKQKK
IKGFVVKLDIEKAFDKINWTFIDFMLLKKGFPQNWGQWISACITSVQYSILINGRPRGKIKPMRGIRQGDPISPFIFVLTMDYLSILLNQLEKDNLIKGVSFNEKHNLTH
LLFADDILLFMEDDDETIDNMRFALRLFELASGLNINLNKSTISPTNIDLQRTNCVTSKWGFSINFLPIQYLGVPLGGKPISRHFWSEITGKLQRKINNWKYASISRGDK
VTLIKATLASIPNYHLSVFKPPKSVCNDIEKNWMNFLWRNTYEKKNINLIKWSSVLSPTSRGGLSINNVESTNFALLSKWIWRFFEEKNPLWKRIIIAKYEQTYLGELPT
KSKYSSSKALWMSIIKGVGWVLPQIKWSIRRGDTLSFWHSKWHDLSPRLRNVEEALWDDMKASLPPLPDTGLDRPIWNLNSNDIFSMASTKIARTQNNQVEYDDDGLLRK
TDTIYSPSAHCQKTSGKKLKRYWTNPSPIRIPLFYAKNLLKQKENKNNKPSNNIWWLLPCGISGMKETEGLLREKKNQLIQFGKTFKLQLVYGLVVLPFSKIIRPALLL