CuGenDBv2

CSPI07G08770 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI07G08770
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Reverse transcriptase
Genome location: Chr7:6611136..6613930
RNA-Seq Expression: CSPI07G08770
Synteny: CSPI07G08770
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0016787 - hydrolase activity (molecular function)
InterPro domains:
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
  IPR041588 - Integrase zinc-binding domain
  IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits: e value, % identity, alignment
CAA7014963.1 unnamed protein product [Microthlaspi erraticum], e value 7.2e-144, 43.51% identity
Query:  ITKCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPV
        + KC FCT ++ FLGF++ E  +++DE KV A+  W   K+  ++           +F+++FS ITAP+T+CLKKG F+WG +Q  +F  +K KL + PV
Subjt:  ITKCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPV

Query:  LKLPKFDSPFEVAVDASGVGIGAVLSQGGHPVEFFSEKLK----------QELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKMHARWVSY
        L LP FD  F+V  DASGVGIGAVLSQ   PV FFSEKL           QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L +QK INKMHARWVS+
Subjt:  LKLPKFDSPFEVAVDASGVGIGAVLSQGGHPVEFFSEKLK----------QELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKMHARWVSY

Query:  IQKFDFLIKHQACKENRAADALSRKS-----------------DSPHCALNFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALIKEAHSRG
        +QKF F+I+H++   N+ ADALSR++                 +       F E+W  C       D+H+ +GFL KGD+LCIP +SLRE LI++ H  G
Subjt:  IQKFDFLIKHQACKENRAADALSRKS-----------------DSPHCALNFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALIKEAHSRG

Query:  LARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSHFLPYKKT
        L+ H G++KT   + +++YW   RRD    VKRC+ICQ +KG S N GLY PLP+P ++W+DLS+DFV+GLP+TQ  VD V VVVDRFSKM+HF+  KKT
Subjt:  LARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSHFLPYKKT

Query:  IDAPCLEDIV-------------------------------EKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA-------------
         DA  +  +                                  F TTLK S+TAHPQTD QTEVTNR+LGN+IR + G+ P+QWD+A             
Subjt:  IDAPCLEDIV-------------------------------EKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA-------------

Query:  ---------------PRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTYGKMKDKQ
                       P+   DL  LPK   +   AE +AE I      V   +  T +  K   +K+RR   F+  D VM  L+K+RFP+GTY K++  +
Subjt:  ---------------PRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTYGKMKDKQ

Query:  IGPCRILAKYEPNAYKIELPRGFNISPIFNVADLRSYKA
         GP ++L K   NAY ++LP   NIS  FNVAD+  Y A
Subjt:  IGPCRILAKYEPNAYKIELPRGFNISPIFNVADLRSYKA

CAA7028195.1 unnamed protein product [Microthlaspi erraticum], e value 7.2e-144, 43.51% identity
Query:  ITKCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPV
        + KC FCT ++ FLGF++ E  +++DE KV A+  W   K+  ++           +F+++FS ITAP+T+CLKKG F+WG +Q  +F  +K KL + PV
Subjt:  ITKCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPV

Query:  LKLPKFDSPFEVAVDASGVGIGAVLSQGGHPVEFFSEKLK----------QELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKMHARWVSY
        L LP FD  F+V  DASGVGIGAVLSQ   PV FFSEKL           QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L +QK INKMHARWVS+
Subjt:  LKLPKFDSPFEVAVDASGVGIGAVLSQGGHPVEFFSEKLK----------QELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKMHARWVSY

Query:  IQKFDFLIKHQACKENRAADALSRKS-----------------DSPHCALNFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALIKEAHSRG
        +QKF F+I+H++   N+ ADALSR++                 +       F E+W  C       D+H+ +GFL KGD+LCIP +SLRE LI++ H  G
Subjt:  IQKFDFLIKHQACKENRAADALSRKS-----------------DSPHCALNFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALIKEAHSRG

Query:  LARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSHFLPYKKT
        L+ H G++KT   + +++YW   RRD    VKRC+ICQ +KG S N GLY PLP+P ++W+DLS+DFV+GLP+TQ  VD V VVVDRFSKM+HF+  KKT
Subjt:  LARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSHFLPYKKT

Query:  IDAPCLEDIV-------------------------------EKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA-------------
         DA  +  +                                  F TTLK S+TAHPQTD QTEVTNR+LGN+IR + G+ P+QWD+A             
Subjt:  IDAPCLEDIV-------------------------------EKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA-------------

Query:  ---------------PRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTYGKMKDKQ
                       P+   DL  LPK   +   AE +AE I      V   +  T +  K   +K+RR   F+  D VM  L+K+RFP+GTY K++  +
Subjt:  ---------------PRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTYGKMKDKQ

Query:  IGPCRILAKYEPNAYKIELPRGFNISPIFNVADLRSYKA
         GP ++L K   NAY ++LP   NIS  FNVAD+  Y A
Subjt:  IGPCRILAKYEPNAYKIELPRGFNISPIFNVADLRSYKA

PWA81295.1 transposon Ty3-I Gag-Pol polyprotein [Artemisia annua], e value 1.7e-140, 36.39% identity
Query:  MSPKEYEFLHQHIEDFLTKGHIQPNISPCEHRSLCRTSSINSKEGWQRMCVDSRAINKIT----------------------------------------
        MSPKE + L + +E+ L KGHIQ +ISPC   +L        K+G  RMCVDSRAINKIT                                        
Subjt:  MSPKEYEFLHQHIEDFLTKGHIQPNISPCEHRSLCRTSSINSKEGWQRMCVDSRAINKIT----------------------------------------

Query:  ------------------------------------------------------------------------------------KCIFCTEEISFLGFII
                                                                                            KC F T ++ FLG+I+
Subjt:  ------------------------------------------------------------------------------------KCIFCTEEISFLGFII

Query:  SENQVKMDESKVEAVTKWSWMK-----------ARWKLKFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPVLKLPKFDSPFEVAVDASG
        S + + +DE KV+AV  W   K           A +  +F++NFS I AP+T+C+KKG F W ++ + +F  +K +L + PVL LP FD+ FE+  DA G
Subjt:  SENQVKMDESKVEAVTKWSWMK-----------ARWKLKFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPVLKLPKFDSPFEVAVDASG

Query:  VGIGAVLSQGGHPVEFFSEKL----------KQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKMHARWVSYIQKFDFLIKHQACKENRA
         GIGAVLSQ G PV F SEKL          +QELYA+V+A+K+WEHYL+ +EFV+ +DH +LKY Q Q+++NK+HARW S+++KF+++IKH++   N+ 
Subjt:  VGIGAVLSQGGHPVEFFSEKL----------KQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKMHARWVSYIQKFDFLIKHQACKENRA

Query:  ADALSRKSDSPHCALN-----------------FSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALIKEAHSRGLARHFGQEKTFQIIMKKF
        ADALSRK+       N                 F   W       H  ++ L++G+L KG++LCIP TSLR  LIKE H+ GL+ H G++KT   +  +F
Subjt:  ADALSRKSDSPHCALN-----------------FSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALIKEAHSRGLARHFGQEKTFQIIMKKF

Query:  YWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSHFLPYKKTIDAPCL-----EDIV---
        YW Q +RD+ +FV+RC +CQ  KG + N GLY PLP+P++ W D+S+DFV+GLP+TQ  VD V VVVDRFSKM+HF+P KKT DA  +     +++V   
Subjt:  YWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSHFLPYKKTIDAPCL-----EDIV---

Query:  -----------------------EKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDM----------------------------APRL
                                +  T+L FS+TAHPQTD QTEV NR+LGN+IRCL G  P+ WD+                            +PR 
Subjt:  -----------------------EKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDM----------------------------APRL

Query:  TFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTYGKMKDKQIGPCRILAKYEPNAYKIE
          DL  LP +  +Q  A ++ E +Q  H  V   I+++   YK   +K RR   F+V D VM  L+K+RFP+GTY K++ K+ GP +IL K   NAY ++
Subjt:  TFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTYGKMKDKQIGPCRILAKYEPNAYKIE

Query:  LPRGFNISPIFNVADLRSY
        LP   +IS  FNV+D+  +
Subjt:  LPRGFNISPIFNVADLRSY

XP_024641774.2 uncharacterized protein LOC112422671 [Medicago truncatula], e value 9.8e-141, 43.03% identity
Query:  DSRAINKITKCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMK-----------ARWKLKFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKR
        D++    + KC F T  + FLGFI+ E  +++DE KV A+  W               A +  +FI++FS ITAP+T+CLKKG F WG +Q  +F  +K+
Subjt:  DSRAINKITKCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMK-----------ARWKLKFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKR

Query:  KLVSQPVLKLPKFDSPFEVAVDASGVGIGAVLSQGGHPVEFFSEKLK----------QELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKM
        KL + PVL LP F+  F+V  DASG+GIGAVLSQ   P+ FFSEKL           QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L +QK INKM
Subjt:  KLVSQPVLKLPKFDSPFEVAVDASGVGIGAVLSQGGHPVEFFSEKLK----------QELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKM

Query:  HARWVSYIQKFDFLIKHQACKENRAADALSRKS-----------------DSPHCALNFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALI
        HARWVS++QKF F+I+H++   N+ ADALSR++                 +       F E+++ C       D+H+ EG+L KGDQLCIP +SLRE LI
Subjt:  HARWVSYIQKFDFLIKHQACKENRAADALSRKS-----------------DSPHCALNFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALI

Query:  KEAHSRGLARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSH
        ++ HS GL+ H G++KT   + ++FYW   R+D+   VK+C+ CQ +KG S N GLY PLPIP ++W+DLS+DFV+GLP+TQ  VD V VVVDRFSKMSH
Subjt:  KEAHSRGLARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSH

Query:  FLPYKKTIDAPCLEDIV-------------------------------EKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA------
        F+  K+T DA  +  +                                + F T+L  S+TAHPQTD QTEVTNR+LGN+IRC+ G+ P+QWD+A      
Subjt:  FLPYKKTIDAPCLEDIV-------------------------------EKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA------

Query:  ----------------------PRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTY
                              PR   DL  LPK   +   AE++AE I  +   V   +  T    K   +K+RR   F V D VM  L+K+RFP+GTY
Subjt:  ----------------------PRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTY

Query:  GKMKDKQIGPCRILAKYEPNAYKIELPRGFNISPIFNVADLRSYKA
        GK++ ++ GP ++  K   NAY + LP   NIS  FNVAD+  Y A
Subjt:  GKMKDKQIGPCRILAKYEPNAYKIELPRGFNISPIFNVADLRSYKA

XP_025979678.1 uncharacterized protein LOC112997809 [Glycine max], e value 2.8e-140, 43.59% identity
Query:  ITKCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMK-----------ARWKLKFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPV
        + KC F T ++ FLGF++ E+ +++DE KV A+  W               A +  +FI++FS ITAP+T+CLKKG + WG +Q+ +F  +K KL + PV
Subjt:  ITKCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMK-----------ARWKLKFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPV

Query:  LKLPKFDSPFEVAVDASGVGIGAVLSQGGHPVEFFSEKLK----------QELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKMHARWVSY
        L LP FD  F+V  DASG+GIGAVLSQ   P+ FFSEKL           QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L +QK INKMHARWVS+
Subjt:  LKLPKFDSPFEVAVDASGVGIGAVLSQGGHPVEFFSEKLK----------QELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKMHARWVSY

Query:  IQKFDFLIKHQACKENRAADALSRKSDSPHCAL------------------NFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALIKEAHSR
        +QKF F+I+H++   N+ ADALSR+ DS    L                   F E+W+ C  H  D D+H+ EGFL KG++LCIP +SLRE LI++ H  
Subjt:  IQKFDFLIKHQACKENRAADALSRKSDSPHCAL------------------NFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALIKEAHSR

Query:  GLARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSHFLPYKK
        GL+ H G++KT   + ++FYW   R+D    VK+C+ CQ +KG S N GLY PLPIP ++W+DL++DFV+GLP+TQ  VD V VVVDRFSKMSHF+  KK
Subjt:  GLARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSHFLPYKK

Query:  TIDAPCLEDIV-------------------------------EKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA------------
        T DA  +  +                                + FDT+L  S+TAHPQTD QTEVTNR+LGN+IRC+ G+ P+QWD+A            
Subjt:  TIDAPCLEDIV-------------------------------EKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA------------

Query:  ----------------PRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTYGKMKDK
                        PR   DL  LPK       AE +AE I  +   V   +  T    K   +K++R   F V D VM  L+K+RFP+GTY K++ +
Subjt:  ----------------PRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTYGKMKDK

Query:  QIGPCRILAKYEPNAYKIELPRGFNISPIFNVADLRSYKA
        + GP ++  K   NAY + LP   NIS  FNVAD+  Y A
Subjt:  QIGPCRILAKYEPNAYKIELPRGFNISPIFNVADLRSYKA

TrEMBL top hits: e value, % identity, alignment
A0A5B7BER3 Uncharacterized protein, e value 9.8e-147, 37.14% identity
Query:  MSPKEYEFLHQHIEDFLTKGHIQPNISPCEHRSLCRTSSINSKEGWQRMCVDSRAINKIT----------------------------------------
        MSPKE E L Q +ED + KG IQ ++SPC   +L        K+G  RMCVDSRAINKIT                                        
Subjt:  MSPKEYEFLHQHIEDFLTKGHIQPNISPCEHRSLCRTSSINSKEGWQRMCVDSRAINKIT----------------------------------------

Query:  ------------------------------------------------------------------------------------KCIFCTEEISFLGFII
                                                                                            KC F T  + FLGFII
Subjt:  ------------------------------------------------------------------------------------KCIFCTEEISFLGFII

Query:  SENQVKMDESKVEAVTKWSWMK-----------ARWKLKFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPVLKLPKFDSPFEVAVDASG
            +++DE KV A+  W   K           A +  +FI+NFS I AP+TDC+KKG F W + Q+ +F  +K KL + PVL LP F+  F+V  DAS 
Subjt:  SENQVKMDESKVEAVTKWSWMK-----------ARWKLKFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPVLKLPKFDSPFEVAVDASG

Query:  VGIGAVLSQGGHPVEFFSEKLKQ----------ELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKMHARWVSYIQKFDFLIKHQACKENRA
         GIGAVLSQ G PVEFFSEKL +          EL+A+VRALK WEHYL+ +EFV+ +DH +LK++  Q ++++MH RW++++Q+F F++KH+A ++N+ 
Subjt:  VGIGAVLSQGGHPVEFFSEKLKQ----------ELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKMHARWVSYIQKFDFLIKHQACKENRA

Query:  ADALSRKS------DSPHCAL-----------NFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALIKEAHSRGLARHFGQEKTFQIIMKKF
        ADALSR++       S   +            +F + W+ C +     ++H+ +G+L KG+QLCIP TSLRE ++++ HS GL  H G++KT  ++ +++
Subjt:  ADALSRKS------DSPHCAL-----------NFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALIKEAHSRGLARHFGQEKTFQIIMKKF

Query:  YWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSHFLPYKKTIDAPCLEDIV--------
        YW Q +RD+  FV++C ICQ AKG + N GLYTPLP+P+++WEDL++DF++GLP+TQ  +D V VVVDRFSKM+HF+P KKT DA  + ++         
Subjt:  YWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSHFLPYKKTIDAPCLEDIV--------

Query:  -----------------------EKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA----------------------------PRL
                                KFDT+L++S+TAHPQTD QTEVTNR+LGNLIRC SG+ P+QWD+                             P+ 
Subjt:  -----------------------EKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA----------------------------PRL

Query:  TFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTYGKMKDKQIGPCRILAKYEPNAYKIE
          DL  LPK       AE  A+R   +  EV   + K    YK   +K RR   F   DLVM  L+K RFP+GTY K+K+++ GP R+  K   NAY +E
Subjt:  TFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTYGKMKDKQIGPCRILAKYEPNAYKIE

Query:  LPRGFNISPIFNVADLRSYKAPNE
        LP    IS  FNVADL  Y  P+E
Subjt:  LPRGFNISPIFNVADLRSYKAPNE

A0A6D2HLB5 Reverse transcriptase, e value 3.5e-144, 43.51% identity
Query:  ITKCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPV
        + KC FCT ++ FLGF++ E  +++DE KV A+  W   K+  ++           +F+++FS ITAP+T+CLKKG F+WG +Q  +F  +K KL + PV
Subjt:  ITKCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPV

Query:  LKLPKFDSPFEVAVDASGVGIGAVLSQGGHPVEFFSEKLK----------QELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKMHARWVSY
        L LP FD  F+V  DASGVGIGAVLSQ   PV FFSEKL           QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L +QK INKMHARWVS+
Subjt:  LKLPKFDSPFEVAVDASGVGIGAVLSQGGHPVEFFSEKLK----------QELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKMHARWVSY

Query:  IQKFDFLIKHQACKENRAADALSRKS-----------------DSPHCALNFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALIKEAHSRG
        +QKF F+I+H++   N+ ADALSR++                 +       F E+W  C       D+H+ +GFL KGD+LCIP +SLRE LI++ H  G
Subjt:  IQKFDFLIKHQACKENRAADALSRKS-----------------DSPHCALNFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALIKEAHSRG

Query:  LARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSHFLPYKKT
        L+ H G++KT   + +++YW   RRD    VKRC+ICQ +KG S N GLY PLP+P ++W+DLS+DFV+GLP+TQ  VD V VVVDRFSKM+HF+  KKT
Subjt:  LARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSHFLPYKKT

Query:  IDAPCLEDIV-------------------------------EKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA-------------
         DA  +  +                                  F TTLK S+TAHPQTD QTEVTNR+LGN+IR + G+ P+QWD+A             
Subjt:  IDAPCLEDIV-------------------------------EKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA-------------

Query:  ---------------PRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTYGKMKDKQ
                       P+   DL  LPK   +   AE +AE I      V   +  T +  K   +K+RR   F+  D VM  L+K+RFP+GTY K++  +
Subjt:  ---------------PRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTYGKMKDKQ

Query:  IGPCRILAKYEPNAYKIELPRGFNISPIFNVADLRSYKA
         GP ++L K   NAY ++LP   NIS  FNVAD+  Y A
Subjt:  IGPCRILAKYEPNAYKIELPRGFNISPIFNVADLRSYKA

A0A6D2IKM3 Reverse transcriptase, e value 3.5e-144, 43.51% identity
Query:  ITKCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPV
        + KC FCT ++ FLGF++ E  +++DE KV A+  W   K+  ++           +F+++FS ITAP+T+CLKKG F+WG +Q  +F  +K KL + PV
Subjt:  ITKCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPV

Query:  LKLPKFDSPFEVAVDASGVGIGAVLSQGGHPVEFFSEKLK----------QELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKMHARWVSY
        L LP FD  F+V  DASGVGIGAVLSQ   PV FFSEKL           QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L +QK INKMHARWVS+
Subjt:  LKLPKFDSPFEVAVDASGVGIGAVLSQGGHPVEFFSEKLK----------QELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKMHARWVSY

Query:  IQKFDFLIKHQACKENRAADALSRKS-----------------DSPHCALNFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALIKEAHSRG
        +QKF F+I+H++   N+ ADALSR++                 +       F E+W  C       D+H+ +GFL KGD+LCIP +SLRE LI++ H  G
Subjt:  IQKFDFLIKHQACKENRAADALSRKS-----------------DSPHCALNFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALIKEAHSRG

Query:  LARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSHFLPYKKT
        L+ H G++KT   + +++YW   RRD    VKRC+ICQ +KG S N GLY PLP+P ++W+DLS+DFV+GLP+TQ  VD V VVVDRFSKM+HF+  KKT
Subjt:  LARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSHFLPYKKT

Query:  IDAPCLEDIV-------------------------------EKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA-------------
         DA  +  +                                  F TTLK S+TAHPQTD QTEVTNR+LGN+IR + G+ P+QWD+A             
Subjt:  IDAPCLEDIV-------------------------------EKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA-------------

Query:  ---------------PRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTYGKMKDKQ
                       P+   DL  LPK   +   AE +AE I      V   +  T +  K   +K+RR   F+  D VM  L+K+RFP+GTY K++  +
Subjt:  ---------------PRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTYGKMKDKQ

Query:  IGPCRILAKYEPNAYKIELPRGFNISPIFNVADLRSYKA
         GP ++L K   NAY ++LP   NIS  FNVAD+  Y A
Subjt:  IGPCRILAKYEPNAYKIELPRGFNISPIFNVADLRSYKA

M5W531 Reverse transcriptase, e value 2.0e-147, 44.2% identity
Query:  ITKCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMK-----------ARWKLKFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPV
        + KC FCT ++ FLGF++ EN +++D+ K++A+  W   K           A + ++F+++FS I AP+T+CLKKG F WGE+Q+ +F  +K KL + PV
Subjt:  ITKCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMK-----------ARWKLKFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPV

Query:  LKLPKFDSPFEVAVDASGVGIGAVLSQGGHPVEFFSEKLK----------QELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKMHARWVSY
        L LP F+  FEV  DASGVG+GAVL Q   PV FFSEKL           QE YA+VRALKQWEHYL+ KEFVL TDH +LKY+ +QKNI+KMHARWV++
Subjt:  LKLPKFDSPFEVAVDASGVGIGAVLSQGGHPVEFFSEKLK----------QELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKMHARWVSY

Query:  IQKFDFLIKHQACKENRAADALSRKSD----------SPHCAL-------NFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALIKEAHSRG
        +QKF F+IKH + K NR ADALSR++              C         +F EIW+ CT      DY L EG+L KG+QLCIP +SLRE LI++ H  G
Subjt:  IQKFDFLIKHQACKENRAADALSRKSD----------SPHCAL-------NFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALIKEAHSRG

Query:  LARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSHFLPYKKT
        L+ H G++KT   + ++FYW Q +RD+   V++C+ CQ +KG   N GLY PLP+P ++W+DL++DFV+G P+TQ RVD V VV DRFSKM+HF+  KKT
Subjt:  LARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSHFLPYKKT

Query:  IDAPCLEDIVEK-------------------------------FDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA-PRLTF-------
         DA  +  +  +                               F TTL  S+TAHPQTD QTEVTNR+LGN++R + G  P+QWD A P++ F       
Subjt:  IDAPCLEDIVEK-------------------------------FDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA-PRLTF-------

Query:  --------------------DLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTYGKMKDKQ
                            DL  LP+  +    A+ LAE +  +  EV   + +T   YK   ++ RR   FQ  D VM  L+K+RFP GTY K+K K+
Subjt:  --------------------DLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTYGKMKDKQ

Query:  IGPCRILAKYEPNAYKIELPRGFNISPIFNVADLRSYK
         GP ++L +   NAY IELP    IS IFNVADL  ++
Subjt:  IGPCRILAKYEPNAYKIELPRGFNISPIFNVADLRSYK

M5WCC7 Reverse transcriptase, e value 3.4e-147, 43.72% identity
Query:  DSRAINKITKCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMK-----------ARWKLKFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKR
        +++    + KC FCT ++ FLGF++ E+ +++D+ K++A+  W   K           A +  +F+++FS I AP+T+CLKKG F WGE+Q+ +F  +K 
Subjt:  DSRAINKITKCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMK-----------ARWKLKFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKR

Query:  KLVSQPVLKLPKFDSPFEVAVDASGVGIGAVLSQGGHPVEFFSEKLK----------QELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKM
        KL + PVL LP F+  FEV  DASGVG+GAVLSQ   PV FFSEKL           QE YA+VRALKQWEHYL+ KEFVL TDH +LKY+ +QKNI+KM
Subjt:  KLVSQPVLKLPKFDSPFEVAVDASGVGIGAVLSQGGHPVEFFSEKLK----------QELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNINKM

Query:  HARWVSYIQKFDFLIKHQACKENRAADALSRKSD----------SPHCA-------LNFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALI
        HARWV+++QKF F+IKH + K NR ADALSR++              C         +F EIW+ CT      DY L EG+L KG+QLCIP +SLRE LI
Subjt:  HARWVSYIQKFDFLIKHQACKENRAADALSRKSD----------SPHCA-------LNFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALI

Query:  KEAHSRGLARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSH
        ++ H  GL+ H G++KT   + ++FYW Q +RD+   V++C+ CQ +KG   N GLY PLP+P ++W+DL++DFV+GLP+TQ  VD V VVVDRFSKM+H
Subjt:  KEAHSRGLARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSH

Query:  FLPYKKTIDAPCLEDIVEK-------------------------------FDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA------
        F+  +KT DA  +  +  +                               F TTL  S+TAHPQTD QTEVTNR+LGN++R + G  P+QWD A      
Subjt:  FLPYKKTIDAPCLEDIVEK-------------------------------FDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMA------

Query:  ----------------------PRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTY
                              P    DL  LP+  +    A+ LAE +  +  EV   + +T   YK   +K RR   FQ  D VM  L+K+RFP+GTY
Subjt:  ----------------------PRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFPLGTY

Query:  GKMKDKQIGPCRILAKYEPNAYKIELPRGFNISPIFNVADLRSYK
         K+K K+ GP ++L +   NAY IELP    IS IFNVADL  ++
Subjt:  GKMKDKQIGPCRILAKYEPNAYKIELPRGFNISPIFNVADLRSYK

SwissProt top hits: e value, % identity, alignment
P0CT34 Transposon Tf2-1 polyprotein, e value 8.2e-58, 28.38% identity
Query:  KCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGA-FYWGEKQQHNFDSLKRKLVSQPVL
        KC F   ++ F+G+ ISE      +  ++ V +W   K R +L           KFI   S +T P+ + LKK   + W   Q    +++K+ LVS PVL
Subjt:  KCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGA-FYWGEKQQHNFDSLKRKLVSQPVL

Query:  KLPKFDSPFEVAVDASGVGIGAVLSQGG-----HPVEFFSEKLK----------QELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNINK
        +   F     +  DAS V +GAVLSQ       +PV ++S K+           +E+ A++++LK W HYL S  + F +LTDH +L  +     +  NK
Subjt:  KLPKFDSPFEVAVDASGVGIGAVLSQGG-----HPVEFFSEKLK----------QELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNINK

Query:  MHARWVSYIQKFDFLIKHQACKENRAADALSR--------KSDSPHCALNF-----------SEIWSHCTVHIH------------DQDYHLVEGFLVKG
          ARW  ++Q F+F I ++    N  ADALSR          DS   ++NF           +++ +  T                +++  L +G L+  
Subjt:  MHARWVSYIQKFDFLIKHQACKENRAADALSR--------KSDSPHCALNF-----------SEIWSHCTVHIH------------DQDYHLVEGFLVKG

Query:  -DQLCIPH-TSLREALIKEAHSRGLARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSN-VGLYTPLPIPKNMWEDLSIDFVVGLPKTQ
         DQ+ +P+ T L   +IK+ H  G   H G E    II+++F W   R+ I  +V+ CH CQ  K  +    G   P+P  +  WE LS+DF+  LP++ 
Subjt:  -DQLCIPH-TSLREALIKEAHSRGLARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSN-VGLYTPLPIPKNMWEDLSIDFVVGLPKTQ

Query:  MRVDLVMVVVDRFSKMSHFLPYKKTIDA-------------------------------PCLEDIVEKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRC
           + + VVVDRFSKM+  +P  K+I A                                  +D   K++  +KFS    PQTD QTE TN+++  L+RC
Subjt:  MRVDLVMVVVDRFSKMSHFLPYKKTIDA-------------------------------PCLEDIVEKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRC

Query:  LSGNHPRQW------------------------DMAPRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDV-HFQVRDL
        +   HP  W                        ++  R +  L+ L       +  E   E IQ   T V +++       K+  + K +++  FQ  DL
Subjt:  LSGNHPRQW------------------------DMAPRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDV-HFQVRDL

Query:  VMAHLKKKRFPLGTYGKMKDKQIGPCRILAKYEPNAYKIELPRGFN--ISPIFNVADLRSYKAPNE
        VM    K  F L    K+     GP  +L K  PN Y+++LP       S  F+V+ L  Y+  +E
Subjt:  VMAHLKKKRFPLGTYGKMKDKQIGPCRILAKYEPNAYKIELPRGFN--ISPIFNVADLRSYKAPNE

P0CT35 Transposon Tf2-2 polyprotein, e value 8.2e-58, 28.38% identity
Query:  KCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGA-FYWGEKQQHNFDSLKRKLVSQPVL
        KC F   ++ F+G+ ISE      +  ++ V +W   K R +L           KFI   S +T P+ + LKK   + W   Q    +++K+ LVS PVL
Subjt:  KCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGA-FYWGEKQQHNFDSLKRKLVSQPVL

Query:  KLPKFDSPFEVAVDASGVGIGAVLSQGG-----HPVEFFSEKLK----------QELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNINK
        +   F     +  DAS V +GAVLSQ       +PV ++S K+           +E+ A++++LK W HYL S  + F +LTDH +L  +     +  NK
Subjt:  KLPKFDSPFEVAVDASGVGIGAVLSQGG-----HPVEFFSEKLK----------QELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNINK

Query:  MHARWVSYIQKFDFLIKHQACKENRAADALSR--------KSDSPHCALNF-----------SEIWSHCTVHIH------------DQDYHLVEGFLVKG
          ARW  ++Q F+F I ++    N  ADALSR          DS   ++NF           +++ +  T                +++  L +G L+  
Subjt:  MHARWVSYIQKFDFLIKHQACKENRAADALSR--------KSDSPHCALNF-----------SEIWSHCTVHIH------------DQDYHLVEGFLVKG

Query:  -DQLCIPH-TSLREALIKEAHSRGLARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSN-VGLYTPLPIPKNMWEDLSIDFVVGLPKTQ
         DQ+ +P+ T L   +IK+ H  G   H G E    II+++F W   R+ I  +V+ CH CQ  K  +    G   P+P  +  WE LS+DF+  LP++ 
Subjt:  -DQLCIPH-TSLREALIKEAHSRGLARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSN-VGLYTPLPIPKNMWEDLSIDFVVGLPKTQ

Query:  MRVDLVMVVVDRFSKMSHFLPYKKTIDA-------------------------------PCLEDIVEKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRC
           + + VVVDRFSKM+  +P  K+I A                                  +D   K++  +KFS    PQTD QTE TN+++  L+RC
Subjt:  MRVDLVMVVVDRFSKMSHFLPYKKTIDA-------------------------------PCLEDIVEKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRC

Query:  LSGNHPRQW------------------------DMAPRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDV-HFQVRDL
        +   HP  W                        ++  R +  L+ L       +  E   E IQ   T V +++       K+  + K +++  FQ  DL
Subjt:  LSGNHPRQW------------------------DMAPRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDV-HFQVRDL

Query:  VMAHLKKKRFPLGTYGKMKDKQIGPCRILAKYEPNAYKIELPRGFN--ISPIFNVADLRSYKAPNE
        VM    K  F L    K+     GP  +L K  PN Y+++LP       S  F+V+ L  Y+  +E
Subjt:  VMAHLKKKRFPLGTYGKMKDKQIGPCRILAKYEPNAYKIELPRGFN--ISPIFNVADLRSYKAPNE

P0CT36 Transposon Tf2-3 polyprotein, e value 8.2e-58, 28.38% identity
Query:  KCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGA-FYWGEKQQHNFDSLKRKLVSQPVL
        KC F   ++ F+G+ ISE      +  ++ V +W   K R +L           KFI   S +T P+ + LKK   + W   Q    +++K+ LVS PVL
Subjt:  KCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGA-FYWGEKQQHNFDSLKRKLVSQPVL

Query:  KLPKFDSPFEVAVDASGVGIGAVLSQGG-----HPVEFFSEKLK----------QELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNINK
        +   F     +  DAS V +GAVLSQ       +PV ++S K+           +E+ A++++LK W HYL S  + F +LTDH +L  +     +  NK
Subjt:  KLPKFDSPFEVAVDASGVGIGAVLSQGG-----HPVEFFSEKLK----------QELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNINK

Query:  MHARWVSYIQKFDFLIKHQACKENRAADALSR--------KSDSPHCALNF-----------SEIWSHCTVHIH------------DQDYHLVEGFLVKG
          ARW  ++Q F+F I ++    N  ADALSR          DS   ++NF           +++ +  T                +++  L +G L+  
Subjt:  MHARWVSYIQKFDFLIKHQACKENRAADALSR--------KSDSPHCALNF-----------SEIWSHCTVHIH------------DQDYHLVEGFLVKG

Query:  -DQLCIPH-TSLREALIKEAHSRGLARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSN-VGLYTPLPIPKNMWEDLSIDFVVGLPKTQ
         DQ+ +P+ T L   +IK+ H  G   H G E    II+++F W   R+ I  +V+ CH CQ  K  +    G   P+P  +  WE LS+DF+  LP++ 
Subjt:  -DQLCIPH-TSLREALIKEAHSRGLARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSN-VGLYTPLPIPKNMWEDLSIDFVVGLPKTQ

Query:  MRVDLVMVVVDRFSKMSHFLPYKKTIDA-------------------------------PCLEDIVEKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRC
           + + VVVDRFSKM+  +P  K+I A                                  +D   K++  +KFS    PQTD QTE TN+++  L+RC
Subjt:  MRVDLVMVVVDRFSKMSHFLPYKKTIDA-------------------------------PCLEDIVEKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRC

Query:  LSGNHPRQW------------------------DMAPRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDV-HFQVRDL
        +   HP  W                        ++  R +  L+ L       +  E   E IQ   T V +++       K+  + K +++  FQ  DL
Subjt:  LSGNHPRQW------------------------DMAPRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDV-HFQVRDL

Query:  VMAHLKKKRFPLGTYGKMKDKQIGPCRILAKYEPNAYKIELPRGFN--ISPIFNVADLRSYKAPNE
        VM    K  F L    K+     GP  +L K  PN Y+++LP       S  F+V+ L  Y+  +E
Subjt:  VMAHLKKKRFPLGTYGKMKDKQIGPCRILAKYEPNAYKIELPRGFN--ISPIFNVADLRSYKAPNE

P0CT41 Transposon Tf2-12 polyprotein, e value 8.2e-58, 28.38% identity
Query:  KCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGA-FYWGEKQQHNFDSLKRKLVSQPVL
        KC F   ++ F+G+ ISE      +  ++ V +W   K R +L           KFI   S +T P+ + LKK   + W   Q    +++K+ LVS PVL
Subjt:  KCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGA-FYWGEKQQHNFDSLKRKLVSQPVL

Query:  KLPKFDSPFEVAVDASGVGIGAVLSQGG-----HPVEFFSEKLK----------QELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNINK
        +   F     +  DAS V +GAVLSQ       +PV ++S K+           +E+ A++++LK W HYL S  + F +LTDH +L  +     +  NK
Subjt:  KLPKFDSPFEVAVDASGVGIGAVLSQGG-----HPVEFFSEKLK----------QELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNINK

Query:  MHARWVSYIQKFDFLIKHQACKENRAADALSR--------KSDSPHCALNF-----------SEIWSHCTVHIH------------DQDYHLVEGFLVKG
          ARW  ++Q F+F I ++    N  ADALSR          DS   ++NF           +++ +  T                +++  L +G L+  
Subjt:  MHARWVSYIQKFDFLIKHQACKENRAADALSR--------KSDSPHCALNF-----------SEIWSHCTVHIH------------DQDYHLVEGFLVKG

Query:  -DQLCIPH-TSLREALIKEAHSRGLARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSN-VGLYTPLPIPKNMWEDLSIDFVVGLPKTQ
         DQ+ +P+ T L   +IK+ H  G   H G E    II+++F W   R+ I  +V+ CH CQ  K  +    G   P+P  +  WE LS+DF+  LP++ 
Subjt:  -DQLCIPH-TSLREALIKEAHSRGLARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSN-VGLYTPLPIPKNMWEDLSIDFVVGLPKTQ

Query:  MRVDLVMVVVDRFSKMSHFLPYKKTIDA-------------------------------PCLEDIVEKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRC
           + + VVVDRFSKM+  +P  K+I A                                  +D   K++  +KFS    PQTD QTE TN+++  L+RC
Subjt:  MRVDLVMVVVDRFSKMSHFLPYKKTIDA-------------------------------PCLEDIVEKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRC

Query:  LSGNHPRQW------------------------DMAPRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDV-HFQVRDL
        +   HP  W                        ++  R +  L+ L       +  E   E IQ   T V +++       K+  + K +++  FQ  DL
Subjt:  LSGNHPRQW------------------------DMAPRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDV-HFQVRDL

Query:  VMAHLKKKRFPLGTYGKMKDKQIGPCRILAKYEPNAYKIELPRGFN--ISPIFNVADLRSYKAPNE
        VM    K  F L    K+     GP  +L K  PN Y+++LP       S  F+V+ L  Y+  +E
Subjt:  VMAHLKKKRFPLGTYGKMKDKQIGPCRILAKYEPNAYKIELPRGFN--ISPIFNVADLRSYKAPNE

Q9UR07 Transposon Tf2-11 polyprotein (e value: 8.2e-58, %identity: 28.38)
Query:  KCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGA-FYWGEKQQHNFDSLKRKLVSQPVL
        KC F   ++ F+G+ ISE      +  ++ V +W   K R +L           KFI   S +T P+ + LKK   + W   Q    +++K+ LVS PVL
Subjt:  KCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGA-FYWGEKQQHNFDSLKRKLVSQPVL

Query:  KLPKFDSPFEVAVDASGVGIGAVLSQGG-----HPVEFFSEKLK----------QELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNINK
        +   F     +  DAS V +GAVLSQ       +PV ++S K+           +E+ A++++LK W HYL S  + F +LTDH +L  +     +  NK
Subjt:  KLPKFDSPFEVAVDASGVGIGAVLSQGG-----HPVEFFSEKLK----------QELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNINK

Query:  MHARWVSYIQKFDFLIKHQACKENRAADALSR--------KSDSPHCALNF-----------SEIWSHCTVHIH------------DQDYHLVEGFLVKG
          ARW  ++Q F+F I ++    N  ADALSR          DS   ++NF           +++ +  T                +++  L +G L+  
Subjt:  MHARWVSYIQKFDFLIKHQACKENRAADALSR--------KSDSPHCALNF-----------SEIWSHCTVHIH------------DQDYHLVEGFLVKG

Query:  -DQLCIPH-TSLREALIKEAHSRGLARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSN-VGLYTPLPIPKNMWEDLSIDFVVGLPKTQ
         DQ+ +P+ T L   +IK+ H  G   H G E    II+++F W   R+ I  +V+ CH CQ  K  +    G   P+P  +  WE LS+DF+  LP++ 
Subjt:  -DQLCIPH-TSLREALIKEAHSRGLARHFGQEKTFQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSN-VGLYTPLPIPKNMWEDLSIDFVVGLPKTQ

Query:  MRVDLVMVVVDRFSKMSHFLPYKKTIDA-------------------------------PCLEDIVEKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRC
           + + VVVDRFSKM+  +P  K+I A                                  +D   K++  +KFS    PQTD QTE TN+++  L+RC
Subjt:  MRVDLVMVVVDRFSKMSHFLPYKKTIDA-------------------------------PCLEDIVEKFDTTLKFSTTAHPQTDRQTEVTNRSLGNLIRC

Query:  LSGNHPRQW------------------------DMAPRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDV-HFQVRDL
        +   HP  W                        ++  R +  L+ L       +  E   E IQ   T V +++       K+  + K +++  FQ  DL
Subjt:  LSGNHPRQW------------------------DMAPRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDV-HFQVRDL

Query:  VMAHLKKKRFPLGTYGKMKDKQIGPCRILAKYEPNAYKIELPRGFN--ISPIFNVADLRSYKAPNE
        VM    K  F L    K+     GP  +L K  PN Y+++LP       S  F+V+ L  Y+  +E
Subjt:  VMAHLKKKRFPLGTYGKMKDKQIGPCRILAKYEPNAYKIELPRGFN--ISPIFNVADLRSYKAPNE

Arabidopsis top hits (e value, %identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein (e value: 3.3e-09, %identity: 34.21)
Query:  KCIFCTEEISFLG--FIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPV
        KC F   +I++LG   IIS   V  D +K+EA+  W   K   +L           +F+KN+  I  P+T+ LKK +  W E     F +LK  + + PV
Subjt:  KCIFCTEEISFLG--FIISENQVKMDESKVEAVTKWSWMKARWKL-----------KFIKNFSFITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPV

Query:  LKLPKFDSPFEVAV
        L LP    PF   V
Subjt:  LKLPKFDSPFEVAV


Sequences
CDS sequence
ATGAGTCCTAAGGAGTATGAGTTCTTACACCAACACATTGAAGATTTTTTGACAAAAGGGCATATTCAGCCGAACATCAGTCCTTGTGAACATCGGTCCTTGTGCCGTAC
CAGCTCTATTAACTCCAAAGAAGGATGGCAGAGAATGTGTGTAGACAGTCGAGCAATCAACAAAATCACGAAATGTATTTTCTGCACGGAAGAAATATCTTTCCTAGGCT
TTATTATATCTGAAAATCAAGTGAAGATGGATGAAAGCAAGGTGGAAGCTGTAACTAAATGGTCATGGATGAAAGCAAGGTGGAAGCTAAAGTTTATTAAGAACTTTAGT
TTTATAACAGCTCCTATGACTGATTGTTTGAAGAAGGGAGCCTTTTATTGGGGAGAAAAACAGCAGCACAATTTTGATTCCCTCAAAAGAAAGCTTGTCAGCCAACCAGT
CCTCAAATTACCAAAGTTTGACAGCCCTTTTGAAGTAGCAGTAGACGCCAGTGGAGTTGGAATTGGTGCCGTCCTTTCCCAAGGAGGTCATCCGGTCGAATTCTTTAGTG
AAAAGTTGAAGCAGGAACTTTATGCACTTGTTAGGGCCTTAAAACAGTGGGAGCACTACTTATTATCTAAGGAGTTTGTGTTGCTCACTGATCATTTCTCCTTGAAGTAT
CTCCAAGCACAAAAAAATATTAACAAGATGCATGCTAGATGGGTATCTTATATCCAAAAATTTGATTTCTTAATCAAGCATCAAGCGTGCAAAGAAAACAGAGCTGCAGA
TGCCTTAAGTAGAAAAAGTGACTCTCCTCATTGTGCTCTCAACTTCAGTGAGATTTGGAGTCATTGTACTGTTCATATCCATGATCAAGATTATCATTTGGTGGAAGGTT
TTCTCGTTAAAGGAGACCAACTATGCATTCCACATACTTCCTTAAGGGAAGCACTAATAAAAGAAGCTCATTCGAGAGGCCTAGCTAGACACTTTGGCCAAGAAAAGACT
TTTCAAATTATCATGAAGAAGTTTTATTGGTCTCAAGCTAGAAGAGACATTAATAACTTTGTGAAAAGATGTCATATTTGCCAAAGAGCAAAAGGATCTTCATCTAATGT
CGGCCTCTACACTCCTCTACCGATTCCTAAGAACATGTGGGAGGATTTGTCAATTGATTTCGTAGTGGGTCTACCTAAGACTCAAATGAGAGTTGACTTGGTTATGGTTG
TGGTGGACAGATTTAGCAAGATGTCTCACTTCTTGCCTTATAAAAAAACTATAGACGCACCATGTTTGGAAGACATTGTGGAGAAGTTTGACACCACCTTGAAGTTTAGT
ACCACTGCTCACCCACAAACGGATAGACAAACTGAGGTAACTAATCGGTCCTTGGGAAATCTAATTCGCTGCCTTAGTGGAAACCACCCTAGACAATGGGACATGGCACC
AAGGTTGACATTCGACCTAACTAGCCTTCCTAAAGAAGTGAAAATCCAAGAGGAAGCTGAACAGTTAGCTGAAAGGATACAAAAGCTTCACACAGAAGTCATTGACTATA
TCACTAAAACTACTGAGTCTTACAAAGAAGAGAAGAATAAGAAGCGAAGGGATGTACATTTCCAAGTTAGAGATCTTGTAATGGCACATTTGAAGAAGAAGAGGTTCCCC
TTAGGAACCTATGGCAAGATGAAGGACAAGCAGATTGGCCCATGTAGAATACTTGCTAAATATGAGCCTAATGCTTACAAAATTGAACTACCGCGCGGATTTAACATTAG
CCCAATCTTCAATGTAGCAGACCTTAGAAGCTACAAGGCACCGAATGAATTTCAACTTCCATAG
mRNA sequence
AAGAAATTTCTAGAAACCAGAGAAGCTGATATTTTGGGACTTGTTATTTCTGGCCCTCATACTACAAAAATTTCCAGCTTGGTTCCACCAAAAGTGCAAGACCTATTGAG
CCAATTTCATCCTATAATGGAGGAGCCCTCTGAATTGCCTCCTCTTCGAGACATCTAGCACATTATAGATTTGATACCCAACTCATCTCTTCCCAACCTTCCTCATTACA
GAATGAGTCCTAAGGAGTATGAGTTCTTACACCAACACATTGAAGATTTTTTGACAAAAGGGCATATTCAGCCGAACATCAGTCCTTGTGAACATCGGTCCTTGTGCCGT
ACCAGCTCTATTAACTCCAAAGAAGGATGGCAGAGAATGTGTGTAGACAGTCGAGCAATCAACAAAATCACGAAATGTATTTTCTGCACGGAAGAAATATCTTTCCTAGG
CTTTATTATATCTGAAAATCAAGTGAAGATGGATGAAAGCAAGGTGGAAGCTGTAACTAAATGGTCATGGATGAAAGCAAGGTGGAAGCTAAAGTTTATTAAGAACTTTA
GTTTTATAACAGCTCCTATGACTGATTGTTTGAAGAAGGGAGCCTTTTATTGGGGAGAAAAACAGCAGCACAATTTTGATTCCCTCAAAAGAAAGCTTGTCAGCCAACCA
GTCCTCAAATTACCAAAGTTTGACAGCCCTTTTGAAGTAGCAGTAGACGCCAGTGGAGTTGGAATTGGTGCCGTCCTTTCCCAAGGAGGTCATCCGGTCGAATTCTTTAG
TGAAAAGTTGAAGCAGGAACTTTATGCACTTGTTAGGGCCTTAAAACAGTGGGAGCACTACTTATTATCTAAGGAGTTTGTGTTGCTCACTGATCATTTCTCCTTGAAGT
ATCTCCAAGCACAAAAAAATATTAACAAGATGCATGCTAGATGGGTATCTTATATCCAAAAATTTGATTTCTTAATCAAGCATCAAGCGTGCAAAGAAAACAGAGCTGCA
GATGCCTTAAGTAGAAAAAGTGACTCTCCTCATTGTGCTCTCAACTTCAGTGAGATTTGGAGTCATTGTACTGTTCATATCCATGATCAAGATTATCATTTGGTGGAAGG
TTTTCTCGTTAAAGGAGACCAACTATGCATTCCACATACTTCCTTAAGGGAAGCACTAATAAAAGAAGCTCATTCGAGAGGCCTAGCTAGACACTTTGGCCAAGAAAAGA
CTTTTCAAATTATCATGAAGAAGTTTTATTGGTCTCAAGCTAGAAGAGACATTAATAACTTTGTGAAAAGATGTCATATTTGCCAAAGAGCAAAAGGATCTTCATCTAAT
GTCGGCCTCTACACTCCTCTACCGATTCCTAAGAACATGTGGGAGGATTTGTCAATTGATTTCGTAGTGGGTCTACCTAAGACTCAAATGAGAGTTGACTTGGTTATGGT
TGTGGTGGACAGATTTAGCAAGATGTCTCACTTCTTGCCTTATAAAAAAACTATAGACGCACCATGTTTGGAAGACATTGTGGAGAAGTTTGACACCACCTTGAAGTTTA
GTACCACTGCTCACCCACAAACGGATAGACAAACTGAGGTAACTAATCGGTCCTTGGGAAATCTAATTCGCTGCCTTAGTGGAAACCACCCTAGACAATGGGACATGGCA
CCAAGGTTGACATTCGACCTAACTAGCCTTCCTAAAGAAGTGAAAATCCAAGAGGAAGCTGAACAGTTAGCTGAAAGGATACAAAAGCTTCACACAGAAGTCATTGACTA
TATCACTAAAACTACTGAGTCTTACAAAGAAGAGAAGAATAAGAAGCGAAGGGATGTACATTTCCAAGTTAGAGATCTTGTAATGGCACATTTGAAGAAGAAGAGGTTCC
CCTTAGGAACCTATGGCAAGATGAAGGACAAGCAGATTGGCCCATGTAGAATACTTGCTAAATATGAGCCTAATGCTTACAAAATTGAACTACCGCGCGGATTTAACATT
AGCCCAATCTTCAATGTAGCAGACCTTAGAAGCTACAAGGCACCGAATGAATTTCAACTTCCATAGCAAACTCGGGG
Protein sequence
MSPKEYEFLHQHIEDFLTKGHIQPNISPCEHRSLCRTSSINSKEGWQRMCVDSRAINKITKCIFCTEEISFLGFIISENQVKMDESKVEAVTKWSWMKARWKLKFIKNFS
FITAPMTDCLKKGAFYWGEKQQHNFDSLKRKLVSQPVLKLPKFDSPFEVAVDASGVGIGAVLSQGGHPVEFFSEKLKQELYALVRALKQWEHYLLSKEFVLLTDHFSLKY
LQAQKNINKMHARWVSYIQKFDFLIKHQACKENRAADALSRKSDSPHCALNFSEIWSHCTVHIHDQDYHLVEGFLVKGDQLCIPHTSLREALIKEAHSRGLARHFGQEKT
FQIIMKKFYWSQARRDINNFVKRCHICQRAKGSSSNVGLYTPLPIPKNMWEDLSIDFVVGLPKTQMRVDLVMVVVDRFSKMSHFLPYKKTIDAPCLEDIVEKFDTTLKFS
TTAHPQTDRQTEVTNRSLGNLIRCLSGNHPRQWDMAPRLTFDLTSLPKEVKIQEEAEQLAERIQKLHTEVIDYITKTTESYKEEKNKKRRDVHFQVRDLVMAHLKKKRFP
LGTYGKMKDKQIGPCRILAKYEPNAYKIELPRGFNISPIFNVADLRSYKAPNEFQLP