CuGenDBv2

CSPI05G15080 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI05G15080
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Chr5:15843379..15845377
RNA-Seq Expression: CSPI05G15080
Synteny: CSPI05G15080
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits (hit | e value | % identity, each followed by its alignment)
KAG8480261.1 hypothetical protein CXB51_024850 [Gossypium anomalum] | e value: 7.4e-63 | % identity: 34.2
Query:  KALLGQQKTHKDKLGDENEAFVLLNSLSEAYKEVKNALKYGRDSIKTDVIISALRTRELEIHSSHKENHSGDGLFVKDALASTLDQANHVNPLGK-----
        K +L   +  + +   E+   +LL SL  +Y   ++ + Y R+S+  D +  +L + +   H   K +  G+GL V+       D        G+     
Subjt:  KALLGQQKTHKDKLGDENEAFVLLNSLSEAYKEVKNALKYGRDSIKTDVIISALRTRELEIHSSHKENHSGDGLFVKDALASTLDQANHVNPLGK-----

Query:  -------HDWVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTMKLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQ
                +W+LDSGCT+HM+P R WF TY  +S   V MGNN    IAG+GT+ +K+ DG V+ L +VR+VP LK NLISL  LDS G  Y  + GV +
Subjt:  -------HDWVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTMKLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQ

Query:  VFMGSKLVLVGE-KVNDLFIIKGVEMIEEANTVLSLNLTEADI---WHKRLSHISQKGLEALSKQDILP-QDICSKL---------QSKKTKLHQSTTHN
        +  GS +V+ G+ K   L++++G  +  +A  V S +L++ DI   WH RL H+S+ G+  LSK+ +L  Q IC KL         + K+ +  +   + 
Subjt:  VFMGSKLVLVGE-KVNDLFIIKGVEMIEEANTVLSLNLTEADI---WHKRLSHISQKGLEALSKQDILP-QDICSKL---------QSKKTKLHQSTTHN

Query:  KRNPGLHPFGSMGSASTPSLSSS--------------------------------------RESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNA
        K           G +  PS   +                                      +  GI R+ TV +TPQQNGVAER+NRTIME+VRC+LSNA
Subjt:  KRNPGLHPFGSMGSASTPSLSSS--------------------------------------RESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNA

Query:  ILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGYADQNKG
         L + FW E A+   + +NRSP  ++   TP+E WS +P + +DLK+FGC  YA  N G
Subjt:  ILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGYADQNKG

RVW99173.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e value: 1.6e-62 | % identity: 35.09
Query:  LGDENEAFVLLNSLSEAYKEVKNALKYGRDSI-----KTDVIISALRTRELEIHSSHKENHSGDGLFVKD-------ALASTLDQANHVNPLGKHD----
        + DE++A +LL SL   Y  +K+A+ YGRDS+     K     S  +T++ +    HKE H     F KD        +  T+++ +    L  +D    
Subjt:  LGDENEAFVLLNSLSEAYKEVKNALKYGRDSI-----KTDVIISALRTRELEIHSSHKENHSGDGLFVKD-------ALASTLDQANHVNPLGKHD----

Query:  -----------WVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTMKLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGV
                   W+LDSGC++HM P +AWF  ++E     V +GNN    I G GTV +K  DG  ++L +VR++P LK NLISLGMLD  G  +K +   
Subjt:  -----------WVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTMKLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGV

Query:  FQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEADIWHKRLSHISQKGLEALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLHPFGSM
         +V  GS  V+     N L+ + G  +I++A+TVL  ++    +WH+RL H+S KGL+ L KQ +L     + L   +  +    T  K    +H   + 
Subjt:  FQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEADIWHKRLSHISQKGLEALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLHPFGSM

Query:  ---------GSASTPSL------------------------SSSRESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYT
                 G +  PS+                        S  ++ GI  H+TV YTPQQNG+AER+NRTI+ER+RC+LS++ L + FW E A  VV+ 
Subjt:  ---------GSASTPSL------------------------SSSRESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYT

Query:  LNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGY
        +NRSP ++L F TP+EKW+    +   LKVFGC  Y
Subjt:  LNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGY

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] | e value: 1.6e-62 | % identity: 32
Query:  DKLGDENEAFVLLNSLSEAYKEVKNALKYGRDSIKTDVIISALRTRELEIHSS---------------------------------------HKENHSGD
        +K+ DEN+A +LLNSL E Y+EVK A+KYGRDS+   +++ AL+TR LEI                                          HKE H   
Subjt:  DKLGDENEAFVLLNSLSEAYKEVKNALKYGRDSIKTDVIISALRTRELEIHSS---------------------------------------HKENHSGD

Query:  GLFVKDALASTLDQANHVNPLGKHD----------------------------WVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTM
           +  +  ++  +AN  +     +                            W++DSGCT+HMTP R +   ++++    V +G+N   ++ G G+V +
Subjt:  GLFVKDALASTLDQANHVNPLGKHD----------------------------WVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTM

Query:  KLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEAD-IWHKRLSHISQKGL
           DG V++L NVR+VP LK NLISLG LD  GC  K + GV +V  GS + L G   + L++++G  +   A  + S  +T+   +WHKRL+H+S++GL
Subjt:  KLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEAD-IWHKRLSHISQKGL

Query:  EALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLHPFGSM---------GSASTPSLSSSR------------------------------------
        +ALS+Q +L      +L   +  +   +T  K   G H    +         G     S+  SR                                    
Subjt:  EALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLHPFGSM---------GSASTPSLSSSR------------------------------------

Query:  ----------------------------ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEK
                                      GITRH TVTYTPQQNG+AER NRTIMER RCLL+NA L  KFW E A    Y +NRSP T+L   TP+E 
Subjt:  ----------------------------ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEK

Query:  WSKHPPSLNDLKVFGCVGYADQNKG
        W+   PSL  L+VFGC  YA    G
Subjt:  WSKHPPSLNDLKVFGCVGYADQNKG

XP_038880322.1 uncharacterized protein LOC120071961 [Benincasa hispida] | e value: 9.3e-74 | % identity: 42.89
Query:  KIKALLGQQKTHKDKLGDENEAFVLLNSLSEAYKEVKNALKYGRDSIKTDVIISALRTRELEIHSSHKENHSGDGLFVKDALASTLDQANHVNPLGKHDW
        + K ++ + K+  +K+GDENEAFVLLNSL E YKEVKNALKYGR+S+  D IISALRTRELE+ S  KE    +G                         
Subjt:  KIKALLGQQKTHKDKLGDENEAFVLLNSLSEAYKEVKNALKYGRDSIKTDVIISALRTRELEIHSSHKENHSGDGLFVKDALASTLDQANHVNPLGKHDW

Query:  VLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTMKLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQVFMGSKLVLV
                     + WF+TY+++  ESV+MGNN    I G+G+                                       +GK G+ +V   SK+VLV
Subjt:  VLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTMKLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQVFMGSKLVLV

Query:  GEKVNDLFIIKGVEMIEEANTVLSLNLTEADIWHKRLSHISQKGLEALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLHPFGSMGSASTPSLSSSR
        GE+VNDL+I++GVEM+  A TV + +LTEAD+WHKRLSHIS+KG      +  L ++ C   ++ +    +S    K       F   G + TPSLS SR
Subjt:  GEKVNDLFIIKGVEMIEEANTVLSLNLTEADIWHKRLSHISQKGLEALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLHPFGSMGSASTPSLSSSR

Query:  -----------------ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKWSKHPPSLNDL
                         E GI+R +TV YTPQQNGVAERLN TIME VRCLLS+AIL E +W E  AY+VYTLNR PHTSL  LTPEE W+ HPP+L++L
Subjt:  -----------------ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKWSKHPPSLNDL

Query:  K
        K
Subjt:  K

XP_038904504.1 uncharacterized protein LOC120090876 [Benincasa hispida] | e value: 7.9e-73 | % identity: 39.66
Query:  MAITRVEIEKFDGNGDFVLWKAKIKALLGQQKTHK---------------------------------------DKLGDENEAFVLLNSLSEAYKEVKNA
        M++ R E++KFD  GDF LWKAKIKA+L QQK  +                                       +KLGDENEA+VLLNSL E Y+E+KNA
Subjt:  MAITRVEIEKFDGNGDFVLWKAKIKALLGQQKTHK---------------------------------------DKLGDENEAFVLLNSLSEAYKEVKNA

Query:  LKYGRDSIKTDVIISALRTRELEIHSSHKENHSGDGLFVKDALASTLDQANHVNPLGKHDWVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIA
        LKY RD+I TD IISALRTREL   +  K+  SG+G                                                                
Subjt:  LKYGRDSIKTDVIISALRTRELEIHSSHKENHSGDGLFVKDALASTLDQANHVNPLGKHDWVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIA

Query:  GIGTVTMKLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEADIWHKRLSH
                          NVR+VP LK NLISLGMLD++GCEYKG                                  A    +  +TEAD+WHKRLSH
Subjt:  GIGTVTMKLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEADIWHKRLSH

Query:  ISQKGLEALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNP------GLHPFGSMGSASTPSLSSSRESGITRHKTVTYTPQQNGVAERLNRTIMERVRC
        IS KGLE LSKQ ILP+   ++    K    Q TT    +       GL P  S+  +S       +E+ ITRH+T+ +T Q+NGVAERLNRTIMERVRC
Subjt:  ISQKGLEALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNP------GLHPFGSMGSASTPSLSSSRESGITRHKTVTYTPQQNGVAERLNRTIMERVRC

Query:  LLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGYADQNKG
        LLS+A L EK+W E A+Y ++TLNR PH+SL  LT EEKW+KHPP+L +L+VFGCVGY  Q++G
Subjt:  LLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGYADQNKG

TrEMBL top hits (hit | e value | % identity, each followed by its alignment)
A0A438IR25 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 8.0e-63 | % identity: 35.09
Query:  LGDENEAFVLLNSLSEAYKEVKNALKYGRDSI-----KTDVIISALRTRELEIHSSHKENHSGDGLFVKD-------ALASTLDQANHVNPLGKHD----
        + DE++A +LL SL   Y  +K+A+ YGRDS+     K     S  +T++ +    HKE H     F KD        +  T+++ +    L  +D    
Subjt:  LGDENEAFVLLNSLSEAYKEVKNALKYGRDSI-----KTDVIISALRTRELEIHSSHKENHSGDGLFVKD-------ALASTLDQANHVNPLGKHD----

Query:  -----------WVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTMKLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGV
                   W+LDSGC++HM P +AWF  ++E     V +GNN    I G GTV +K  DG  ++L +VR++P LK NLISLGMLD  G  +K +   
Subjt:  -----------WVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTMKLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGV

Query:  FQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEADIWHKRLSHISQKGLEALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLHPFGSM
         +V  GS  V+     N L+ + G  +I++A+TVL  ++    +WH+RL H+S KGL+ L KQ +L     + L   +  +    T  K    +H   + 
Subjt:  FQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEADIWHKRLSHISQKGLEALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLHPFGSM

Query:  ---------GSASTPSL------------------------SSSRESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYT
                 G +  PS+                        S  ++ GI  H+TV YTPQQNG+AER+NRTI+ER+RC+LS++ L + FW E A  VV+ 
Subjt:  ---------GSASTPSL------------------------SSSRESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYT

Query:  LNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGY
        +NRSP ++L F TP+EKW+    +   LKVFGC  Y
Subjt:  LNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGY

A0A5A7TP18 Putative gag-pol polyprotein | e value: 6.7e-62 | % identity: 31.81
Query:  DKLGDENEAFVLLNSLSEAYKEVKNALKYGRDSIKTDVIISALRTRELEIHSS---------------------------------------HKENHSGD
        +K+ DEN+A +LLNSL E Y+EVK A+KYGRDS+   +++ AL+TR LEI                                          HKE H   
Subjt:  DKLGDENEAFVLLNSLSEAYKEVKNALKYGRDSIKTDVIISALRTRELEIHSS---------------------------------------HKENHSGD

Query:  GLFVKDALASTLDQANHVNPLGKHD----------------------------WVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTM
           +  +  ++  +AN  +     +                            W++DSGCT+HMTP R +   ++++    V +G+N   ++ G G+V +
Subjt:  GLFVKDALASTLDQANHVNPLGKHD----------------------------WVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTM

Query:  KLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEAD-IWHKRLSHISQKGL
           DG V++L NV +VP LK NLISLG LD  GC  K + GV +V  GS + L G   + L++++G  +   A  + S  +T+   +WHKRL+H+S++GL
Subjt:  KLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEAD-IWHKRLSHISQKGL

Query:  EALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLHPFGSM---------GSASTPSLSSSR------------------------------------
        +ALS+Q +L      +L   +  +   +T  K   G H    +         G     S+  SR                                    
Subjt:  EALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLHPFGSM---------GSASTPSLSSSR------------------------------------

Query:  ----------------------------ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEK
                                      GITRH TVTYTPQQNG+AER NRTIMER RCLL+NA L  KFW E A    Y +NRSP T+L   TP+E 
Subjt:  ----------------------------ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEK

Query:  WSKHPPSLNDLKVFGCVGYADQNKG
        W+   PSL  L+VFGC  YA    G
Subjt:  WSKHPPSLNDLKVFGCVGYADQNKG

A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class | e value: 6.7e-62 | % identity: 31.49
Query:  DKLGDENEAFVLLNSLSEAYKEVKNALKYGRDSIKTDVIISALRTRELEIHSS---------------------------------------HKENHSGD
        +K+ DEN+A +LLNSL E Y+EVK A+KYG DS+   +++ AL+TR LEI                                          HKE H   
Subjt:  DKLGDENEAFVLLNSLSEAYKEVKNALKYGRDSIKTDVIISALRTRELEIHSS---------------------------------------HKENHSGD

Query:  GLFVKDALASTLDQANHVNPLGKHD----------------------------WVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTM
           +  +  ++  +AN  +     +                            W++DSGCT+HMTP R +   ++++    V +G+N   ++ G G+V +
Subjt:  GLFVKDALASTLDQANHVNPLGKHD----------------------------WVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTM

Query:  KLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEADIWHKRLSHISQKGLE
           DG V++L NVR+VP LK NLISLG LD  GC  K + GV +V  GS + L G   + L++++G  +   A          + +WHKRL+H+S++GL+
Subjt:  KLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEADIWHKRLSHISQKGLE

Query:  ALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLHPFGSM---------GSASTPSLSSSR-------------------------------------
        ALS+Q +L      +L   +  +   +T  K   G H    +         G     S+  SR                                     
Subjt:  ALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLHPFGSM---------GSASTPSLSSSR-------------------------------------

Query:  ---------------------------ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKW
                                     GITRH TVTYTPQQNG+AER NRTIMER RCLL+NA L  KFW E A    Y +NRSP T+L   TP+E W
Subjt:  ---------------------------ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKW

Query:  SKHPPSLNDLKVFGCVGYADQNKG
        +   PSL  L+VFGC  YA    G
Subjt:  SKHPPSLNDLKVFGCVGYADQNKG

A0A5A7UB25 Putative gag-pol polyprotein | e value: 8.0e-63 | % identity: 32
Query:  DKLGDENEAFVLLNSLSEAYKEVKNALKYGRDSIKTDVIISALRTRELEIHSS---------------------------------------HKENHSGD
        +K+ DEN+A +LLNSL E Y+EVK A+KYGRDS+   +++ AL+TR LEI                                          HKE H   
Subjt:  DKLGDENEAFVLLNSLSEAYKEVKNALKYGRDSIKTDVIISALRTRELEIHSS---------------------------------------HKENHSGD

Query:  GLFVKDALASTLDQANHVNPLGKHD----------------------------WVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTM
           +  +  ++  +AN  +     +                            W++DSGCT+HMTP R +   ++++    V +G+N   ++ G G+V +
Subjt:  GLFVKDALASTLDQANHVNPLGKHD----------------------------WVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTM

Query:  KLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEAD-IWHKRLSHISQKGL
           DG V++L NVR+VP LK NLISLG LD  GC  K + GV +V  GS + L G   + L++++G  +   A  + S  +T+   +WHKRL+H+S++GL
Subjt:  KLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEAD-IWHKRLSHISQKGL

Query:  EALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLHPFGSM---------GSASTPSLSSSR------------------------------------
        +ALS+Q +L      +L   +  +   +T  K   G H    +         G     S+  SR                                    
Subjt:  EALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLHPFGSM---------GSASTPSLSSSR------------------------------------

Query:  ----------------------------ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEK
                                      GITRH TVTYTPQQNG+AER NRTIMER RCLL+NA L  KFW E A    Y +NRSP T+L   TP+E 
Subjt:  ----------------------------ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEK

Query:  WSKHPPSLNDLKVFGCVGYADQNKG
        W+   PSL  L+VFGC  YA    G
Subjt:  WSKHPPSLNDLKVFGCVGYADQNKG

A0A5D3DNU1 Putative gag-pol polyprotein | e value: 8.0e-63 | % identity: 32
Query:  DKLGDENEAFVLLNSLSEAYKEVKNALKYGRDSIKTDVIISALRTRELEIHSS---------------------------------------HKENHSGD
        +K+ DEN+A +LLNSL E Y+EVK A+KYGRDS+   +++ AL+TR LEI                                          HKE H   
Subjt:  DKLGDENEAFVLLNSLSEAYKEVKNALKYGRDSIKTDVIISALRTRELEIHSS---------------------------------------HKENHSGD

Query:  GLFVKDALASTLDQANHVNPLGKHD----------------------------WVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTM
           +  +  ++  +AN  +     +                            W++DSGCT+HMTP R +   ++++    V +G+N   ++ G G+V +
Subjt:  GLFVKDALASTLDQANHVNPLGKHD----------------------------WVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTM

Query:  KLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEAD-IWHKRLSHISQKGL
           DG V++L NVR+VP LK NLISLG LD  GC  K + GV +V  GS + L G   + L++++G  +   A  + S  +T+   +WHKRL+H+S++GL
Subjt:  KLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEAD-IWHKRLSHISQKGL

Query:  EALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLHPFGSM---------GSASTPSLSSSR------------------------------------
        +ALS+Q +L      +L   +  +   +T  K   G H    +         G     S+  SR                                    
Subjt:  EALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLHPFGSM---------GSASTPSLSSSR------------------------------------

Query:  ----------------------------ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEK
                                      GITRH TVTYTPQQNG+AER NRTIMER RCLL+NA L  KFW E A    Y +NRSP T+L   TP+E 
Subjt:  ----------------------------ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEK

Query:  WSKHPPSLNDLKVFGCVGYADQNKG
        W+   PSL  L+VFGC  YA    G
Subjt:  WSKHPPSLNDLKVFGCVGYADQNKG

SwissProt top hits (hit | e value | % identity, each followed by its alignment)
P04146 Copia protein | e value: 3.3e-13 | % identity: 43.01
Query:  ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSL--GFLTPEEKWSKHPPSLNDLKVFGCVGY
        + GI+ H TV +TPQ NGV+ER+ RTI E+ R ++S A L++ FW E      Y +NR P  +L     TP E W    P L  L+VFG   Y
Subjt:  ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSL--GFLTPEEKWSKHPPSLNDLKVFGCVGY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 8.3e-33 | % identity: 29.97
Query:  DWVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTMKLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQVFMGSKLV
        +WV+D+  ++H TP R  F  Y      +V MGN + S IAGIG + +K   G   +L++VRHVP L++NLIS   LD  G E       +++  GS ++
Subjt:  DWVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTMKLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQVFMGSKLV

Query:  LVGEKVNDLFIIKGVEMIEEANTVLSLNLTEADIWHKRLSHISQKGLEALSKQ--------------------------------------DILPQDICS
          G     L+         E N   + +    D+WHKR+ H+S+KGL+ L+K+                                      D++  D+C 
Subjt:  LVGEKVNDLFIIKGVEMIEEANTVLSLNLTEADIWHKRLSHISQKGLEALSKQ--------------------------------------DILPQDICS

Query:  KLQ-----------------SKKTKLHQSTTHNKRNPGLHPFGSMGSAST-----------PSLSSSRE-------SGITRHKTVTYTPQQNGVAERLNR
         ++                 S+K  ++   T ++       F ++    T               +SRE        GI   KTV  TPQ NGVAER+NR
Subjt:  KLQ-----------------SKKTKLHQSTTHNKRNPGLHPFGSMGSAST-----------PSLSSSRE-------SGITRHKTVTYTPQQNGVAERLNR

Query:  TIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGYA
        TI+E+VR +L  A L + FW E      Y +NRSP   L F  PE  W+    S + LKVFGC  +A
Subjt:  TIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGYA

P92512 Uncharacterized mitochondrial protein AtMg00710 | e value: 5.4e-08 | % identity: 36
Query:  LNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGYADQNKG
        +NRTI+E+VR +L    L + F  + A   V+ +N+ P T++ F  P+E W +  P+ + L+ FGCV Y   ++G
Subjt:  LNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGYADQNKG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 3.2e-08 | % identity: 35.16
Query:  ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGY
        + GI+   +  +TP+ NG++ER +R I+E    LLS+A + + +W    A  VY +NR P   L   +P +K     P+ + L+VFGC  Y
Subjt:  ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 1.7e-09 | % identity: 36.26
Query:  ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGY
        + GI+   +  +TP+ NG++ER +R I+E    LLS+A + + +W    +  VY +NR P   L   +P +K    PP+   LKVFGC  Y
Subjt:  ESGITRHKTVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGY

Arabidopsis top hits (hit | e value | % identity, each followed by its alignment)
AT3G21000.1 Gag-Pol-related retrotransposon family protein | e value: 1.7e-04 | % identity: 29.49
Query:  WVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTMKLKDGTVKLLRNVRHVPHLKINLISLGMLDS
        W++      +MTP+  +F T     + +V   +     + G G V +++K+G  K +RNV  VP L  N++S G + S
Subjt:  WVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTMKLKDGTVKLLRNVRHVPHLKINLISLGMLDS

ATMG00300.1 Gag-Pol-related retrotransposon family protein | e value: 1.2e-05 | % identity: 29.9
Query:  GVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEADIWHKRLSHISQKGLEALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLH
        GV +V  G + +L G + + L+I++G     E+N   +    E  +WH RL+H+SQ+G+E L K+  L     S L+  +  ++  T     + G H
Subjt:  GVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLTEADIWHKRLSHISQKGLEALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLH

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 3.8e-09 | % identity: 36
Query:  LNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGYADQNKG
        +NRTI+E+VR +L    L + F  + A   V+ +N+ P T++ F  P+E W +  P+ + L+ FGCV Y   ++G
Subjt:  LNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGYADQNKG


Sequences
CDS sequence
ATGGCTATAACAAGAGTAGAAATAGAGAAGTTTGATGGAAATGGAGACTTTGTTTTATGGAAAGCAAAGATCAAAGCGTTGCTTGGACAGCAAAAGACTCATAAAGACAA
ACTAGGTGATGAAAATGAGGCATTCGTCCTTTTAAATTCTTTATCCGAGGCATACAAAGAAGTAAAAAATGCTCTAAAATATGGAAGAGACTCAATAAAAACAGATGTCA
TAATATCAGCTCTAAGAACTAGAGAATTAGAAATACATTCATCACACAAAGAAAATCATAGTGGTGATGGATTGTTTGTCAAAGATGCCTTGGCATCAACCTTAGACCAA
GCCAACCATGTCAACCCCTTAGGAAAACATGATTGGGTCCTAGACTCAGGATGCACCTACCATATGACACCTTTTAGAGCATGGTTCAATACCTATAGAGAGATTAGTAG
AGAATCTGTGTTCATGGGGAATAATAATGAAAGTAACATTGCTGGAATTGGAACAGTTACCATGAAACTAAAAGATGGGACTGTAAAACTCCTTAGAAATGTAAGACATG
TTCCTCACCTTAAAATAAATTTAATCTCCCTAGGAATGTTAGACTCTCTAGGGTGTGAATATAAAGGAAAATGTGGAGTTTTCCAAGTCTTTATGGGATCTAAGTTAGTC
TTGGTTGGGGAAAAGGTAAATGATTTGTTCATAATAAAAGGAGTAGAAATGATAGAGGAGGCAAATACAGTTTTATCTCTAAACCTAACAGAAGCTGATATTTGGCATAA
AAGATTGTCCCATATTAGTCAGAAGGGTCTTGAGGCACTATCTAAACAGGACATTCTGCCTCAAGACATATGCAGCAAACTGCAAAGCAAGAAAACAAAACTTCACCAAA
GCACAACACACAACAAAAGGAATCCTGGACTACATCCATTCGGATCTATGGGTTCAGCATCCACTCCAAGCCTAAGTAGCTCAAGGGAAAGTGGAATCACAAGACACAAA
ACTGTGACATACACACCTCAACAAAATGGAGTGGCAGAAAGACTCAACAGAACTATAATGGAAAGGGTAAGATGCCTATTATCAAATGCCATTCTAGAAGAAAAGTTTTG
GGTTGAGGTTGCTGCCTACGTTGTGTACACATTGAATAGAAGTCCTCACACCTCCTTGGGATTCCTAACACCTGAGGAGAAATGGTCCAAACACCCACCAAGTCTAAATG
ACCTTAAGGTGTTTGGATGTGTAGGGTATGCTGACCAAAATAAAGGAAACTAA
mRNA sequence
ATGGCTATAACAAGAGTAGAAATAGAGAAGTTTGATGGAAATGGAGACTTTGTTTTATGGAAAGCAAAGATCAAAGCGTTGCTTGGACAGCAAAAGACTCATAAAGACAA
ACTAGGTGATGAAAATGAGGCATTCGTCCTTTTAAATTCTTTATCCGAGGCATACAAAGAAGTAAAAAATGCTCTAAAATATGGAAGAGACTCAATAAAAACAGATGTCA
TAATATCAGCTCTAAGAACTAGAGAATTAGAAATACATTCATCACACAAAGAAAATCATAGTGGTGATGGATTGTTTGTCAAAGATGCCTTGGCATCAACCTTAGACCAA
GCCAACCATGTCAACCCCTTAGGAAAACATGATTGGGTCCTAGACTCAGGATGCACCTACCATATGACACCTTTTAGAGCATGGTTCAATACCTATAGAGAGATTAGTAG
AGAATCTGTGTTCATGGGGAATAATAATGAAAGTAACATTGCTGGAATTGGAACAGTTACCATGAAACTAAAAGATGGGACTGTAAAACTCCTTAGAAATGTAAGACATG
TTCCTCACCTTAAAATAAATTTAATCTCCCTAGGAATGTTAGACTCTCTAGGGTGTGAATATAAAGGAAAATGTGGAGTTTTCCAAGTCTTTATGGGATCTAAGTTAGTC
TTGGTTGGGGAAAAGGTAAATGATTTGTTCATAATAAAAGGAGTAGAAATGATAGAGGAGGCAAATACAGTTTTATCTCTAAACCTAACAGAAGCTGATATTTGGCATAA
AAGATTGTCCCATATTAGTCAGAAGGGTCTTGAGGCACTATCTAAACAGGACATTCTGCCTCAAGACATATGCAGCAAACTGCAAAGCAAGAAAACAAAACTTCACCAAA
GCACAACACACAACAAAAGGAATCCTGGACTACATCCATTCGGATCTATGGGTTCAGCATCCACTCCAAGCCTAAGTAGCTCAAGGGAAAGTGGAATCACAAGACACAAA
ACTGTGACATACACACCTCAACAAAATGGAGTGGCAGAAAGACTCAACAGAACTATAATGGAAAGGGTAAGATGCCTATTATCAAATGCCATTCTAGAAGAAAAGTTTTG
GGTTGAGGTTGCTGCCTACGTTGTGTACACATTGAATAGAAGTCCTCACACCTCCTTGGGATTCCTAACACCTGAGGAGAAATGGTCCAAACACCCACCAAGTCTAAATG
ACCTTAAGGTGTTTGGATGTGTAGGGTATGCTGACCAAAATAAAGGAAACTAA
Protein sequence
MAITRVEIEKFDGNGDFVLWKAKIKALLGQQKTHKDKLGDENEAFVLLNSLSEAYKEVKNALKYGRDSIKTDVIISALRTRELEIHSSHKENHSGDGLFVKDALASTLDQ
ANHVNPLGKHDWVLDSGCTYHMTPFRAWFNTYREISRESVFMGNNNESNIAGIGTVTMKLKDGTVKLLRNVRHVPHLKINLISLGMLDSLGCEYKGKCGVFQVFMGSKLV
LVGEKVNDLFIIKGVEMIEEANTVLSLNLTEADIWHKRLSHISQKGLEALSKQDILPQDICSKLQSKKTKLHQSTTHNKRNPGLHPFGSMGSASTPSLSSSRESGITRHK
TVTYTPQQNGVAERLNRTIMERVRCLLSNAILEEKFWVEVAAYVVYTLNRSPHTSLGFLTPEEKWSKHPPSLNDLKVFGCVGYADQNKGN
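
The CDS and protein sequences listed above can be cross-checked by translating the CDS in frame. The following is a minimal Python sketch, not part of the database record: it assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for illustration. It handles only unambiguous A/C/G/T bases (an `N` would raise a `KeyError`).

```python
# Standard genetic code (assumed); codons are ordered with bases T, C, A, G
# at the first, second and third position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Remove line breaks and spaces so a multi-line sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the CDS sequence listed above.
    print(translate("ATGGCTATAACAAGAGTAGAAATAGAGAAG"))  # prints MAITRVEIEK
```

Passing the full CDS text to `translate` and comparing the result with the listed protein sequence is one way to confirm that the two entries agree; the last five codons of the CDS (`AAT AAA GGA AAC TAA`) translate to `NKGN` followed by a stop, matching the end of the protein sequence.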