; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011391 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011391
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr1:23589887..23592569
RNA-Seq ExpressionLag0011391
SyntenyLag0011391
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]8.7e-10339.12Show/hide
Query:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------
        N  + EW   DQ LLGW+  SM   IA+ +++ +TS  +W   +   G+H +++II L+S   + +K  MKM +YL KMK + D L LA           
Subjt:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------

Query:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGT
            GLD++Y P+   L+ +  L+W +L A L+ FE  +  LN ++++    NATA++A                    N  + R +S+ N+ R  +   
Subjt:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGT

Query:  YRGGRQRGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGA--PPNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEY------
        +RGGR RG+ G+N      CQVCG   H    C++R+D  Y  +     H K  +++AF+AS   V D  WY DSGASNH+T         TE+      
Subjt:  YRGGRQRGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGA--PPNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEY------

Query:  ------------TDNQKLK---LNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSPRFVL
                    T + KLK   L++IL+VP ITKNLLSVSKL  DN+  +EF  + C VKDK TG+V+ +G LK+GLYQL    ++P             
Subjt:  ------------TDNQKLK---LNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSPRFVL

Query:  KSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHTDLWG
                                 S+F S KE WHRRLGHP+ +VL  +L+ C  K   ++   FC+ACQ+GK H LPF++S+SHA++PLEL+HTD+WG
Subjt:  KSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHTDLWG

Query:  PAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNK
        PAP+ +S+GF YY+HF+DDFSR TWIYPLK KS+++ AF  FKNL EN+F+ +IK +Q D GGEY    K
Subjt:  PAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNK

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]1.5e-10741.25Show/hide
Query:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------
        NPE+E+W   DQ LLGWL  SM   IA+ +++ +TSM +W   +   G+H +++I  L+S   +T+K  MKM +YL KMK +AD L LA           
Subjt:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------

Query:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGT
            GLD++Y P+   L+ +  L+W +L A L+ FE  +  LN ++++    NATA++A +  +  N  N++ + +G NN+  G N           RG 
Subjt:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGT

Query:  YRGGRQRGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGA--PPNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEY------
        +RGGR RGR  +       CQVCG   H    C+ R+D  Y  +    N+ K  +++AF+AS   + D  WY DSGASNH+T   +   + +E+      
Subjt:  YRGGRQRGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGA--PPNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEY------

Query:  ------------TDNQKLK---LNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSPRFVL
                    T + KLK   L++IL+VP ITKNLLSVSKL  DN+  +EF  + C VKDK TG+ + +G LK+GLYQL  ++ S              
Subjt:  ------------TDNQKLK---LNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSPRFVL

Query:  KSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHTDLWG
                              AYVS     KE WHR+LGHP+ +VL  +LK CN K + +++  FC+ACQ+GK H LPF+TS SHAK+ LEL+HTD+WG
Subjt:  KSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHTDLWG

Query:  PAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNKSPHAAEA
        PAP+ SS+GF YY+HF+DDF+R TWIYPLK KSD+  AF  FKN+VEN+F  KIKT+Q D GGEY    K  HA EA
Subjt:  PAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNKSPHAAEA

PNX78574.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense]3.2e-10540.42Show/hide
Query:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------
        NP++ +W   DQ +LGWL  +M    AS +++ +TS  +W+  +    +H ++R+I LRS   NT+K   KM +YL KMK +AD L +A           
Subjt:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------

Query:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGT
            GLD+ Y PI   L+ +  L+W +L A L+AFE  L  LN  +++N   NAT ++A +  +  N  N+                          RG+
Subjt:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGT

Query:  YRGGRQRG-RGGRNNG--NRPVCQVCGKLGHTTAVCYNRYDGNYMGAP---PNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEYTD
        +RG   R  RGGR  G  +  +CQVC K GHT   C +RYD +Y G+     N  + +T++AF+AS     D  WY DSGASNH+T   +     TE + 
Subjt:  YRGGRQRG-RGGRNNG--NRPVCQVCGKLGHTTAVCYNRYDGNYMGAP---PNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEYTD

Query:  N---------------------QKLKLNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSP
                              + L L+++L+VP ITKNLLSVSKLT+DN+  +EF +D C VKDK TG+VL +G LK+GLYQL   +   N     K P
Subjt:  N---------------------QKLKLNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSP

Query:  RFVLKSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHT
           L                             S KE WHR+LGHPS  VL  +LK CN K + ++K  FC+ACQ GKSH LPF++S+SHA++ LELIHT
Subjt:  RFVLKSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHT

Query:  DLWGPAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNK
        D+WGPAP+NS +GF YY+HF+DD SR TWIYPLK KSD++ AF  FKN+VEN+F+ +IK +Q D GGE+    K
Subjt:  DLWGPAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNK

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]1.1e-11343.13Show/hide
Query:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------
        NP+Y++W   DQALLGWL  SM   IA+ V++ +TS  +W   +   G+H ++RII L+S   NT K  MKM +YLAKMK +AD L LA           
Subjt:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------

Query:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGT
            GLD++Y P+   L+ +  ++W +  A L+AFE  L  LN  +++N   NA+A+ A +    N    N F ++G     N R               
Subjt:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGT

Query:  YRGGRQRGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGAPPNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEYT-------
         RGG  RGR   +   RP+CQ+CGK GHT A CY R+D +Y           ++SAF+ASP    D  WY DSGASNH+T     L    E         
Subjt:  YRGGRQRGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGAPPNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEYT-------

Query:  -DNQKLK-------------LNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSPRFVLKS
         + +KLK             L N+L+VP ITKNLLSVSKLT DN+A +EF  ++C VKDK TG+ L +G+LK+GLYQL   NK P               
Subjt:  -DNQKLK-------------LNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSPRFVLKS

Query:  SFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHTDLWGPA
                     P   +  AY+    S KE+WHR+LGHP+ +VL  +LKD N K + ++K  FC+ACQFGK H LPF+TS+SHAK+PL+LIHTD+WGPA
Subjt:  SFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHTDLWGPA

Query:  PVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNK
        P+ S + F YY+HFLDDFSR TWI+PLK KS+++ AF  FKNLVEN+F+ KIK ++ D GGEY    K
Subjt:  PVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNK

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]2.3e-10341.01Show/hide
Query:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------
        NP++++W+  DQALLGWL  SM   IA+ +++ +TS  +W   +   G+H K+RII L+S   NT+K  MKM EYL KMK ++D L L+           
Subjt:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------

Query:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGT
            GLDA+Y P+   L+ +  L+W ++ A L+AFE  L  L                            N+FS    N S N  N++     +  SRG 
Subjt:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGT

Query:  YRGGRQRG-RGGRNNG--NRPVCQVCGKLGHTTAVCYNRYDGNYMGA--PPNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEYT--
        +R    RG RGGR  G  +   CQVC   GHT   C  R+D +Y G        K  ++SAF+ASP    D  WY DSGASNH+T   +      E+   
Subjt:  YRGGRQRG-RGGRNNG--NRPVCQVCGKLGHTTAVCYNRYDGNYMGA--PPNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEYT--

Query:  ------DNQKLK-------------LNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSPR
              + +KLK             L+++L+VP ITKNLLSVSKLT DN+ ++EF ++ C VKDK TG+ L +G+LK+GLYQL   + SP          
Subjt:  ------DNQKLK-------------LNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSPR

Query:  FVLKSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHTD
                            QSN    V  + S KE WHR+LGHP+ +VL  +LKDCN K + +++  FC+ACQFGK H LPF++S+SH ++PL LIH+D
Subjt:  FVLKSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHTD

Query:  LWGPAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNK
        +WGPAP+ S +GF YY+HF+DDFSR TWI+PLK KSD++ AF  FKNL EN+F+ KIK +Q D GGEY    K
Subjt:  LWGPAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNK

TrEMBL top hitse value%identityAlignment
A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment)7.4e-10841.25Show/hide
Query:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------
        NPE+E+W   DQ LLGWL  SM   IA+ +++ +TSM +W   +   G+H +++I  L+S   +T+K  MKM +YL KMK +AD L LA           
Subjt:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------

Query:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGT
            GLD++Y P+   L+ +  L+W +L A L+ FE  +  LN ++++    NATA++A +  +  N  N++ + +G NN+  G N           RG 
Subjt:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGT

Query:  YRGGRQRGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGA--PPNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEY------
        +RGGR RGR  +       CQVCG   H    C+ R+D  Y  +    N+ K  +++AF+AS   + D  WY DSGASNH+T   +   + +E+      
Subjt:  YRGGRQRGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGA--PPNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEY------

Query:  ------------TDNQKLK---LNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSPRFVL
                    T + KLK   L++IL+VP ITKNLLSVSKL  DN+  +EF  + C VKDK TG+ + +G LK+GLYQL  ++ S              
Subjt:  ------------TDNQKLK---LNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSPRFVL

Query:  KSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHTDLWG
                              AYVS     KE WHR+LGHP+ +VL  +LK CN K + +++  FC+ACQ+GK H LPF+TS SHAK+ LEL+HTD+WG
Subjt:  KSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHTDLWG

Query:  PAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNKSPHAAEA
        PAP+ SS+GF YY+HF+DDF+R TWIYPLK KSD+  AF  FKN+VEN+F  KIKT+Q D GGEY    K  HA EA
Subjt:  PAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNKSPHAAEA

A0A2K3LJ49 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-10540.42Show/hide
Query:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------
        NP++ +W   DQ +LGWL  +M    AS +++ +TS  +W+  +    +H ++R+I LRS   NT+K   KM +YL KMK +AD L +A           
Subjt:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------

Query:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGT
            GLD+ Y PI   L+ +  L+W +L A L+AFE  L  LN  +++N   NAT ++A +  +  N  N+                          RG+
Subjt:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGT

Query:  YRGGRQRG-RGGRNNG--NRPVCQVCGKLGHTTAVCYNRYDGNYMGAP---PNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEYTD
        +RG   R  RGGR  G  +  +CQVC K GHT   C +RYD +Y G+     N  + +T++AF+AS     D  WY DSGASNH+T   +     TE + 
Subjt:  YRGGRQRG-RGGRNNG--NRPVCQVCGKLGHTTAVCYNRYDGNYMGAP---PNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEYTD

Query:  N---------------------QKLKLNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSP
                              + L L+++L+VP ITKNLLSVSKLT+DN+  +EF +D C VKDK TG+VL +G LK+GLYQL   +   N     K P
Subjt:  N---------------------QKLKLNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSP

Query:  RFVLKSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHT
           L                             S KE WHR+LGHPS  VL  +LK CN K + ++K  FC+ACQ GKSH LPF++S+SHA++ LELIHT
Subjt:  RFVLKSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHT

Query:  DLWGPAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNK
        D+WGPAP+NS +GF YY+HF+DD SR TWIYPLK KSD++ AF  FKN+VEN+F+ +IK +Q D GGE+    K
Subjt:  DLWGPAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNK

A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)5.3e-11443.13Show/hide
Query:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------
        NP+Y++W   DQALLGWL  SM   IA+ V++ +TS  +W   +   G+H ++RII L+S   NT K  MKM +YLAKMK +AD L LA           
Subjt:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------

Query:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGT
            GLD++Y P+   L+ +  ++W +  A L+AFE  L  LN  +++N   NA+A+ A +    N    N F ++G     N R               
Subjt:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGT

Query:  YRGGRQRGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGAPPNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEYT-------
         RGG  RGR   +   RP+CQ+CGK GHT A CY R+D +Y           ++SAF+ASP    D  WY DSGASNH+T     L    E         
Subjt:  YRGGRQRGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGAPPNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEYT-------

Query:  -DNQKLK-------------LNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSPRFVLKS
         + +KLK             L N+L+VP ITKNLLSVSKLT DN+A +EF  ++C VKDK TG+ L +G+LK+GLYQL   NK P               
Subjt:  -DNQKLK-------------LNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSPRFVLKS

Query:  SFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHTDLWGPA
                     P   +  AY+    S KE+WHR+LGHP+ +VL  +LKD N K + ++K  FC+ACQFGK H LPF+TS+SHAK+PL+LIHTD+WGPA
Subjt:  SFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHTDLWGPA

Query:  PVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNK
        P+ S + F YY+HFLDDFSR TWI+PLK KS+++ AF  FKNLVEN+F+ KIK ++ D GGEY    K
Subjt:  PVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNK

A0A803PEH4 Uncharacterized protein6.1e-11042.81Show/hide
Query:  VPNPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA---------
        V NPEYE WI  DQ L+GWLY SM + IA++V+   ++  + + LE  YG+++K+++   R+ +Q T+K +  M+EYL + K  ++ L+LA         
Subjt:  VPNPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA---------

Query:  ------GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRD---
              GLDA+YL I   +  +   TW+EL   L++F+         S +  L N T + + + T S+  +N +  T   NN+  GR   +QN+  +   
Subjt:  ------GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRD---

Query:  ---KSRGTYRGGRQRGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGAPPNHGKNQ--------TNSAFIASPEIVNDSSWYADSGASNHITSDPN
            SRGT    R RGR G  +G+RP CQV GK GHT AVCYNR+D +YMG+ PN+  NQ         +SAF+A+PE++   +W+ADSGASNHITSDP 
Subjt:  ---KSRGTYRGGRQRGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGAPPNHGKNQ--------TNSAFIASPEIVNDSSWYADSGASNHITSDPN

Query:  MLNHKTEYTDNQK--------------------------LKLNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLE
         L  K +Y   +                           L L ++L VP I KNL+SVSKL  DN+  +EF+S+FC+VKDK T +VL  G LK+ LYQL+
Subjt:  MLNHKTEYTDNQK--------------------------LKLNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLE

Query:  LRNKSPNLFLCQKSPRFVLKSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQ
            SP      KS     +S+F   +  S      QS      S   S+ +V HRRLGHPS +VL+++L+  N   + N     CDACQ+GK+HALPF+
Subjt:  LRNKSPNLFLCQKSPRFVLKSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQ

Query:  TSNSHAKKPLELIHTDLWGPAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEY
        +SN+ AK  L+LIHTDLWGPAP+ S+   HYYIHF+DD+SR+TW+YPLK KSD+LAAF  FK LVEN+F  KIK+L+SD GGEY
Subjt:  TSNSHAKKPLELIHTDLWGPAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEY

A0A803PM38 Uncharacterized protein1.0e-10440.91Show/hide
Query:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------
        NP +E+WI  DQ LLGWLYGSM + IA +V+   +S ++W ALE  +G+H+KA++   R+ +Q  +K  + M +YL + +Q AD L+LA           
Subjt:  NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLA-----------

Query:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQN-SGRDKSRG
            GLD +YLP+   +  + + TW++L   L++ +  +  L+  S  + L+    +            + S + +GP+   N  N +N N  G   +RG
Subjt:  ----GLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQN-SGRDKSRG

Query:  TYRGGRQRGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGAPPNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEYTDNQK--
        +    R RGRGGR +G RP CQVCGK GH+ A CYNR                                     GASNHITS+ N +N K EY   +K  
Subjt:  TYRGGRQRGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGAPPNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEYTDNQK--

Query:  ------------------------LKLNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSP
                                L L  ILHVP+ITKNLLS+SKLT+DN+  +EF SD C VKDK TG+V+ +GKLK+GLYQ +    +P       S 
Subjt:  ------------------------LKLNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSP

Query:  RFV-LKSSFKGQYGCSSQRQPLQSNVKAYVSS--FNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLEL
        R +   +SF G    +     ++SNV   +++    S K+ WHRRLGHPS RVL  +L   N K  +N    FCDACQ GKSH+LPF+ +   A  PLEL
Subjt:  RFV-LKSSFKGQYGCSSQRQPLQSNVKAYVSS--FNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLEL

Query:  IHTDLWGPAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEY
        +HTD+WGP+P+ S+T F YYIHF+DDFSR+TWIYPLK KS++LAAF  FK LVEN+F+S++K +Q+DWGGEY
Subjt:  IHTDLWGPAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEY

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.3e-1322.12Show/hide
Query:  VPNPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTK-KNNMKMTEYLAKMKQIADGLSLAGLDAQYLP
        +PN   + W   ++     +   +  S  +   +  T+  + + L+  Y   + A  + LR  L + K  + M +  +     ++   L  AG   + + 
Subjt:  VPNPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTK-KNNMKMTEYLAKMKQIADGLSLAGLDAQYLP

Query:  ITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGTYRGGRQRGRGGR
                     +++   LI       +  +I+ +  LS     LA  K   N   +     +   N HN  ++   N+    +  TY+    + R  +
Subjt:  ITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGTYRGGRQRGRGGR

Query:  -------NNGNRPVCQVCGKLGHTTAVC--YNRYDGNYMGAPPNHGKNQTNSAFIASPEIVNDSS------WYADSGASNHITSDPNMLNHKTEYTDNQK
               N+  +  C  CG+ GH    C  Y R   N         +  T+       + VN++S      +  DSGAS+H+ +D ++     E     K
Subjt:  -------NNGNRPVCQVCGKLGHTTAVC--YNRYDGNYMGAPPNHGKNQTNSAFIASPEIVNDSS------WYADSGASNHITSDPNMLNHKTEYTDNQK

Query:  LKLNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSPRFVLKSSFKGQYGCSSQRQPLQSN
        + +         TK    + +L ND++  +E   D    K+   G ++   +L+E    +E       +    K+   V+K+S     G  +    +  N
Subjt:  LKLNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSPRFVLKSSFKGQYGCSSQRQPLQSN

Query:  VKAYVSSFNSKKE----VWHRRLGHPSERVLSYI-----LKDCNQKFTLNEKCDFCDACQFGKSHALPFQ--TSNSHAKKPLELIHTDLWGPAPVNSSTG
         +AY  S N+K +    +WH R GH S+  L  I       D +    L   C+ C+ C  GK   LPF+     +H K+PL ++H+D+ GP    +   
Subjt:  VKAYVSSFNSKKE----VWHRRLGHPSERVLSYI-----LKDCNQKFTLNEKCDFCDACQFGKSHALPFQ--TSNSHAKKPLELIHTDLWGPAPVNSSTG

Query:  FHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEY
         +Y++ F+D F+ +   Y +K KSD  + F+ F    E  F+ K+  L  D G EY
Subjt:  FHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEY

P0C2I3 Transposon Ty1-DR6 Gag-Pol polyprotein6.2e-1123.81Show/hide
Query:  YTDNQKLKLNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCV-VKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSPRFVLKSSFKGQYGCSSQ
        + DN K  +  +LH P I  +LLS+++L     A ++  + F   V +++ G VL         Y +              S R++L S+          
Subjt:  YTDNQKLKLNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCV-VKDKTTGRVLHQGKLKEGLYQLELRNKSPNLFLCQKSPRFVLKSSFKGQYGCSSQ

Query:  RQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDF-------CDACQFGKS----HALPFQTSNSHAKKPLELIHTDLWGPA
          P  +NV    S+        HR L H + + + Y LK+    +      D+       C  C  GKS    H    +    ++ +P + +HTD++GP 
Subjt:  RQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDF-------CDACQFGKS----HALPFQTSNSHAKKPLELIHTDLWGPA

Query:  PVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDS--LAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNKSPH
            ++   Y+I F D+ ++  W+YPL ++ +   L  F      ++N+F + +  +Q D G EY+  N++ H
Subjt:  PVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDS--LAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNKSPH

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.0e-2124.13Show/hide
Query:  GLNSGLSLPTKTVPNPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGL
        GL+  L + +K     + E+W  +D+     +   +   + +++I+  T+  +W  LE  Y S      + L+  L     +  + T +L+ +  + +GL
Subjt:  GLNSGLSLPTKTVPNPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGL

Query:  --SLAGLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRG
           LA L      +      K  L    L ++      T++H     ++ D+++A           N         QG      GR +S Q S  +  R 
Subjt:  --SLAGLDAQYLPITCTLNGKDALTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRG

Query:  TYRG-GRQRGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGAPPNHGKNQTNSA-----------FIASPEIV-----NDSSWYADSGASNHITSD
          RG  + R +    N     C  C + GH    C N   G       +  KN  N+A           FI   E        +S W  D+ AS+H T  
Subjt:  TYRG-GRQRGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGAPPNHGKNQTNSA-----------FIASPEIV-----NDSSWYADSGASNHITSD

Query:  PNMLNHKTEYTDNQKLKLNNILHVPTITKNLLSVSKLTNDNDAYMEFH-SDFCVVKDKTTGRVLHQGKLKEGL----------YQLELRNKSPNLFLCQK
         ++        D   +K+ N            S SK+    D  ++ +     V+KD     V H   L+  L          Y+    N+   L    K
Subjt:  PNMLNHKTEYTDNQKLKLNNILHVPTITKNLLSVSKLTNDNDAYMEFH-SDFCVVKDKTTGRVLHQGKLKEGL----------YQLELRNKSPNLFLCQK

Query:  SPRFVLKSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELI
            + K   +G     +  +  Q  + A     +   ++WH+R+GH SE+ L  + K     +        CD C FGK H + FQTS+      L+L+
Subjt:  SPRFVLKSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELI

Query:  HTDLWGPAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYS
        ++D+ GP  + S  G  Y++ F+DD SR  W+Y LK K      F+ F  LVE +   K+K L+SD GGEY+
Subjt:  HTDLWGPAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.8e-5028.05Show/hide
Query:  FIGSNVVNTSSGNGLNSGLSLPTKTVP--NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMK
        F G  +     G+      ++ T   P  NP+Y  W   D+ +   + G++  S+   V    T+  +W+ L   Y + +   +  LR+ L+   K    
Subjt:  FIGSNVVNTSSGNGLNSGLSLPTKTVP--NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMK

Query:  MTEYLAKMKQIADGLSLAG---------------LDAQYLPITCTLNGKDA-LTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSN
        + +Y+  +    D L+L G               L  +Y P+   +  KD   T  E+H  L+  E  ++ ++  + +   +NA +H     T +NN  N
Subjt:  MTEYLAKMKQIADGLSLAG---------------LDAQYLPITCTLNGKDA-LTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSN

Query:  NSFSTQGPNNSHNGRNQSNQNSGRDKSRGTYRGGRQRGRGGRNNGNRPV---CQVCGKLGHTTAVC---YNRYDGNYMGAPPNHGKNQTNSAFIASPEIV
                NN ++ RN +N +    +S   +           NN ++P    CQ+CG  GH+   C    +         PP+        A +A     
Subjt:  NSFSTQGPNNSHNGRNQSNQNSGRDKSRGTYRGGRQRGRGGRNNGNRPV---CQVCGKLGHTTAVC---YNRYDGNYMGAPPNHGKNQTNSAFIASPEIV

Query:  NDSSWYADSGASNHITSDPNMLNHKTEY-------------------------TDNQKLKLNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDK
        + ++W  DSGA++HITSD N L+    Y                         T ++ L L+NIL+VP I KNL+SV +L N N   +EF      VKD 
Subjt:  NDSSWYADSGASNHITSDPNMLNHKTEY-------------------------TDNQKLKLNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDK

Query:  TTGRVLHQGKLKEGLYQLELRNKSP-NLFLCQKSPRFVLKSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLN
         TG  L QGK K+ LY+  + +  P +LF    S                          KA  SS       WH RLGHP+  +L+ ++ + +    LN
Subjt:  TTGRVLHQGKLKEGLYQLELRNKSP-NLFLCQKSPRFVLKSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLN

Query:  EKCDF--CDACQFGKSHALPFQTSNSHAKKPLELIHTDLWGPAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQS
            F  C  C   KS+ +PF  S  ++ +PLE I++D+W  +P+ S   + YY+ F+D F+R+TW+YPLK KS     F  FKNL+EN+F ++I T  S
Subjt:  EKCDF--CDACQFGKSHALPFQTSNSHAKKPLELIHTDLWGPAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQS

Query:  DWGGEY
        D GGE+
Subjt:  DWGGEY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.3e-4529.74Show/hide
Query:  SLPTKTVP--NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNM-KMTEYLAKMKQIADGLSLA
        ++ T  VP  NP+Y  W   D+ +   + G++  S+   V    T+  +W+ L   Y + +   +  LR   +  +   + K  ++  +++++     L 
Subjt:  SLPTKTVP--NPEYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNM-KMTEYLAKMKQIADGLSLA

Query:  GLDAQYLPITCTLNGKDA-LTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGTYRG
         L   Y P+   +  KD   +  E+H  LI  E  L+ LN    V   +N   H   + T +N   NN    +G N ++N  N +  NS +  S G+   
Subjt:  GLDAQYLPITCTLNGKDA-LTWEELHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGTYRG

Query:  GRQ-RGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGAPPNHGKNQTNSAF--------IASPEIVNDSSWYADSGASNHITSDPNMLNHKTEY--
         RQ +   GR       CQ+C   GH+   C   +            + Q+ S F        +A     N ++W  DSGA++HITSD N L+    Y  
Subjt:  GRQ-RGRGGRNNGNRPVCQVCGKLGHTTAVCYNRYDGNYMGAPPNHGKNQTNSAF--------IASPEIVNDSSWYADSGASNHITSDPNMLNHKTEY--

Query:  -----------------------TDNQKLKLNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLEL-RNKSPNLFL
                               T ++ L LN +L+VP I KNL+SV +L N N   +EF      VKD  TG  L QGK K+ LY+  +  +++ ++F 
Subjt:  -----------------------TDNQKLKLNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQGKLKEGLYQLEL-RNKSPNLFL

Query:  CQKSPRFVLKSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCN-QKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKP
           SP             CS          KA  SS       WH RLGHPS  +L+ ++ + +      + K   C  C   KSH +PF  S   + KP
Subjt:  CQKSPRFVLKSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCN-QKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKP

Query:  LELIHTDLWGPAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEY
        LE I++D+W  +P+ S   + YY+ F+D F+R+TW+YPLK KS     F  FK+LVEN+F ++I TL SD GGE+
Subjt:  LELIHTDLWGPAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEY

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein3.5e-0935.21Show/hide
Query:  VWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHTDLWGPAPV
        +WH RL H S+R +  ++K      +      FC+ C +GK+H + F T     K PL+ +H+DLWG   V
Subjt:  VWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPFQTSNSHAKKPLELIHTDLWGPAPV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAAAATCCTCAAAGGCAAATATTTCAGCGGAGGGGTTTTCCTAAAGGCCAAAGAGGACTGCATAGCTTCTATCGTGTGGAAAAACATATTATGGGGCAGATCTCT
ATTTGAAAAAGGTTACATATGGAGGATAGAGCTCATTCAAGCCAACGCTTCATCTCATTCAGCCTCCATCAACATGGATTTGGGAAAAGAAACAACGGTTCCAACGCCCA
CATCATCAACAACAGCTCCTGCATTCATAGGAAGCAACGTTGTTAACACTTCATCTGGAAATGGGCTAAACTCAGGACTCTCACTCCCAACAAAAACTGTTCCAAACCCA
GAGTATGAAGAATGGATAACAGTGGATCAAGCTCTCTTAGGGTGGCTTTATGGCTCAATGGAACAATCCATTGCCTCAGATGTTATAAACCATAAAACATCAATGGCTGT
CTGGAAAGCTTTAGAAGGAAGATATGGGTCTCACAACAAAGCAAGAATCATTCTATTAAGATCAACCTTGCAAAACACAAAGAAGAACAACATGAAGATGACAGAGTACC
TTGCAAAAATGAAGCAAATAGCTGATGGGCTCAGCTTAGCCGGATTAGATGCTCAATATCTTCCTATTACATGTACGTTGAATGGAAAAGATGCCTTGACCTGGGAGGAG
CTACATGCTACTCTAATAGCATTTGAACAAACATTGGTCCACCTCAATGTAATCTCTGATGTCAATGACCTATCAAATGCCACAGCACACTTAGCCATTCAAAAGACATA
TTCAAACAATTACTCCAACAATAGCTTCAGCACACAAGGCCCAAATAATAGTCACAATGGCAGAAATCAATCCAATCAAAATAGTGGAAGAGATAAAAGTCGTGGCACAT
ACCGGGGAGGAAGACAAAGAGGCCGTGGAGGTCGAAATAATGGAAATAGACCTGTCTGTCAAGTTTGTGGAAAACTAGGCCATACAACAGCAGTTTGTTACAATAGATAT
GATGGAAACTACATGGGAGCACCACCAAATCATGGAAAAAATCAAACGAACTCTGCATTCATTGCAAGCCCAGAAATTGTCAATGACTCCAGCTGGTATGCTGATAGCGG
AGCCAGCAATCACATCACCTCGGATCCGAACATGCTGAATCACAAAACAGAATACACCGACAATCAAAAGTTGAAGTTGAACAATATTTTACATGTTCCTACTATTACCA
AGAACTTGCTGAGTGTATCTAAGTTGACGAATGATAATGATGCCTATATGGAATTTCACTCTGATTTTTGTGTTGTGAAGGACAAAACCACGGGTCGTGTTCTGCATCAA
GGGAAGCTTAAGGAAGGGCTATATCAATTGGAGTTAAGAAACAAGTCACCCAACCTATTTCTATGTCAGAAGAGTCCAAGATTTGTGCTTAAGTCTAGTTTCAAAGGTCA
GTATGGTTGTTCCTCTCAAAGGCAACCTTTGCAATCTAATGTCAAAGCCTATGTATCTAGTTTCAATTCCAAAAAAGAGGTGTGGCATAGGAGATTAGGCCACCCCTCTG
AAAGAGTCCTTAGTTATATTCTCAAAGATTGTAATCAAAAGTTTACTCTCAATGAAAAATGTGATTTTTGTGATGCATGCCAATTTGGTAAGAGTCATGCTCTTCCTTTC
CAAACTTCAAATTCACATGCCAAAAAACCATTAGAACTCATTCATACAGACCTATGGGGTCCTGCCCCTGTAAATTCCAGCACTGGATTCCACTATTACATCCATTTTTT
AGATGATTTTAGTCGCCACACTTGGATTTATCCTCTAAAAAACAAATCAGATTCATTGGCAGCCTTCAAACACTTTAAGAACCTAGTGGAAAACAAGTTTGACTCAAAAA
TAAAAACTCTTCAGTCAGATTGGGGAGGAGAATATAGTGAAGGGAATAAAAGTCCCCACGCAGCGGAAGCGCATCGATTGGACCTTACGCCGTATATTAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTAAAATCCTCAAAGGCAAATATTTCAGCGGAGGGGTTTTCCTAAAGGCCAAAGAGGACTGCATAGCTTCTATCGTGTGGAAAAACATATTATGGGGCAGATCTCT
ATTTGAAAAAGGTTACATATGGAGGATAGAGCTCATTCAAGCCAACGCTTCATCTCATTCAGCCTCCATCAACATGGATTTGGGAAAAGAAACAACGGTTCCAACGCCCA
CATCATCAACAACAGCTCCTGCATTCATAGGAAGCAACGTTGTTAACACTTCATCTGGAAATGGGCTAAACTCAGGACTCTCACTCCCAACAAAAACTGTTCCAAACCCA
GAGTATGAAGAATGGATAACAGTGGATCAAGCTCTCTTAGGGTGGCTTTATGGCTCAATGGAACAATCCATTGCCTCAGATGTTATAAACCATAAAACATCAATGGCTGT
CTGGAAAGCTTTAGAAGGAAGATATGGGTCTCACAACAAAGCAAGAATCATTCTATTAAGATCAACCTTGCAAAACACAAAGAAGAACAACATGAAGATGACAGAGTACC
TTGCAAAAATGAAGCAAATAGCTGATGGGCTCAGCTTAGCCGGATTAGATGCTCAATATCTTCCTATTACATGTACGTTGAATGGAAAAGATGCCTTGACCTGGGAGGAG
CTACATGCTACTCTAATAGCATTTGAACAAACATTGGTCCACCTCAATGTAATCTCTGATGTCAATGACCTATCAAATGCCACAGCACACTTAGCCATTCAAAAGACATA
TTCAAACAATTACTCCAACAATAGCTTCAGCACACAAGGCCCAAATAATAGTCACAATGGCAGAAATCAATCCAATCAAAATAGTGGAAGAGATAAAAGTCGTGGCACAT
ACCGGGGAGGAAGACAAAGAGGCCGTGGAGGTCGAAATAATGGAAATAGACCTGTCTGTCAAGTTTGTGGAAAACTAGGCCATACAACAGCAGTTTGTTACAATAGATAT
GATGGAAACTACATGGGAGCACCACCAAATCATGGAAAAAATCAAACGAACTCTGCATTCATTGCAAGCCCAGAAATTGTCAATGACTCCAGCTGGTATGCTGATAGCGG
AGCCAGCAATCACATCACCTCGGATCCGAACATGCTGAATCACAAAACAGAATACACCGACAATCAAAAGTTGAAGTTGAACAATATTTTACATGTTCCTACTATTACCA
AGAACTTGCTGAGTGTATCTAAGTTGACGAATGATAATGATGCCTATATGGAATTTCACTCTGATTTTTGTGTTGTGAAGGACAAAACCACGGGTCGTGTTCTGCATCAA
GGGAAGCTTAAGGAAGGGCTATATCAATTGGAGTTAAGAAACAAGTCACCCAACCTATTTCTATGTCAGAAGAGTCCAAGATTTGTGCTTAAGTCTAGTTTCAAAGGTCA
GTATGGTTGTTCCTCTCAAAGGCAACCTTTGCAATCTAATGTCAAAGCCTATGTATCTAGTTTCAATTCCAAAAAAGAGGTGTGGCATAGGAGATTAGGCCACCCCTCTG
AAAGAGTCCTTAGTTATATTCTCAAAGATTGTAATCAAAAGTTTACTCTCAATGAAAAATGTGATTTTTGTGATGCATGCCAATTTGGTAAGAGTCATGCTCTTCCTTTC
CAAACTTCAAATTCACATGCCAAAAAACCATTAGAACTCATTCATACAGACCTATGGGGTCCTGCCCCTGTAAATTCCAGCACTGGATTCCACTATTACATCCATTTTTT
AGATGATTTTAGTCGCCACACTTGGATTTATCCTCTAAAAAACAAATCAGATTCATTGGCAGCCTTCAAACACTTTAAGAACCTAGTGGAAAACAAGTTTGACTCAAAAA
TAAAAACTCTTCAGTCAGATTGGGGAGGAGAATATAGTGAAGGGAATAAAAGTCCCCACGCAGCGGAAGCGCATCGATTGGACCTTACGCCGTATATTAATTAA
Protein sequenceShow/hide protein sequence
MAKILKGKYFSGGVFLKAKEDCIASIVWKNILWGRSLFEKGYIWRIELIQANASSHSASINMDLGKETTVPTPTSSTTAPAFIGSNVVNTSSGNGLNSGLSLPTKTVPNP
EYEEWITVDQALLGWLYGSMEQSIASDVINHKTSMAVWKALEGRYGSHNKARIILLRSTLQNTKKNNMKMTEYLAKMKQIADGLSLAGLDAQYLPITCTLNGKDALTWEE
LHATLIAFEQTLVHLNVISDVNDLSNATAHLAIQKTYSNNYSNNSFSTQGPNNSHNGRNQSNQNSGRDKSRGTYRGGRQRGRGGRNNGNRPVCQVCGKLGHTTAVCYNRY
DGNYMGAPPNHGKNQTNSAFIASPEIVNDSSWYADSGASNHITSDPNMLNHKTEYTDNQKLKLNNILHVPTITKNLLSVSKLTNDNDAYMEFHSDFCVVKDKTTGRVLHQ
GKLKEGLYQLELRNKSPNLFLCQKSPRFVLKSSFKGQYGCSSQRQPLQSNVKAYVSSFNSKKEVWHRRLGHPSERVLSYILKDCNQKFTLNEKCDFCDACQFGKSHALPF
QTSNSHAKKPLELIHTDLWGPAPVNSSTGFHYYIHFLDDFSRHTWIYPLKNKSDSLAAFKHFKNLVENKFDSKIKTLQSDWGGEYSEGNKSPHAAEAHRLDLTPYIN