CuGenDBv2

Lag0017656 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0017656
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr5:6444289..6446464
RNA-Seq Expression: Lag0017656
Synteny: Lag0017656
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
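
The genome location above is written as chromosome:start..end. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; the function names parse_location and span_length are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr5:6444289..6446464")
print(chrom, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only span_length needs to change.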


Homology
GenBank top hits    e value    %identity    Alignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]    2.6e-126    40.2
Query:  AAAAGTTNFSRPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPL
        A+AAG+ N +  P +       S+KLDR N+ LWK+L LP++R  KL+G++ G E CP +F     I+SS S                     +K  +  
Subjt:  AAAAGTTNFSRPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPL

Query:  FESWIVVDQLLLGWLYNSMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV----
        F  W   DQ LLGW+ NSM+ E+ATQ++  E +++LW   Q L G  +R++  YL+  F   RKG +KM DYL  MK+  D L  AG+PV+T  L+    
Subjt:  FESWIVVDQLLLGWLYNSMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV----

Query:  ------------------------SQAELLVFEKRLELQNNLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFGRGSQRGGSNRGRGRGCGYGFFN
                                 QA+LL FE R+E  NNL ++LTL A+ N+AN  +   + S         N   RGS   G   GRGRG       
Subjt:  ------------------------SQAELLVFEKRLELQNNLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFGRGSQRGGSNRGRGRGCGYGFFN

Query:  NNKPVCQVCGKVGHTALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKE
        + K  CQVCG   H A+ C+ RF+K +S         +  + H       AF+A+QN         +V D +WY DSGASNHVT       + TE+ GK 
Subjt:  NNKPVCQVCGKVGHTALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKE

Query:  CVTIGNGSKLQIKYVGSSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSA
         + +GNG KL I   GSS L    K+L+L ++L VP+I KNL+SVSKLA DNN+ VEF ++ C VKDK T  V+LKG+L+DGLY+  G K          
Subjt:  CVTIGNGSKLQIKYVGSSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSA

Query:  NFEKSHSSVNNNADELPVFVVSANVVVSKSIWHRRLGHP--------------------------AVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAP
                             SA V V +S WHRRLGHP                          A ++GK H LPF++S SHA    +LVHTDVWGPAP
Subjt:  NFEKSHSSVNNNADELPVFVVSANVVVSKSIWHRRLGHP--------------------------AVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAP

Query:  VDSVHDFRFYILFVDDWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVET
        + +   F++Y+ FVDD+SRF WIYPLK+KS+ + AF  F  + +NQF   +  +Q D  GEY P+ +L  + GI  R+SCPYTSQQNGRAERKHRH+ E 
Subjt:  VDSVHDFRFYILFVDDWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVET

Query:  GLTLLAQASMPLRF
        GLTLLAQA MPL +
Subjt:  GLTLLAQASMPLRF

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]    6.3e-125    41.22
Query:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLGWLYN
        L    S+KLDR N+ LW+++ LPI+R  +L+G++ G + CP +F+ AA  S                          K+ +P FE W   DQ LLGWL N
Subjt:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLGWLYN

Query:  SMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV---------------------
        SM+  +ATQ++  E + +LW   Q L G  +R++  YL+  F   RKG +KM DYL  MK+ AD L  AG+P++T  L+                     
Subjt:  SMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV---------------------

Query:  -------SQAELLVFEKRLELQNNLKSSLTLGASVNMA-NSKEVGNQRSPNPASNGRQNGFGRGSQRGGSNRGRGRGCGYGFFNNNKPVCQVCGKVGHTA
                QA+LL FE R+E  N+L ++LTL A+ N+A  S   GN+ + N    G  N + RGS   G   GRGRG  +      K  CQVCG   H A
Subjt:  -------SQAELLVFEKRLELQNNLKSSLTLGASVNMA-NSKEVGNQRSPNPASNGRQNGFGRGSQRGGSNRGRGRGCGYGFFNNNKPVCQVCGKVGHTA

Query:  LMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIKYVG
        + C+ RF+K +           SRSNH  ++         + N  +AS  ++ D +WY DSGASNHVT   +   N +E+ GK  + +GNG KL+I   G
Subjt:  LMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIKYVG

Query:  SSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNADEL
        SS L    K+L+L ++L VP I KNL+SVSKLA DNN+ VEF ++ C VKDK T   +L+G+L+DGLY+    K +S  +S   ++ +     NN    L
Subjt:  SSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNADEL

Query:  PVFVVSANVVVSKSIWHRRLGHPAVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVDDWSRFIWIYPLKRKSDALTAFTHFVAM
         + + S NV +S S   +     A ++GK H LPF+ S SHA    +LVHTDVWGPAP+ S   F++Y+ F+DD++RF WIYPLK+KSD   AF  F  M
Subjt:  PVFVVSANVVVSKSIWHRRLGHPAVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVDDWSRFIWIYPLKRKSDALTAFTHFVAM

Query:  VKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPLRF
        V+NQF   +  +Q D  GEY P+ +   + GI  R+SCPYTSQQNGRAERKHRH+ E GLTLLAQA MPL +
Subjt:  VKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPLRF

PNX78574.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense]    6.1e-120    39.8
Query:  NQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLGW
        N L ++I S+ LDR NF LWK+L LPI+R  +L+G++ G + CP QF+ +A                     E SG    K+++P F  W   DQ +LGW
Subjt:  NQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLGW

Query:  LYNSMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV------------------
        L N+M+   A+Q++  E +++LW   Q L    +R+   YLR  F   RKG  KM DYL  MK  AD L  AGSP+T   L+                  
Subjt:  LYNSMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV------------------

Query:  ----------SQAELLVFEKRLELQNNLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFG-RGSQRGGSNRGRGRGCGYGFFNNNKPVCQVCGKVG
                   QA+LL FE RL+ Q N  ++L   A+ N+AN  +             R N +  RGS RG S R    G G G  +N+  +CQVC K G
Subjt:  ----------SQAELLVFEKRLELQNNLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFG-RGSQRGGSNRGRGRGCGYGFFNNNKPVCQVCGKVG

Query:  HTALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIK
        HTA+ C  R++K ++G   +    + +  H       AF+A++  +Q         D  WY DSGASNHVT   +     TE  GK  + +GNG+KL+I 
Subjt:  HTALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIK

Query:  YVGSSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNA
          GSS L    KNL+L ++L VP I KNL+SVSKL  DNN+ VEF +  C VKDK T  V+L+G+L+DGLY+                   S+ S   N 
Subjt:  YVGSSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNA

Query:  DELPVFVVSANVVVSKSIWHRRLGHP--------------------------AVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILF
        D  P   +S      K  WHR+LGHP                          A + GKSH LPF++S SHA    +L+HTDVWGPAP++S+  F++Y+ F
Subjt:  DELPVFVVSANVVVSKSIWHRRLGHP--------------------------AVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILF

Query:  VDDWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPLR
        +DD SRF WIYPLK+KSD + AF  F  MV+NQF   +  +Q D  GE+ P+ ++  + GI  R+SCPYTSQQNGRAERKHRHV E GLTLLAQA+M L 
Subjt:  VDDWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPLR

Query:  F
        +
Subjt:  F

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]    1.6e-120    39.48
Query:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLGWLYN
        L    S+KLDR NF LWK+L LP++R  K +G++ G + CP QFV                       + +  T   ++++P ++ W   DQ LLGWL N
Subjt:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLGWLYN

Query:  SMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV---------------------
        SM+ ++ATQV+  E +++LW   Q L G  +R+   YL+  F    K  +KM  YL  MK+ AD L  AGSP+++  L+                     
Subjt:  SMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV---------------------

Query:  -------SQAELLVFEKRLELQNNLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFG-RGSQRGGSNRGRGRGCGYGFFNN-NKPVCQVCGKVGHT
                QA+LL FE RL+  NN  +++ L AS N A+  E G             N FG RG  RG ++RG   G G    +   +P+CQ+CGK GHT
Subjt:  -------SQAELLVFEKRLELQNNLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFG-RGSQRGGSNRGRGRGCGYGFFNN-NKPVCQVCGKVGHT

Query:  ALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIKYV
        A  CY RF+K ++      +  +   +H      +AF         VASP    D  WY DSGASNHVT     L +  E +GK  + +GNG KL+I   
Subjt:  ALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIKYV

Query:  GSSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNADE
        GS+ L D    ++L N+L VP I KNL+SVSKL  DNN  VEF +++C VKDK T   +LKG L+DGLY+             SAN E        N D 
Subjt:  GSSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNADE

Query:  LPVFVVSANVVVSKSIWHRRLGHP--------------------------AVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVD
         P   +S      K IWHR+LGHP                          A +FGK H LPF+ S SHA    DL+HTDVWGPAP+ S  +F++Y+ F+D
Subjt:  LPVFVVSANVVVSKSIWHRRLGHP--------------------------AVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVD

Query:  DWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPLRF
        D+SRF WI+PLK+KS+ + AF  F  +V+NQF   +  ++ D  GEY P+ +     GI  ++SCPYTSQQNGRAERKHRHV E GLTLLAQA MPL +
Subjt:  DWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPLRF

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]    2.3e-119    39.68
Query:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLGWLYN
        L  I S+KLDR N+ LWK+L LP++R  K +G++ G + CP QFV +A  S                          K+V+P F+ W+  DQ LLGWL N
Subjt:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLGWLYN

Query:  SMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV---------------------
        SM+ ++ATQ++  E +++LW   Q L G  +++   YL+  F   RKG +KM +YL  MK+ +D L  +GSP++   L+                     
Subjt:  SMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV---------------------

Query:  -------SQAELLVFEKRLELQNNLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGF-GRGSQRGGSNRGRGRGCGYGFFNNNKPVCQVCGKVGHTA
                QA+LL FE RL+  NN  S LTL AS N AN  E             R N F  RG+ R  + RG   G G G  +N K  CQVC   GHTA
Subjt:  -------SQAELLVFEKRLELQNNLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGF-GRGSQRGGSNRGRGRGCGYGFFNNNKPVCQVCGKVGHTA

Query:  LMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIKYVG
        + C  RF++ ++G    +N        G+ S   AF         VASP    D  WY DSGASNHVT   +      E++GK  + +GNG KL+I   G
Subjt:  LMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIKYVG

Query:  SSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNADEL
        S+ L      L+L ++L VP I KNL+SVSKL  DNN+FVEF  + C VKDK T   +LKG L+DGLY+         D+S  +N +             
Subjt:  SSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNADEL

Query:  PVFVVSANVVVSKSIWHRRLGHP--------------------------AVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVDD
        P   +S      K  WHR+LGHP                          A +FGK H LPF++S SH      L+H+DVWGPAP+ S   F++Y+ F+DD
Subjt:  PVFVVSANVVVSKSIWHRRLGHP--------------------------AVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVDD

Query:  WSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPLRF
        +SRF WI+PLK+KSD + AF  F  + +NQF   +  +Q D  GEY  + ++  + GI  R+SCPYTSQQNGRAERKHRHVVE GLTLLAQA MPLR+
Subjt:  WSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPLRF

TrEMBL top hits    e value    %identity    Alignment
A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment)    3.1e-125    41.22
Query:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLGWLYN
        L    S+KLDR N+ LW+++ LPI+R  +L+G++ G + CP +F+ AA  S                          K+ +P FE W   DQ LLGWL N
Subjt:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLGWLYN

Query:  SMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV---------------------
        SM+  +ATQ++  E + +LW   Q L G  +R++  YL+  F   RKG +KM DYL  MK+ AD L  AG+P++T  L+                     
Subjt:  SMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV---------------------

Query:  -------SQAELLVFEKRLELQNNLKSSLTLGASVNMA-NSKEVGNQRSPNPASNGRQNGFGRGSQRGGSNRGRGRGCGYGFFNNNKPVCQVCGKVGHTA
                QA+LL FE R+E  N+L ++LTL A+ N+A  S   GN+ + N    G  N + RGS   G   GRGRG  +      K  CQVCG   H A
Subjt:  -------SQAELLVFEKRLELQNNLKSSLTLGASVNMA-NSKEVGNQRSPNPASNGRQNGFGRGSQRGGSNRGRGRGCGYGFFNNNKPVCQVCGKVGHTA

Query:  LMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIKYVG
        + C+ RF+K +           SRSNH  ++         + N  +AS  ++ D +WY DSGASNHVT   +   N +E+ GK  + +GNG KL+I   G
Subjt:  LMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIKYVG

Query:  SSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNADEL
        SS L    K+L+L ++L VP I KNL+SVSKLA DNN+ VEF ++ C VKDK T   +L+G+L+DGLY+    K +S  +S   ++ +     NN    L
Subjt:  SSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNADEL

Query:  PVFVVSANVVVSKSIWHRRLGHPAVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVDDWSRFIWIYPLKRKSDALTAFTHFVAM
         + + S NV +S S   +     A ++GK H LPF+ S SHA    +LVHTDVWGPAP+ S   F++Y+ F+DD++RF WIYPLK+KSD   AF  F  M
Subjt:  PVFVVSANVVVSKSIWHRRLGHPAVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVDDWSRFIWIYPLKRKSDALTAFTHFVAM

Query:  VKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPLRF
        V+NQF   +  +Q D  GEY P+ +   + GI  R+SCPYTSQQNGRAERKHRH+ E GLTLLAQA MPL +
Subjt:  VKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPLRF

A0A2K3LJ49 Retrovirus-related Pol polyprotein from transposon TNT 1-94    3.0e-120    39.8
Query:  NQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLGW
        N L ++I S+ LDR NF LWK+L LPI+R  +L+G++ G + CP QF+ +A                     E SG    K+++P F  W   DQ +LGW
Subjt:  NQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLGW

Query:  LYNSMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV------------------
        L N+M+   A+Q++  E +++LW   Q L    +R+   YLR  F   RKG  KM DYL  MK  AD L  AGSP+T   L+                  
Subjt:  LYNSMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV------------------

Query:  ----------SQAELLVFEKRLELQNNLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFG-RGSQRGGSNRGRGRGCGYGFFNNNKPVCQVCGKVG
                   QA+LL FE RL+ Q N  ++L   A+ N+AN  +             R N +  RGS RG S R    G G G  +N+  +CQVC K G
Subjt:  ----------SQAELLVFEKRLELQNNLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFG-RGSQRGGSNRGRGRGCGYGFFNNNKPVCQVCGKVG

Query:  HTALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIK
        HTA+ C  R++K ++G   +    + +  H       AF+A++  +Q         D  WY DSGASNHVT   +     TE  GK  + +GNG+KL+I 
Subjt:  HTALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIK

Query:  YVGSSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNA
          GSS L    KNL+L ++L VP I KNL+SVSKL  DNN+ VEF +  C VKDK T  V+L+G+L+DGLY+                   S+ S   N 
Subjt:  YVGSSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNA

Query:  DELPVFVVSANVVVSKSIWHRRLGHP--------------------------AVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILF
        D  P   +S      K  WHR+LGHP                          A + GKSH LPF++S SHA    +L+HTDVWGPAP++S+  F++Y+ F
Subjt:  DELPVFVVSANVVVSKSIWHRRLGHP--------------------------AVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILF

Query:  VDDWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPLR
        +DD SRF WIYPLK+KSD + AF  F  MV+NQF   +  +Q D  GE+ P+ ++  + GI  R+SCPYTSQQNGRAERKHRHV E GLTLLAQA+M L 
Subjt:  VDDWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPLR

Query:  F
        +
Subjt:  F

A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)    7.8e-121    39.48
Query:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLGWLYN
        L    S+KLDR NF LWK+L LP++R  K +G++ G + CP QFV                       + +  T   ++++P ++ W   DQ LLGWL N
Subjt:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLGWLYN

Query:  SMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV---------------------
        SM+ ++ATQV+  E +++LW   Q L G  +R+   YL+  F    K  +KM  YL  MK+ AD L  AGSP+++  L+                     
Subjt:  SMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV---------------------

Query:  -------SQAELLVFEKRLELQNNLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFG-RGSQRGGSNRGRGRGCGYGFFNN-NKPVCQVCGKVGHT
                QA+LL FE RL+  NN  +++ L AS N A+  E G             N FG RG  RG ++RG   G G    +   +P+CQ+CGK GHT
Subjt:  -------SQAELLVFEKRLELQNNLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFG-RGSQRGGSNRGRGRGCGYGFFNN-NKPVCQVCGKVGHT

Query:  ALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIKYV
        A  CY RF+K ++      +  +   +H      +AF         VASP    D  WY DSGASNHVT     L +  E +GK  + +GNG KL+I   
Subjt:  ALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIKYV

Query:  GSSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNADE
        GS+ L D    ++L N+L VP I KNL+SVSKL  DNN  VEF +++C VKDK T   +LKG L+DGLY+             SAN E        N D 
Subjt:  GSSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNADE

Query:  LPVFVVSANVVVSKSIWHRRLGHP--------------------------AVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVD
         P   +S      K IWHR+LGHP                          A +FGK H LPF+ S SHA    DL+HTDVWGPAP+ S  +F++Y+ F+D
Subjt:  LPVFVVSANVVVSKSIWHRRLGHP--------------------------AVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVD

Query:  DWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPLRF
        D+SRF WI+PLK+KS+ + AF  F  +V+NQF   +  ++ D  GEY P+ +     GI  ++SCPYTSQQNGRAERKHRHV E GLTLLAQA MPL +
Subjt:  DWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPLRF

A0A2Z6MBG6 Integrase catalytic domain-containing protein    1.2e-126    40.2
Query:  AAAAGTTNFSRPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPL
        A+AAG+ N +  P +       S+KLDR N+ LWK+L LP++R  KL+G++ G E CP +F     I+SS S                     +K  +  
Subjt:  AAAAGTTNFSRPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPL

Query:  FESWIVVDQLLLGWLYNSMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV----
        F  W   DQ LLGW+ NSM+ E+ATQ++  E +++LW   Q L G  +R++  YL+  F   RKG +KM DYL  MK+  D L  AG+PV+T  L+    
Subjt:  FESWIVVDQLLLGWLYNSMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLV----

Query:  ------------------------SQAELLVFEKRLELQNNLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFGRGSQRGGSNRGRGRGCGYGFFN
                                 QA+LL FE R+E  NNL ++LTL A+ N+AN  +   + S         N   RGS   G   GRGRG       
Subjt:  ------------------------SQAELLVFEKRLELQNNLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFGRGSQRGGSNRGRGRGCGYGFFN

Query:  NNKPVCQVCGKVGHTALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKE
        + K  CQVCG   H A+ C+ RF+K +S         +  + H       AF+A+QN         +V D +WY DSGASNHVT       + TE+ GK 
Subjt:  NNKPVCQVCGKVGHTALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKE

Query:  CVTIGNGSKLQIKYVGSSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSA
         + +GNG KL I   GSS L    K+L+L ++L VP+I KNL+SVSKLA DNN+ VEF ++ C VKDK T  V+LKG+L+DGLY+  G K          
Subjt:  CVTIGNGSKLQIKYVGSSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSA

Query:  NFEKSHSSVNNNADELPVFVVSANVVVSKSIWHRRLGHP--------------------------AVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAP
                             SA V V +S WHRRLGHP                          A ++GK H LPF++S SHA    +LVHTDVWGPAP
Subjt:  NFEKSHSSVNNNADELPVFVVSANVVVSKSIWHRRLGHP--------------------------AVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAP

Query:  VDSVHDFRFYILFVDDWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVET
        + +   F++Y+ FVDD+SRF WIYPLK+KS+ + AF  F  + +NQF   +  +Q D  GEY P+ +L  + GI  R+SCPYTSQQNGRAERKHRH+ E 
Subjt:  VDSVHDFRFYILFVDDWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVET

Query:  GLTLLAQASMPLRF
        GLTLLAQA MPL +
Subjt:  GLTLLAQASMPLRF

A0A803PEH4 Uncharacterized protein    4.7e-126    39.53
Query:  NSIETSAAAAGTTNFSRPPLNQL----LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSG
        NS    A+++  TN +    N      LNQ  S+KLDR N+ LWK +   I+R ++L G+LSG   CPP+FV                            
Subjt:  NSIETSAAAAGTTNFSRPPLNQL----LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSG

Query:  TAPAKEVSPLFESWIVVDQLLLGWLYNSMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPV
            +  +P +E+WI+ DQLL+GWLY+SM+  +AT+VMG  +A  L   ++ L+G  S+++ D  R + Q  RKGS  M +YLR  K+ ++ L  AG P 
Subjt:  TAPAKEVSPLFESWIVVDQLLLGWLYNSMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPV

Query:  TTRSLVS------QAELLVFEKRLELQNN------------LKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFGRGSQ---------------RGG
            LV+       AE L    ++E ++N              S +    ++ + ++K   +    N A+    NG GRG Q               RG 
Subjt:  TTRSLVS------QAELLVFEKRLELQNN------------LKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFGRGSQ---------------RGG

Query:  SNRGRGRGCGYGFFNNNKPVCQVCGKVGHTALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTA
        SNR RGRG G G  + ++P CQV GK GHTA +CY RF++ + G        D  + H   +Q  A   N N +  VA+PE +    W+ADSGASNH+T+
Subjt:  SNRGRGRGCGYGFFNNNKPVCQVCGKVGHTALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTA

Query:  DYNNLANPTEYDGKECVTIGNGSKLQIKYVGSSCLTDGTKN-LSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLY
        D  NL    +Y+GKE V +GNGSKL+I ++G+  L   + N L L++ML VP IAKNLVSVSKLA DNNV +EF+ +FCLVKDK T  V+L GVL+D LY
Subjt:  DYNNLANPTEYDGKECVTIGNGSKLQIKYVGSSCLTDGTKN-LSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLY

Query:  RFEGVKATSVDISQSANFEKSHS-SVNNNADELPVFVVSANVVVSKSIWHRRLGHPAVK--------------------------FGKSHALPFQNSVSH
        + +     S    Q +NF  + + SV++N ++       + ++    + HRRLGHP++K                          +GK+HALPF++S + 
Subjt:  RFEGVKATSVDISQSANFEKSHS-SVNNNADELPVFVVSANVVVSKSIWHRRLGHPAVK--------------------------FGKSHALPFQNSVSH

Query:  AFSKFDLVHTDVWGPAPVDSVHDFRFYILFVDDWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYT
        A S  DL+HTD+WGPAP+ S  +  +YI FVDD+SR+ W+YPLK KSDAL AF  F A+V+NQF   + +L+SD+ GEY P   L    GI  +  CP+T
Subjt:  AFSKFDLVHTDVWGPAPVDSVHDFRFYILFVDDWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYT

Query:  SQQNGRAERKHRHVVETGLTLLAQAS
        S QNGRA+RKHRH VE GLTLLAQA+
Subjt:  SQQNGRAERKHRHVVETGLTLLAQAS

SwissProt top hits    e value    %identity    Alignment
P04146 Copia protein    1.8e-21    24.05
Query:  KPVCQVCGKVGHTALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECV
        K  C  CG+ GH    C+      +   + N+N  + +     +S   AFM  +  N  V       +  +  DSGAS+H+  D +   +  E      +
Subjt:  KPVCQVCGKVGHTALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECV

Query:  TIGNGSKLQIKYVGSSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANF
         +    +                 ++LE++L     A NL+SV +L Q+  + +EF      +       V   G+L +        +A S++     NF
Subjt:  TIGNGSKLQIKYVGSSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANF

Query:  EKSHSSVNNNADELPVFVVSANVVVSKSIWHR-----RLGHPAVKFGKSHALPFQ--NSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVDDWSRFI
           H    + +D   + +   N+   +S+ +       +  P +  GK   LPF+     +H      +VH+DV GP    ++ D  ++++FVD ++ + 
Subjt:  EKSHSSVNNNADELPVFVVSANVVVSKSIWHR-----RLGHPAVKFGKSHALPFQ--NSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVDDWSRFI

Query:  WIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYA--PIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPLRF
          Y +K KSD  + F  FVA  +  F   V  L  DN  EY    + + C + GI+  L+ P+T Q NG +ER  R + E   T+++ A +   F
Subjt:  WIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYA--PIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPLRF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    1.0e-32    24.58
Query:  ESWIVVDQLLLGWLYNSMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYL-RQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLVSQAE
        E W  +D+     +   +S +V   ++  + A+ +WT ++ L+  ++   + YL +Q++            +L V       L   G  +          
Subjt:  ESWIVVDQLLLGWLYNSMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYL-RQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLVSQAE

Query:  LLVFEKRLELQNNLKSSLTLGASV----NMANSKEVGNQRSPNPASNGR---QNGFGRGSQRGGSNRGRGRGCGYGFFNNNKPV--CQVCGKVGHTALMC
        +L+        +NL +++  G +     ++ ++  +  +    P + G+     G GR  QR  +N GR    G     +   V  C  C + GH    C
Subjt:  LLVFEKRLELQNNLKSSLTLGASV----NMANSKEVGNQRSPNPASNGR---QNGFGRGSQRGGSNRGRGRGCGYGFFNNNKPV--CQVCGKVGHTALMC

Query:  --YQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIKYVGS
           ++   E SG    +N  ++ +   N+     F+  +     ++ PE+     W  D+ AS+H T   +        D    V +GN S  +I  +G 
Subjt:  --YQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIKYVGS

Query:  SCL-TDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKST--SNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNAD
         C+ T+    L L+++  VP +  NL+S   L +D      +  +F   K + T  S V+ KGV R  LYR               N E     +N   D
Subjt:  SCL-TDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKST--SNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNAD

Query:  ELPVFVVSANVVVSKSIWHRRLGHPAVK--------------------------FGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFV
        E           +S  +WH+R+GH + K                          FGK H + FQ S     +  DLV++DV GP  ++S+   ++++ F+
Subjt:  ELPVFVVSANVVVSKSIWHRRLGHPAVK--------------------------FGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFV

Query:  DDWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYA--PIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPL
        DD SR +W+Y LK K      F  F A+V+ +    +  L+SDN GEY        CS  GI    + P T Q NG AER +R +VE   ++L  A +P 
Subjt:  DDWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYA--PIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMPL

Query:  RF
         F
Subjt:  RF

Q12490 Transposon Ty1-BL Gag-Pol polyprotein    8.0e-14    29.88
Query:  VVSKSIWHRRLGHPAVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVDDWSRFIWIYPL--KRKSDALTAFTHFVAMVKNQFKT
        ++ KS  HR +        K   L +QNS    +  F  +HTD++GP          ++I F D+ ++F W+YPL  +R+   L  FT  +A +KNQF+ 
Subjt:  VVSKSIWHRRLGHPAVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVDDWSRFIWIYPL--KRKSDALTAFTHFVAMVKNQFKT

Query:  TVTALQSDNRGEYA--PIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMP
        +V  +Q D   EY    +H+   + GI    +    S+ +G AER +R +++   T L  + +P
Subjt:  TVTALQSDNRGEYA--PIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    6.9e-66    28.21
Query:  LNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLG
        LN  ++ +T  KL   N+L+W      +   Y+L G L G+   PP  +                           GT  A  V+P +  W   D+L+  
Subjt:  LNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLG

Query:  WLYNSMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLVSQA--------------
         +  ++S  V   V     A ++W  ++ ++   S      LR   +Q  KG+  + DY++ + +  D L   G P+     V +               
Subjt:  WLYNSMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLVSQA--------------

Query:  --------ELLVFEKRLELQNNLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFGRGSQRGGSNRGR-GRGCGYGFFNNN---KPV---CQVCGKV
                 L    +RL L +  K      A+V    +  V ++ +    +N   N   R   R  +N  +  +     F  NN   KP    CQ+CG  
Subjt:  --------ELLVFEKRLELQNNLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFGRGSQRGGSNRGR-GRGCGYGFFNNN---KPV---CQVCGKV

Query:  GHTALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQI
        GH+A  C Q   + F   +             NS QP +          +A        NW  DSGA++H+T+D+NNL+    Y G + V + +GS + I
Subjt:  GHTALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQPTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQI

Query:  KYVGSSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSS----
         + GS+ L+  ++ L+L N+L VP+I KNL+SV +L   N V VEF      VKD +T   +L+G  +D LY +    +  V +  S + + +HSS    
Subjt:  KYVGSSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSS----

Query:  VNNNADELPVFVVSANVVVSKSIWHRRLGHPAVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVDDWSRFIWIYPLKRKSDALT
        + + A  +   V+S   +   +  H+ L        KS+ +PF  S  ++    + +++DVW  +P+ S  ++R+Y++FVD ++R+ W+YPLK+KS    
Subjt:  VNNNADELPVFVVSANVVVSKSIWHRRLGHPAVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVDDWSRFIWIYPLKRKSDALT

Query:  AFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMP
         F  F  +++N+F+T +    SDN GE+  +    SQ GI+   S P+T + NG +ERKHRH+VETGLTLL+ AS+P
Subjt:  AFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    1.5e-65    28.92
Query:  LNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLG
        LN  ++ +T  KL   N+L+W      +   Y+L G L G+ P PP  +                           GT     V+P +  W   D+L+  
Subjt:  LNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPLFESWIVVDQLLLG

Query:  WLYNSMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMP-DYLRVMKSHADNLGQAGSPVTTRSLVSQAELLVFEKRLELQN
         +  ++S  V   V     A ++W  ++ ++   S      LR + +  +   L  P D+   ++   +NL     PV  +         + E    L N
Subjt:  WLYNSMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMP-DYLRVMKSHADNLGQAGSPVTTRSLVSQAELLVFEKRLELQN

Query:  NLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFGRGSQRGGSNRGRGRGCGYGFFNNN---KPV---CQVCGKVGHTALMCYQRFNKEFSGPIVNQ
             L L ++  +  +  V   R+ N   N    G  R      +     +    G  ++N   KP    CQ+C   GH+A  C Q             
Subjt:  NLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFGRGSQRGGSNRGRGRGCGYGFFNNN---KPV---CQVCGKVGHTALMCYQRFNKEFSGPIVNQ

Query:  NMGDSRSNHGNSSQP-TAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIKYVGSSCLTDGTKNLSLENML
        +   S +N   S+ P T +    N+   V SP      NW  DSGA++H+T+D+NNL+    Y G + V I +GS + I + GS+ L   +++L L  +L
Subjt:  NMGDSRSNHGNSSQP-TAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIKYVGSSCLTDGTKNLSLENML

Query:  CVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNADELPVFVVSANVVVSKSI--
         VP+I KNL+SV +L   N V VEF      VKD +T   +L+G  +D LY +    + +V +  S   + +HSS ++     P   +  +V+ + S+  
Subjt:  CVPSIAKNLVSVSKLAQDNNVFVEFHDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNADELPVFVVSANVVVSKSI--

Query:  ---WHRRLGHPAVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVDDWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTAL
            H+ L        KSH +PF NS   +    + +++DVW  +P+ S+ ++R+Y++FVD ++R+ W+YPLK+KS     F  F ++V+N+F+T +  L
Subjt:  ---WHRRLGHPAVKFGKSHALPFQNSVSHAFSKFDLVHTDVWGPAPVDSVHDFRFYILFVDDWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTAL

Query:  QSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMP
         SDN GE+  +    SQ GI+   S P+T + NG +ERKHRH+VE GLTLL+ AS+P
Subjt:  QSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTLLAQASMP

Arabidopsis top hits | e value | %identity | Alignment
No hits found

Sequences
CDS sequence
ATGACTAGCGTCAATTCGATCGAGACCTCCGCCGCAGCTGCTGGAACAACGAATTTTAGTCGCCCACCTTTGAATCAACTGTTGAACCAGATTACATCTATAAAACTTGA
TCGAGGGAATTTCTTGTTATGGAAGAATTTGGCATTGCCAATTCTTCGCAGTTACAAATTAGAGGGTCATCTTTCAGGCGCCGAACCTTGCCCTCCACAGTTCGTTCAGG
CAGCTGCCATTAGTTCCTCACAGTCGGCAGGAGGAGGAGGAGCTTCAAGCTCTGAAGCTGCAGCAAGTGAAGTATCCGGTACTGCCCCAGCGAAAGAAGTTAGTCCACTA
TTTGAGTCGTGGATAGTAGTCGATCAATTACTGCTCGGTTGGCTTTACAATTCTATGAGTCCTGAAGTTGCCACTCAGGTAATGGGCTTTGAGAATGCCCAGGAACTGTG
GACTGCCATACAAGATCTTTTTGGAGTCCAGTCTCGTGCCGAGGAGGATTATTTGCGTCAAGTGTTTCAACAGTGCAGGAAAGGGAGTCTGAAAATGCCCGATTATCTTC
GAGTAATGAAGAGCCACGCCGACAACTTGGGTCAAGCTGGGAGCCCAGTTACAACTCGCTCTCTGGTATCTCAGGCTGAACTGTTAGTCTTTGAAAAGCGTCTGGAGCTA
CAAAACAACCTAAAGTCCTCACTGACGTTGGGTGCATCAGTTAACATGGCAAATAGCAAAGAAGTAGGAAATCAGAGGAGTCCAAATCCGGCCAGCAATGGAAGACAGAA
CGGGTTTGGTAGAGGAAGTCAGAGAGGGGGATCAAATCGGGGCAGAGGAAGAGGTTGTGGCTATGGCTTCTTTAACAATAACAAACCGGTTTGCCAGGTATGTGGAAAGG
TAGGGCACACTGCTCTTATGTGTTATCAACGATTCAATAAAGAGTTCTCTGGTCCTATAGTTAATCAGAACATGGGAGATAGCCGTTCAAATCATGGGAATTCTTCTCAA
CCTACTGCGTTTATGGCCAATCAAAACATGAATCAATGTGTTGCATCCCCTGAAACTGTGATCGACCCAAATTGGTACGCAGATAGTGGTGCATCCAACCACGTCACTGC
AGACTACAACAACCTAGCCAACCCTACTGAATATGATGGTAAGGAGTGTGTTACGATTGGCAATGGCAGTAAGTTACAAATTAAGTATGTTGGGAGCTCTTGTTTAACAG
ATGGTACTAAAAACCTTAGTCTAGAAAATATGTTGTGTGTTCCTAGTATTGCTAAGAACTTGGTTAGTGTCTCTAAATTAGCTCAAGACAATAATGTGTTTGTGGAATTT
CATGATCATTTTTGTCTTGTAAAGGACAAGTCTACGAGCAATGTGGTGCTGAAAGGAGTGCTTCGTGATGGGCTATACCGTTTTGAAGGAGTAAAAGCTACGTCAGTGGA
TATTTCTCAATCAGCAAATTTTGAAAAGAGTCACTCTAGTGTCAATAACAATGCTGATGAACTTCCTGTGTTTGTTGTTAGTGCAAATGTTGTCGTTTCTAAGTCCATCT
GGCATAGACGTCTTGGTCATCCCGCAGTTAAATTTGGAAAGTCTCACGCATTACCATTTCAGAACTCTGTCTCTCATGCATTCTCTAAATTCGATCTTGTTCACACGGAT
GTCTGGGGTCCAGCACCTGTTGATTCTGTTCATGATTTTCGGTTTTATATACTGTTTGTGGATGATTGGAGTCGGTTTATTTGGATTTATCCTTTAAAAAGGAAGAGTGA
TGCTCTAACAGCCTTTACTCACTTTGTTGCAATGGTCAAAAATCAGTTCAAGACAACAGTAACAGCCTTACAGTCAGACAACAGGGGAGAATATGCACCTATCCACAGAC
TATGTAGTCAGTTGGGCATTAATACTCGTTTGTCGTGTCCCTATACATCTCAACAGAATGGCAGAGCAGAACGGAAGCATCGCCATGTAGTAGAAACAGGGTTAACGTTA
CTAGCCCAAGCATCAATGCCATTAAGGTTCTAG
mRNA sequence
ATGACTAGCGTCAATTCGATCGAGACCTCCGCCGCAGCTGCTGGAACAACGAATTTTAGTCGCCCACCTTTGAATCAACTGTTGAACCAGATTACATCTATAAAACTTGA
TCGAGGGAATTTCTTGTTATGGAAGAATTTGGCATTGCCAATTCTTCGCAGTTACAAATTAGAGGGTCATCTTTCAGGCGCCGAACCTTGCCCTCCACAGTTCGTTCAGG
CAGCTGCCATTAGTTCCTCACAGTCGGCAGGAGGAGGAGGAGCTTCAAGCTCTGAAGCTGCAGCAAGTGAAGTATCCGGTACTGCCCCAGCGAAAGAAGTTAGTCCACTA
TTTGAGTCGTGGATAGTAGTCGATCAATTACTGCTCGGTTGGCTTTACAATTCTATGAGTCCTGAAGTTGCCACTCAGGTAATGGGCTTTGAGAATGCCCAGGAACTGTG
GACTGCCATACAAGATCTTTTTGGAGTCCAGTCTCGTGCCGAGGAGGATTATTTGCGTCAAGTGTTTCAACAGTGCAGGAAAGGGAGTCTGAAAATGCCCGATTATCTTC
GAGTAATGAAGAGCCACGCCGACAACTTGGGTCAAGCTGGGAGCCCAGTTACAACTCGCTCTCTGGTATCTCAGGCTGAACTGTTAGTCTTTGAAAAGCGTCTGGAGCTA
CAAAACAACCTAAAGTCCTCACTGACGTTGGGTGCATCAGTTAACATGGCAAATAGCAAAGAAGTAGGAAATCAGAGGAGTCCAAATCCGGCCAGCAATGGAAGACAGAA
CGGGTTTGGTAGAGGAAGTCAGAGAGGGGGATCAAATCGGGGCAGAGGAAGAGGTTGTGGCTATGGCTTCTTTAACAATAACAAACCGGTTTGCCAGGTATGTGGAAAGG
TAGGGCACACTGCTCTTATGTGTTATCAACGATTCAATAAAGAGTTCTCTGGTCCTATAGTTAATCAGAACATGGGAGATAGCCGTTCAAATCATGGGAATTCTTCTCAA
CCTACTGCGTTTATGGCCAATCAAAACATGAATCAATGTGTTGCATCCCCTGAAACTGTGATCGACCCAAATTGGTACGCAGATAGTGGTGCATCCAACCACGTCACTGC
AGACTACAACAACCTAGCCAACCCTACTGAATATGATGGTAAGGAGTGTGTTACGATTGGCAATGGCAGTAAGTTACAAATTAAGTATGTTGGGAGCTCTTGTTTAACAG
ATGGTACTAAAAACCTTAGTCTAGAAAATATGTTGTGTGTTCCTAGTATTGCTAAGAACTTGGTTAGTGTCTCTAAATTAGCTCAAGACAATAATGTGTTTGTGGAATTT
CATGATCATTTTTGTCTTGTAAAGGACAAGTCTACGAGCAATGTGGTGCTGAAAGGAGTGCTTCGTGATGGGCTATACCGTTTTGAAGGAGTAAAAGCTACGTCAGTGGA
TATTTCTCAATCAGCAAATTTTGAAAAGAGTCACTCTAGTGTCAATAACAATGCTGATGAACTTCCTGTGTTTGTTGTTAGTGCAAATGTTGTCGTTTCTAAGTCCATCT
GGCATAGACGTCTTGGTCATCCCGCAGTTAAATTTGGAAAGTCTCACGCATTACCATTTCAGAACTCTGTCTCTCATGCATTCTCTAAATTCGATCTTGTTCACACGGAT
GTCTGGGGTCCAGCACCTGTTGATTCTGTTCATGATTTTCGGTTTTATATACTGTTTGTGGATGATTGGAGTCGGTTTATTTGGATTTATCCTTTAAAAAGGAAGAGTGA
TGCTCTAACAGCCTTTACTCACTTTGTTGCAATGGTCAAAAATCAGTTCAAGACAACAGTAACAGCCTTACAGTCAGACAACAGGGGAGAATATGCACCTATCCACAGAC
TATGTAGTCAGTTGGGCATTAATACTCGTTTGTCGTGTCCCTATACATCTCAACAGAATGGCAGAGCAGAACGGAAGCATCGCCATGTAGTAGAAACAGGGTTAACGTTA
CTAGCCCAAGCATCAATGCCATTAAGGTTCTAG
Protein sequence
MTSVNSIETSAAAAGTTNFSRPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGAEPCPPQFVQAAAISSSQSAGGGGASSSEAAASEVSGTAPAKEVSPL
FESWIVVDQLLLGWLYNSMSPEVATQVMGFENAQELWTAIQDLFGVQSRAEEDYLRQVFQQCRKGSLKMPDYLRVMKSHADNLGQAGSPVTTRSLVSQAELLVFEKRLEL
QNNLKSSLTLGASVNMANSKEVGNQRSPNPASNGRQNGFGRGSQRGGSNRGRGRGCGYGFFNNNKPVCQVCGKVGHTALMCYQRFNKEFSGPIVNQNMGDSRSNHGNSSQ
PTAFMANQNMNQCVASPETVIDPNWYADSGASNHVTADYNNLANPTEYDGKECVTIGNGSKLQIKYVGSSCLTDGTKNLSLENMLCVPSIAKNLVSVSKLAQDNNVFVEF
HDHFCLVKDKSTSNVVLKGVLRDGLYRFEGVKATSVDISQSANFEKSHSSVNNNADELPVFVVSANVVVSKSIWHRRLGHPAVKFGKSHALPFQNSVSHAFSKFDLVHTD
VWGPAPVDSVHDFRFYILFVDDWSRFIWIYPLKRKSDALTAFTHFVAMVKNQFKTTVTALQSDNRGEYAPIHRLCSQLGINTRLSCPYTSQQNGRAERKHRHVVETGLTL
LAQASMPLRF