CuGenDBv2

Lag0028888 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028888
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr8:32335390..32337449
RNA-Seq Expression: Lag0028888
Synteny: Lag0028888
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR025724 - GAG-pre-integrase domain
    IPR036397 - Ribonuclease H superfamily
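
The genome location above follows a "seqid:start..end" pattern. As a minimal sketch, the string can be split into its parts and the span length computed. The names Locus and parse_locus are hypothetical helpers introduced here for illustration, and the length assumes 1-based inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of any database API)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
        return self.end - self.start + 1


# Matches strings such as "chr8:32335390..32337449".
_LOCUS_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_locus(text: str) -> Locus:
    """Parse a 'seqid:start..end' location string into a Locus."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        # Normalise so that start <= end regardless of how the interval was written.
        start, end = end, start
    return Locus(match["seqid"], start, end)


locus = parse_locus("chr8:32335390..32337449")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the length is simply end minus start plus one; a 0-based half-open convention would drop the "plus one".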


Homology
GenBank top hits (e value, % identity, alignment)
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]; e value 8.0e-109; identity 37.96%
Query:  TVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWLYNSMT
        +VKL+R NY LWK+L LP +R  +L+G++ G + CP +F+++S                      +S ++       N  +  W   D+ LLGW+ NSMT
Subjt:  TVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWLYNSMT

Query:  PEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQGRIGI
         E+ATQ++  E +K LW   Q L G  +R++  YL+  F   RKG +KM DYL  MK+  D L  AG+PV T  LI Q L GLD EYN VV  +  +  +
Subjt:  PEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQGRIGI

Query:  SWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQICGKNGHSALACYQ
        SW ++Q +LL FE R+E  N+  + T   +A   N +          +  SS N  + +N +   GGR RG+   +G N     CQ+CG + H A+ C+ 
Subjt:  SWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQICGKNGHSALACYQ

Query:  RFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVTADYNNMMHTSEYG---------------
        RFDK +       +R+  + G++ Q                   + N F+AS  +V D +WY DSGASNHVT         +E+                
Subjt:  RFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVTADYNNMMHTSEYG---------------

Query:  ----------------------AKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQL-GTRITRSALGSVGSNLKSANKSVSHSAFI
                               KNL+SVSKLA DNN+ +EF  + C VKD  T KV+L+G+LKDGLYQL GT+   SA  SV                 
Subjt:  ----------------------AKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQL-GTRITRSALGSVGSNLKSANKSVSHSAFI

Query:  TSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVHFLDDF
                   K  WHRRLGHP+ K+L+ +++ C + V  +D F+FCEAC++GK H LPF +S SHA  P +L+HTD+WGPA +M++ G++YYVHF+DDF
Subjt:  TSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVHFLDDF

Query:  SRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV
        SRF W+YPLK KS+TV  F  F  + + QFNK +KV+Q D GGEYK V
Subjt:  SRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV
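
In alignment blocks like the one above, the unlabeled middle row is the match line: by the usual BLAST convention, a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch under that assumption, identities and positives for one segment can be counted from the Query row and the match line alone. The name midline_stats is a hypothetical helper, and the strings are copied from the last segment of the block above.

```python
from typing import NamedTuple


class MidlineStats(NamedTuple):
    """Counts for one alignment segment (hypothetical helper type)."""
    length: int      # number of alignment columns, gaps included
    identities: int  # columns where the match line shows a letter
    positives: int   # identities plus columns marked '+'


def midline_stats(query: str, midline: str) -> MidlineStats:
    """Count identities and positives from a BLAST-style match line.

    The match line is padded (or trimmed) to the query length because
    trailing spaces are often lost when a page is copied as text.
    """
    midline = midline.ljust(len(query))[: len(query)]
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = sum(1 for ch in midline if ch == "+")
    return MidlineStats(len(query), identities, identities + conservative)


def percent_identity(stats: MidlineStats) -> float:
    """Identities as a percentage of alignment columns (0.0 for an empty segment)."""
    return 100.0 * stats.identities / stats.length if stats.length else 0.0


# Final segment of the first GenBank hit, copied from the block above.
query = "SRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV"
midline = "SRF W+YPLK KS+TV  F  F  + + QFNK +KV+Q D GGEYK V"

stats = midline_stats(query, midline)
print(stats, round(percent_identity(stats), 2))
```

A single segment will generally not reproduce the identity reported for the whole hit, since that figure is taken over the full alignment; summing the counts over all segments of a hit would be the closer comparison.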

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]; e value 1.6e-109; identity 37.42%
Query:  LNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWL
        L    +VKL+R NY LW+++ LP +R  RL+G++ G K CP +F++ +                              +   NPE+E W   D+ LLGWL
Subjt:  LNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWL

Query:  YNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQ
         NSMT  +ATQ++  E +  LW   Q L G  +R++  YL+  F  TRKG +KM DYL  MK+ AD L  AG+P+ T  LI Q L GLD EYN VV  + 
Subjt:  YNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQ

Query:  GRIGISWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQICGKNGHSA
         +  +SW ++Q +LL FE R+E  NS  + T   +A   N+A K + +    + NS+ N +  NN   G   R    GR  G  + +  CQ+CG + H A
Subjt:  GRIGISWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQICGKNGHSA

Query:  LACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVTADYNNMMHTSEYG----------
        + C+ RFDK +   + +         NN + G                 + N F+AS  ++ D +WY DSGASNHVT   +   + SE+           
Subjt:  LACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVTADYNNMMHTSEYG----------

Query:  ---------------------------AKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITRSALGSVGSNLKSANKSVSH
                                    KNL+SVSKLA DNN+ +EF  + C VKD  T K +LRG+LKDGLYQL  +                      
Subjt:  ---------------------------AKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITRSALGSVGSNLKSANKSVSH

Query:  SAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVHF
             S  Y ++   K  WHR+LGHP+ K+L+ ++K CN+ +S +D F+FCEAC++GK H LPF  S SHA    +L+HTD+WGPA ++S+ G++YYVHF
Subjt:  SAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVHF

Query:  LDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV
        +DDF+RF W+YPLK KSDT   F  F  M++ QF+K +K +Q D GGEYK V
Subjt:  LDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV

PNX78574.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense]; e value 7.5e-107; identity 38.39%
Query:  NQLLNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLL
        N L ++I +V L+R N+ LWK+L LP +R  RL+G++ G K CP +F++++                     E SG+       INP++  W   D+ +L
Subjt:  NQLLNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLL

Query:  GWLYNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVA
        GWL N+MT   A+Q++  E +K LW   Q L    +R+   YLR  F  TRKG  KM DYL  MK  AD L  AGSP+    LI Q L GLD +YN +V 
Subjt:  GWLYNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVA

Query:  IIQGRIGISWSEMQVELLVFEKRLETQNS----QRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQIC
         +  +I +SW ++Q +LL FE RL+  NS     R++TT       N+A K   +          N   +     G   RN   GR  G  +N  +CQ+C
Subjt:  IIQGRIGISWSEMQVELLVFEKRLETQNS----QRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQIC

Query:  GKNGHSALACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVTADYNNMMHTSEYG---
         K+GH+A+ C  R+DK + G S + N NV                          +  N F+AS     D  WY DSGASNHVT   +     +E     
Subjt:  GKNGHSALACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVTADYNNMMHTSEYG---

Query:  ----------------------------------AKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITRSALGSVGSNLKS
                                           KNL+SVSKL  DNN+ +EF  D C VKD  T KV+LRG+LKDGLYQL          S GS+   
Subjt:  ----------------------------------AKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITRSALGSVGSNLKS

Query:  ANKSVSHSAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDG
         NK           +Y +V   K  WHR+LGHPS  +L+ ++K CN+  S +D F FCEAC+ GKSH LPF +S SHA    +LIHTD+WGPA + S  G
Subjt:  ANKSVSHSAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDG

Query:  YRYYVHFLDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV
        ++YYVHF+DD SRF W+YPLK KSDT+  F  F  M++ QFNK +K++Q D GGE+K V
Subjt:  YRYYVHFLDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]; e value 3.4e-107; identity 38.13%
Query:  LNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWL
        L    +VKL+R N+ LWK+L LP +R  + +G++ G K CP +F+       T + N                     T  INP+Y+ W   D+ LLGWL
Subjt:  LNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWL

Query:  YNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQ
         NSMT ++ATQV+  E +K LW   Q L G  +R+   YL+  F  T K  +KM  YL  MK+ AD L  AGSP+ +  L+ Q L GLD EYN VV  + 
Subjt:  YNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQ

Query:  GRIGISWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQICGKNGHSA
         +  ISW + Q +LL FE RL+  N+  +     SA   +    G  + GSR      N +          G   G+GR   +   R +CQICGK GH+A
Subjt:  GRIGISWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQICGKNGHSA

Query:  LACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVT-----------------------
          CY RFDK +            T  N++  G  S                + F+ASP    D  WY DSGASNHVT                       
Subjt:  LACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVT-----------------------

Query:  ---------------ADYNNMMHTSEYGAKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITRSALGSVGSNLKSANKSVS
                        +  N+++  E   KNL+SVSKL  DNN  +EF  + C VKD  T K +L+G LKDGLYQL                 SANK   
Subjt:  ---------------ADYNNMMHTSEYGAKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITRSALGSVGSNLKSANKSVS

Query:  HSAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVH
             T+      +  K +WHR+LGHP+ K+L  ++K  N+ +S +D F FCEAC+FGK H LPF  S SHA  P DLIHTD+WGPA ++S   ++YYVH
Subjt:  HSAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVH

Query:  FLDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV
        FLDDFSRF W++PLK KS+T+  F  F  +++ QFNK +KV++ D GGEYK V
Subjt:  FLDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]; e value 2.4e-105; identity 36.66%
Query:  LNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWL
        L  I +VKL+R NY LWK+L LP +R  + +G++ G K CP +F++                            S+  +  +NP+++ W+  D+ LLGWL
Subjt:  LNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWL

Query:  YNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQ
         NSM  ++ATQ++  E +K LW   Q L G  +++   YL+  F  TRKG +KM +YL  MK+ +D L  +GSP+    L+ Q L GLD EYN VV  + 
Subjt:  YNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQ

Query:  GRIGISWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQICGKNGHSA
         +I +SW ++Q +LL FE RL+  N+    T   SA   N A K   +        +  R  +   + G     RG+GR +        CQ+C   GH+A
Subjt:  GRIGISWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQICGKNGHSA

Query:  LACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVTADYNNMMHTSEYG----------
        + C  RFD+ + G      RN ST  +  + G  S                  F+ASP    D  WY DSGASNHVT   +     +E+           
Subjt:  LACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVTADYNNMMHTSEYG----------

Query:  ---------------------------AKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITRSALGSVGSNLKSANKSVSH
                                    KNL+SVSKL  DNN+ +EF A+ C VKD  T + +L+G LKDGLYQL                      VS 
Subjt:  ---------------------------AKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITRSALGSVGSNLKSANKSVSH

Query:  SAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVHF
         +     +Y +V   K  WHR+LGHP+ K+L  ++K CN+ +S +D F+FCEAC+FGK H LPF +S SH   P  LIH+D+WGPA ++S  G++YYVHF
Subjt:  SAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVHF

Query:  LDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV
        +DDFSRF W++PLK KSDT+  F  F  + + QFNK +K++Q D GGEYK V
Subjt:  LDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV

TrEMBL top hits (e value, % identity, alignment)
A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment); e value 7.8e-110; identity 37.42%
Query:  LNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWL
        L    +VKL+R NY LW+++ LP +R  RL+G++ G K CP +F++ +                              +   NPE+E W   D+ LLGWL
Subjt:  LNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWL

Query:  YNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQ
         NSMT  +ATQ++  E +  LW   Q L G  +R++  YL+  F  TRKG +KM DYL  MK+ AD L  AG+P+ T  LI Q L GLD EYN VV  + 
Subjt:  YNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQ

Query:  GRIGISWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQICGKNGHSA
         +  +SW ++Q +LL FE R+E  NS  + T   +A   N+A K + +    + NS+ N +  NN   G   R    GR  G  + +  CQ+CG + H A
Subjt:  GRIGISWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQICGKNGHSA

Query:  LACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVTADYNNMMHTSEYG----------
        + C+ RFDK +   + +         NN + G                 + N F+AS  ++ D +WY DSGASNHVT   +   + SE+           
Subjt:  LACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVTADYNNMMHTSEYG----------

Query:  ---------------------------AKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITRSALGSVGSNLKSANKSVSH
                                    KNL+SVSKLA DNN+ +EF  + C VKD  T K +LRG+LKDGLYQL  +                      
Subjt:  ---------------------------AKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITRSALGSVGSNLKSANKSVSH

Query:  SAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVHF
             S  Y ++   K  WHR+LGHP+ K+L+ ++K CN+ +S +D F+FCEAC++GK H LPF  S SHA    +L+HTD+WGPA ++S+ G++YYVHF
Subjt:  SAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVHF

Query:  LDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV
        +DDF+RF W+YPLK KSDT   F  F  M++ QF+K +K +Q D GGEYK V
Subjt:  LDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV

A0A2K3LJ49 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 3.6e-107; identity 38.39%
Query:  NQLLNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLL
        N L ++I +V L+R N+ LWK+L LP +R  RL+G++ G K CP +F++++                     E SG+       INP++  W   D+ +L
Subjt:  NQLLNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLL

Query:  GWLYNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVA
        GWL N+MT   A+Q++  E +K LW   Q L    +R+   YLR  F  TRKG  KM DYL  MK  AD L  AGSP+    LI Q L GLD +YN +V 
Subjt:  GWLYNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVA

Query:  IIQGRIGISWSEMQVELLVFEKRLETQNS----QRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQIC
         +  +I +SW ++Q +LL FE RL+  NS     R++TT       N+A K   +          N   +     G   RN   GR  G  +N  +CQ+C
Subjt:  IIQGRIGISWSEMQVELLVFEKRLETQNS----QRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQIC

Query:  GKNGHSALACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVTADYNNMMHTSEYG---
         K+GH+A+ C  R+DK + G S + N NV                          +  N F+AS     D  WY DSGASNHVT   +     +E     
Subjt:  GKNGHSALACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVTADYNNMMHTSEYG---

Query:  ----------------------------------AKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITRSALGSVGSNLKS
                                           KNL+SVSKL  DNN+ +EF  D C VKD  T KV+LRG+LKDGLYQL          S GS+   
Subjt:  ----------------------------------AKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITRSALGSVGSNLKS

Query:  ANKSVSHSAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDG
         NK           +Y +V   K  WHR+LGHPS  +L+ ++K CN+  S +D F FCEAC+ GKSH LPF +S SHA    +LIHTD+WGPA + S  G
Subjt:  ANKSVSHSAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDG

Query:  YRYYVHFLDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV
        ++YYVHF+DD SRF W+YPLK KSDT+  F  F  M++ QFNK +K++Q D GGE+K V
Subjt:  YRYYVHFLDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV

A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment); e value 1.6e-107; identity 38.13%
Query:  LNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWL
        L    +VKL+R N+ LWK+L LP +R  + +G++ G K CP +F+       T + N                     T  INP+Y+ W   D+ LLGWL
Subjt:  LNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWL

Query:  YNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQ
         NSMT ++ATQV+  E +K LW   Q L G  +R+   YL+  F  T K  +KM  YL  MK+ AD L  AGSP+ +  L+ Q L GLD EYN VV  + 
Subjt:  YNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQ

Query:  GRIGISWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQICGKNGHSA
         +  ISW + Q +LL FE RL+  N+  +     SA   +    G  + GSR      N +          G   G+GR   +   R +CQICGK GH+A
Subjt:  GRIGISWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQICGKNGHSA

Query:  LACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVT-----------------------
          CY RFDK +            T  N++  G  S                + F+ASP    D  WY DSGASNHVT                       
Subjt:  LACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVT-----------------------

Query:  ---------------ADYNNMMHTSEYGAKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITRSALGSVGSNLKSANKSVS
                        +  N+++  E   KNL+SVSKL  DNN  +EF  + C VKD  T K +L+G LKDGLYQL                 SANK   
Subjt:  ---------------ADYNNMMHTSEYGAKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITRSALGSVGSNLKSANKSVS

Query:  HSAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVH
             T+      +  K +WHR+LGHP+ K+L  ++K  N+ +S +D F FCEAC+FGK H LPF  S SHA  P DLIHTD+WGPA ++S   ++YYVH
Subjt:  HSAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVH

Query:  FLDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV
        FLDDFSRF W++PLK KS+T+  F  F  +++ QFNK +KV++ D GGEYK V
Subjt:  FLDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV

A0A2Z6MBG6 Integrase catalytic domain-containing protein; e value 3.9e-109; identity 37.96%
Query:  TVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWLYNSMT
        +VKL+R NY LWK+L LP +R  +L+G++ G + CP +F+++S                      +S ++       N  +  W   D+ LLGW+ NSMT
Subjt:  TVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWLYNSMT

Query:  PEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQGRIGI
         E+ATQ++  E +K LW   Q L G  +R++  YL+  F   RKG +KM DYL  MK+  D L  AG+PV T  LI Q L GLD EYN VV  +  +  +
Subjt:  PEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQGRIGI

Query:  SWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQICGKNGHSALACYQ
        SW ++Q +LL FE R+E  N+  + T   +A   N +          +  SS N  + +N +   GGR RG+   +G N     CQ+CG + H A+ C+ 
Subjt:  SWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQICGKNGHSALACYQ

Query:  RFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVTADYNNMMHTSEYG---------------
        RFDK +       +R+  + G++ Q                   + N F+AS  +V D +WY DSGASNHVT         +E+                
Subjt:  RFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVTADYNNMMHTSEYG---------------

Query:  ----------------------AKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQL-GTRITRSALGSVGSNLKSANKSVSHSAFI
                               KNL+SVSKLA DNN+ +EF  + C VKD  T KV+L+G+LKDGLYQL GT+   SA  SV                 
Subjt:  ----------------------AKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQL-GTRITRSALGSVGSNLKSANKSVSHSAFI

Query:  TSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVHFLDDF
                   K  WHRRLGHP+ K+L+ +++ C + V  +D F+FCEAC++GK H LPF +S SHA  P +L+HTD+WGPA +M++ G++YYVHF+DDF
Subjt:  TSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVHFLDDF

Query:  SRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV
        SRF W+YPLK KS+TV  F  F  + + QFNK +KV+Q D GGEYK V
Subjt:  SRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKV

A0A803PEH4 Uncharacterized protein; e value 1.4e-114; identity 38.37%
Query:  SSSLVNMAASASVPIFSSPPLNQLLNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSS
        +SS  N   ++ +P   +PP    LNQ  ++KL+R NY LWK +    +R +RL G+LSG   CP +F+    V  T VT                    
Subjt:  SSSLVNMAASASVPIFSSPPLNQLLNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSS

Query:  AVTLTINPEYESWLVVDKLLLGWLYNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVP
              NPEYE+W++ D+LL+GWLY+SMT  +AT+V+G  +A +L   ++ L+G  S+++ D  R + Q TRKGS  M++YLR  K+ ++ L  AG P P
Subjt:  AVTLTINPEYESWLVVDKLLLGWLYNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVP

Query:  TRSLISQVLLGLDEEYNVVVAIIQGRIGISWSEMQVELLVFEKRLE-----TQNSQRSSTTFGSATSVNMATKGNGQS---GSRQQNSSINRQQYNNKQH
           L++ VL GLD EY  +V  I+ R   +W E+Q  LL F+ ++E     T NS ++++   S+   NMA K N      G + QN+S N     +   
Subjt:  TRSLISQVLLGLDEEYNVVVAIIQGRIGISWSEMQVELLVFEKRLE-----TQNSQRSSTTFGSATSVNMATKGNGQS---GSRQQNSSINRQQYNNKQH

Query:  GGGGRNRGQGRWNGNNNNRLLCQICGKNGHSALACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYV
        G   R RG+GR  G + +R  CQ+ GK GH+A  CY RFD+ ++G   N   N +  G                    T  N + F+A+PE +    W+ 
Subjt:  GGGGRNRGQGRWNGNNNNRLLCQICGKNGHSALACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYV

Query:  DSGASNHVTADYNNMMHTSEYG------------------------------------------AKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVV
        DSGASNH+T+D  N+    +Y                                           AKNLVSVSKLA DNNV IEF+++ CLVKD  T KV+
Subjt:  DSGASNHVTADYNNMMHTSEYG------------------------------------------AKNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVV

Query:  LRGVLKDGLYQLGTRITRSALGSVGSNLKSANKSVSHSAFITSGIYANVLVSK-SVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDL
        L GVLKD LYQL +  T+S+     SN  SA  ++S  + +      ++L+S+  V HRRLGHPS+K+LN +++  N+ VS N +   C+AC++GK+H L
Subjt:  LRGVLKDGLYQLGTRITRSALGSVGSNLKSANKSVSHSAFITSGIYANVLVSK-SVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDL

Query:  PFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYK
        PF +S + A +  DLIHTDLWGPA + S   + YY+HF+DD+SR+ W+YPLKLKSD +A F  F  +++ QF K +K L+SD+GGEYK
Subjt:  PFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYK

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein; e value 2.7e-14; identity 25%
Query:  WNGNNNNRLLCQICGKNGHSALAC--YQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVT
        + GN+  ++ C  CG+ GH    C  Y+R                     N +N    Q +     AFM  +  N  +       +  + +DSGAS+H+ 
Subjt:  WNGNNNNRLLCQICGKNGHSALAC--YQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVT

Query:  ADYNNMMHTSEYGAKNLVSVSK--------------LAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLY----QLGTRITRSALGSVGSNLKSANK
         D +    + E      ++V+K              L  D+ + +E   D    K+   + + ++ + + G+     + G  I+++ L  V ++    N 
Subjt:  ADYNNMMHTSEYGAKNLVSVSK--------------LAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLY----QLGTRITRSALGSVGSNLKSANK

Query:  SVSHSAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKK---------CNLPVSANDIFNFCEACKFGKSHDLPFP--NSKSHAIAPFDLIHTDLWGP
         V +  F    I A    +  +WH R GH S   L  I +K          NL +S       CE C  GK   LPF     K+H   P  ++H+D+ GP
Subjt:  SVSHSAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKK---------CNLPVSANDIFNFCEACKFGKSHDLPFP--NSKSHAIAPFDLIHTDLWGP

Query:  ALVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEY
           ++ D   Y+V F+D F+ +   Y +K KSD  + F  F+   +  FN  V  L  DNG EY
Subjt:  ALVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 2.5e-28; identity 23.54%
Query:  ESWLVVDKLLLGWLYNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYL-RHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVL
        E W  +D+     +   ++ +V   +I  + A+ +W  ++ L+  ++   + YL + ++            +L V       L   G  +        +L
Subjt:  ESWLVVDKLLLGWLYNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYL-RHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVL

Query:  LGLDEEY-NVVVAIIQGRIGISWSEMQVELLVFEK-RLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGN
          L   Y N+   I+ G+  I   ++   LL+ EK R + +N  ++  T G   S   ++   G+SG+R                 G  +NR + R    
Subjt:  LGLDEEY-NVVVAIIQGRIGISWSEMQVELLVFEK-RLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGN

Query:  NNNRLLCQICGKNGHSALACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVT--ADYN
              C  C + GH    C          P+    +  ++G  N  N      +      F+  +     ++ PE+     W VD+ AS+H T   D  
Subjt:  NNNRLLCQICGKNGHSALACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVT--ADYN

Query:  NMMHTSEYGAKNL--VSVSKLAQDNNVDIEFHADSCLV-KDI-HTDKV---VLRGVL--KDGLYQLGT----RITRSAL----GSVGSNLKSANKSVSHS
              ++G   +   S SK+A   ++ I+ +    LV KD+ H   +   ++ G+   +DG          R+T+ +L    G     L   N  +   
Subjt:  NMMHTSEYGAKNL--VSVSKLAQDNNVDIEFHADSCLV-KDI-HTDKV---VLRGVL--KDGLYQLGT----RITRSAL----GSVGSNLKSANKSVSHS

Query:  AFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVHFL
                A   +S  +WH+R+GH S K L  + KK  +  +       C+ C FGK H + F  S    +   DL+++D+ GP  + S  G +Y+V F+
Subjt:  AFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVHFL

Query:  DDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEY
        DD SR +WVY LK K      F  F  +++ +  + +K L+SDNGGEY
Subjt:  DDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEY

Q12491 Transposon Ty2-B Gag-Pol polyprotein; e value 6.5e-13; identity 27.09%
Query:  MHTSEYGAKNLVSVSKLAQDNNVDIEFHADSCLVKDI--HTDKVVLRGVLKDG-------LYQLGTRITRSALGSVGSNLKSANKSVSHSAFITSGIYAN
        +HT    A +L+S+S+LA  N         +C  ++    +D  VL  ++K G        Y + + I++  + +V  + KS NK            Y  
Subjt:  MHTSEYGAKNLVSVSKLAQDNNVDIEFHADSCLVKDI--HTDKVVLRGVLKDG-------LYQLGTRITRSALGSVGSNLKSANKSVSHSAFITSGIYAN

Query:  VLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNF-------CEACKFGKS----HDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVHF
         L+     HR LGH + + +   +KK  +         +       C  C  GKS    H          +  PF  +HTD++GP   +      Y++ F
Subjt:  VLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNF-------CEACKFGKS----HDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVHF

Query:  LDDFSRFVWVYPL--KLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEY
         D+ +RF WVYPL  + +   +  FT  +  IK QFN  V V+Q D G EY
Subjt:  LDDFSRFVWVYPL--KLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value 2.5e-49; identity 26.4%
Query:  MAASASVPIFSSPPLNQLLNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTI
        MAA A   + ++  +   +N     KL   NYL+W          Y L G L G    P   + T                  A P             +
Subjt:  MAASASVPIFSSPPLNQLLNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTI

Query:  NPEYESWLVVDKLLLGWLYNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLIS
        NP+Y  W   DKL+   +  +++  V   V     A  +W  +++++   S      LR   +Q  KG+  + DY++ + +  D L   G P+     + 
Subjt:  NPEYESWLVVDKLLLGWLYNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLIS

Query:  QVLLGLDEEYNVVVAIIQGR-IGISWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWN
        +VL  L EEY  V+  I  +    + +E+   LL  E ++   +S        +A S    T  N  +   + N   NR   NN +       +    ++
Subjt:  QVLLGLDEEYNVVVAIIQGR-IGISWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWN

Query:  GNNNNRL----LCQICGKNGHSALACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQ-NTNPFIASPETVIDPNWYVDSGASNHV
         NNN        CQICG  GHSA  C Q                        Q+  +S  S   P  F   Q   N  + SP +    NW +DSGA++H+
Subjt:  GNNNNRL----LCQICGKNGHSALACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQ-NTNPFIASPETVIDPNWYVDSGASNHV

Query:  TADYNNM-MHTSEYGA----------------------------------------KNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGL
        T+D+NN+ +H    G                                         KNL+SV +L   N V +EF   S  VKD++T   +L+G  KD L
Subjt:  TADYNNM-MHTSEYGA----------------------------------------KNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGL

Query:  YQLGTRITRSALGSVGSNLKSANKSVSHSAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPV-SANDIFNFCEACKFGKSHDLPFPNSKSHA
        Y+     ++       S   S +   +HS+                WH RLGHP+  ILNS++   +L V + +  F  C  C   KS+ +PF  S  ++
Subjt:  YQLGTRITRSALGSVGSNLKSANKSVSHSAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPV-SANDIFNFCEACKFGKSHDLPFPNSKSHA

Query:  IAPFDLIHTDLWGPALVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEY
          P + I++D+W  + ++S D YRYYV F+D F+R+ W+YPLK KS    TF  F  +++ +F   +    SDNGGE+
Subjt:  IAPFDLIHTDLWGPALVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value 7.4e-49; identity 26.44%
Query:  LNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWL
        +N     KL   NYL+W          Y L G L G  P P   + T                  AVP             +NP+Y  W   DKL+   +
Subjt:  LNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWL

Query:  YNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQ
          +++  V   V     A  +W  +++++   S      LR +                   +  D L   G P+     + +VL  L ++Y  V+  I 
Subjt:  YNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQ

Query:  GR-IGISWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLL---CQICGKN
         +    S +E+   L+  E +L   NS           + N+ T  N  +   Q N   NR   NN       +    G  + N   +     CQIC   
Subjt:  GR-IGISWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLL---CQICGKN

Query:  GHSALACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVTADYNNMMHTSEYGA-----
        GHSA  C Q    Q                   Q   TS  +P  P+A +     +P+ A+       NW +DSGA++H+T+D+NN+     Y       
Subjt:  GHSALACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVTADYNNMMHTSEYGA-----

Query:  ------------------------------------KNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITRSALGSVGSNLK
                                            KNL+SV +L   N V +EF   S  VKD++T   +L+G  KD LY+     +++      S   
Subjt:  ------------------------------------KNLVSVSKLAQDNNVDIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITRSALGSVGSNLK

Query:  SANKSVSHSAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPV-SANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMST
        S     +HS+                WH RLGHPSL ILNS++   +LPV + +     C  C   KSH +PF NS   +  P + I++D+W  + ++S 
Subjt:  SANKSVSHSAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPV-SANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWGPALVMST

Query:  DGYRYYVHFLDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEY
        D YRYYV F+D F+R+ W+YPLK KS    TF  F ++++ +F   +  L SDNGGE+
Subjt:  DGYRYYVHFLDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEY

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 6.7e-05 | 23.36
Query:  LERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWLYNSMTP-E
        +E  NY  W+ L L +  S+ + GH+ G                                           L  N    +W   D ++   LY ++TP +
Subjt:  LERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLTINPEYESWLVVDKLLLGWLYNSMTP-E

Query:  VATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQGRIGI-S
             +    ++D+W  I+  F     A    L    +    G +++ADY R MK  AD+L     PV  R+L+  VL GL+ +++ ++ +I+ R    S
Subjt:  VATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEEYNVVVAIIQGRIGI-S

Query:  WSEMQVELLVFEKRLE-------TQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGR
        + +    L   E RL+       T     SS+T  + +     T  N Q     Q     R + NN   G GGR
Subjt:  WSEMQVELLVFEKRLE-------TQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGR

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 4.1e-10 | 28.64
Query:  WLVVDKLLLGWLYNSMTPEVATQVIGYE-NAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLG
        W   D L+  W+Y ++T  +   +I     A+DLW +++ LF     A      +  + T    L + +Y + +KS +D L    SP+  R L+  +L G
Subjt:  WLVVDKLLLGWLYNSMTPEVATQVIGYE-NAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLG

Query:  LDEEYNVVVAIIQGRIGI-SWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQ----NSSINRQQYNNKQHGGGGRNRGQGRWNG
        L E+Y+ ++ +I+ +    S++E +  LL+ E RL  ++    S T   + S  + T    Q    Q+    NS++ R +   K  GGG      GR+N 
Subjt:  LDEEYNVVVAIIQGRIGI-SWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQ----NSSINRQQYNNKQHGGGGRNRGQGRWNG

Query:  NNNNRL
        NNN RL
Subjt:  NNNNRL

ATMG00300.1 Gag-Pol-related retrotransposon family protein | 1.0e-08 | 37.84
Query:  VWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWG-PALVMS
        +WH RL H S + +  +VKK  L  S      FCE C +GK+H + F   +     P D +H+DLWG P++ +S
Subjt:  VWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGKSHDLPFPNSKSHAIAPFDLIHTDLWG-PALVMS


Sequences
CDS sequence
ATGGCCAACACCTTGTCTTCTTCCTTAGTCAACATGGCTGCGTCTGCAAGCGTACCAATTTTTAGTAGCCCTCCTTTAAATCAGCTCTTGAATCAAATAACCACAGTCAA
GTTGGAAAGAGGTAATTATTTGCTGTGGAAAAATTTGACTCTTCCCAACCTTCGAAGTTACCGTTTGGAAGGGCACTTGTCTGGGGACAAGCCTTGCCCCTCCAAATTTC
TCTCTACTTCTCAGGTGATCACTACAGATGTTACAAATGAAGCGGGATCGTCAATCTCTGGAGCTGTCCCGCTAGAAAACTCAGGCCAATCCTCAGCTGTGACTCTTACC
ATCAATCCAGAATATGAAAGTTGGTTGGTAGTTGATAAACTCCTTCTTGGTTGGCTATACAACTCAATGACGCCTGAGGTGGCAACCCAAGTTATAGGGTATGAAAATGC
TAAAGACCTTTGGGCTGCTATTCAAGAGTTGTTTGGAATCCAGTCTCGAGCCGAAGAAGATTATCTTCGTCATGTATTTCAACAGACTCGTAAAGGTTCTCTTAAGATGG
CTGATTACTTGCGTGTCATGAAATCTCATGCAGATAATTTGGGTCAAGCGGGAAGTCCCGTTCCAACAAGATCTCTAATTTCTCAAGTCCTCTTAGGACTTGATGAAGAA
TATAATGTCGTAGTAGCCATAATCCAAGGAAGAATTGGAATATCCTGGTCTGAGATGCAAGTCGAACTGCTGGTTTTTGAGAAAAGGTTGGAAACTCAGAACTCACAAAG
GTCCTCTACAACCTTTGGCTCAGCAACCTCTGTAAACATGGCAACAAAAGGAAATGGTCAATCTGGGTCGAGGCAGCAAAACTCTTCTATCAATCGACAACAGTACAACA
ACAAACAACATGGAGGAGGAGGTAGAAATCGAGGGCAAGGGCGCTGGAATGGCAATAATAACAACCGACTCCTCTGTCAAATTTGTGGCAAAAATGGGCATTCTGCACTA
GCTTGCTACCAGAGGTTTGATAAACAATTTGTTGGTCCCAGTCAGAATGTTAATCGAAATGTTAGTACTGGTGGAAACAACTTCCAAAATGGGGGCACAAGTCAAATGAG
TCCAGTAGCTCCTCAGGCCTTCATGACTACTCAAAATACCAATCCGTTCATCGCCAGTCCAGAAACCGTTATTGATCCAAATTGGTATGTCGACAGCGGTGCTTCTAATC
ATGTCACAGCCGATTACAACAATATGATGCACACATCTGAATATGGAGCAAAGAATCTTGTTAGTGTCTCAAAGCTTGCTCAAGATAACAATGTCGATATTGAATTTCAC
GCTGACTCTTGTCTTGTTAAGGACATTCACACGGACAAAGTGGTGCTGAGGGGAGTTCTTAAAGATGGTTTGTATCAACTTGGAACAAGAATCACTCGTAGTGCTTTAGG
TTCAGTAGGTAGTAACTTGAAGTCGGCTAATAAATCAGTTTCTCACTCTGCCTTTATTACCTCTGGCATCTATGCTAATGTATTGGTGTCCAAATCAGTTTGGCATAGAA
GGCTTGGGCACCCGTCCTTAAAAATTTTGAACTCTATTGTTAAGAAGTGTAATCTACCAGTGAGTGCTAATGATATTTTCAATTTCTGTGAAGCATGCAAATTTGGAAAG
TCTCATGATCTTCCTTTTCCAAATTCAAAATCCCATGCTATTGCTCCCTTTGACTTGATTCACACTGATCTGTGGGGTCCAGCTCTAGTTATGTCTACAGATGGTTATCG
TTATTACGTACATTTTCTTGATGATTTTAGCCGTTTTGTTTGGGTGTATCCACTCAAATTGAAGAGCGACACAGTTGCAACATTTACCCATTTTATTACTATGATAAAAA
CTCAGTTTAACAAAATCGTTAAGGTCTTGCAGTCTGATAATGGAGGAGAATACAAAAAAGTGCATTAG
mRNA sequence
ATGGCCAACACCTTGTCTTCTTCCTTAGTCAACATGGCTGCGTCTGCAAGCGTACCAATTTTTAGTAGCCCTCCTTTAAATCAGCTCTTGAATCAAATAACCACAGTCAA
GTTGGAAAGAGGTAATTATTTGCTGTGGAAAAATTTGACTCTTCCCAACCTTCGAAGTTACCGTTTGGAAGGGCACTTGTCTGGGGACAAGCCTTGCCCCTCCAAATTTC
TCTCTACTTCTCAGGTGATCACTACAGATGTTACAAATGAAGCGGGATCGTCAATCTCTGGAGCTGTCCCGCTAGAAAACTCAGGCCAATCCTCAGCTGTGACTCTTACC
ATCAATCCAGAATATGAAAGTTGGTTGGTAGTTGATAAACTCCTTCTTGGTTGGCTATACAACTCAATGACGCCTGAGGTGGCAACCCAAGTTATAGGGTATGAAAATGC
TAAAGACCTTTGGGCTGCTATTCAAGAGTTGTTTGGAATCCAGTCTCGAGCCGAAGAAGATTATCTTCGTCATGTATTTCAACAGACTCGTAAAGGTTCTCTTAAGATGG
CTGATTACTTGCGTGTCATGAAATCTCATGCAGATAATTTGGGTCAAGCGGGAAGTCCCGTTCCAACAAGATCTCTAATTTCTCAAGTCCTCTTAGGACTTGATGAAGAA
TATAATGTCGTAGTAGCCATAATCCAAGGAAGAATTGGAATATCCTGGTCTGAGATGCAAGTCGAACTGCTGGTTTTTGAGAAAAGGTTGGAAACTCAGAACTCACAAAG
GTCCTCTACAACCTTTGGCTCAGCAACCTCTGTAAACATGGCAACAAAAGGAAATGGTCAATCTGGGTCGAGGCAGCAAAACTCTTCTATCAATCGACAACAGTACAACA
ACAAACAACATGGAGGAGGAGGTAGAAATCGAGGGCAAGGGCGCTGGAATGGCAATAATAACAACCGACTCCTCTGTCAAATTTGTGGCAAAAATGGGCATTCTGCACTA
GCTTGCTACCAGAGGTTTGATAAACAATTTGTTGGTCCCAGTCAGAATGTTAATCGAAATGTTAGTACTGGTGGAAACAACTTCCAAAATGGGGGCACAAGTCAAATGAG
TCCAGTAGCTCCTCAGGCCTTCATGACTACTCAAAATACCAATCCGTTCATCGCCAGTCCAGAAACCGTTATTGATCCAAATTGGTATGTCGACAGCGGTGCTTCTAATC
ATGTCACAGCCGATTACAACAATATGATGCACACATCTGAATATGGAGCAAAGAATCTTGTTAGTGTCTCAAAGCTTGCTCAAGATAACAATGTCGATATTGAATTTCAC
GCTGACTCTTGTCTTGTTAAGGACATTCACACGGACAAAGTGGTGCTGAGGGGAGTTCTTAAAGATGGTTTGTATCAACTTGGAACAAGAATCACTCGTAGTGCTTTAGG
TTCAGTAGGTAGTAACTTGAAGTCGGCTAATAAATCAGTTTCTCACTCTGCCTTTATTACCTCTGGCATCTATGCTAATGTATTGGTGTCCAAATCAGTTTGGCATAGAA
GGCTTGGGCACCCGTCCTTAAAAATTTTGAACTCTATTGTTAAGAAGTGTAATCTACCAGTGAGTGCTAATGATATTTTCAATTTCTGTGAAGCATGCAAATTTGGAAAG
TCTCATGATCTTCCTTTTCCAAATTCAAAATCCCATGCTATTGCTCCCTTTGACTTGATTCACACTGATCTGTGGGGTCCAGCTCTAGTTATGTCTACAGATGGTTATCG
TTATTACGTACATTTTCTTGATGATTTTAGCCGTTTTGTTTGGGTGTATCCACTCAAATTGAAGAGCGACACAGTTGCAACATTTACCCATTTTATTACTATGATAAAAA
CTCAGTTTAACAAAATCGTTAAGGTCTTGCAGTCTGATAATGGAGGAGAATACAAAAAAGTGCATTAG
Protein sequence
MANTLSSSLVNMAASASVPIFSSPPLNQLLNQITTVKLERGNYLLWKNLTLPNLRSYRLEGHLSGDKPCPSKFLSTSQVITTDVTNEAGSSISGAVPLENSGQSSAVTLT
INPEYESWLVVDKLLLGWLYNSMTPEVATQVIGYENAKDLWAAIQELFGIQSRAEEDYLRHVFQQTRKGSLKMADYLRVMKSHADNLGQAGSPVPTRSLISQVLLGLDEE
YNVVVAIIQGRIGISWSEMQVELLVFEKRLETQNSQRSSTTFGSATSVNMATKGNGQSGSRQQNSSINRQQYNNKQHGGGGRNRGQGRWNGNNNNRLLCQICGKNGHSAL
ACYQRFDKQFVGPSQNVNRNVSTGGNNFQNGGTSQMSPVAPQAFMTTQNTNPFIASPETVIDPNWYVDSGASNHVTADYNNMMHTSEYGAKNLVSVSKLAQDNNVDIEFH
ADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITRSALGSVGSNLKSANKSVSHSAFITSGIYANVLVSKSVWHRRLGHPSLKILNSIVKKCNLPVSANDIFNFCEACKFGK
SHDLPFPNSKSHAIAPFDLIHTDLWGPALVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVATFTHFITMIKTQFNKIVKVLQSDNGGEYKKVH