CuGenDBv2

Lag0011808 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0011808
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr1:33198220..33200379
RNA-Seq Expression: Lag0011808
Synteny: Lag0011808
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR025724 - GAG-pre-integrase domain
    IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits    e value    % identity    Alignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]    6.1e-88    34.85
Query:  NLLNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKP
        N L    S+KLDR NY LW+++V+P+++  KL+G++      PE  I         D  K  N  +  W A +Q L+GW+ NSMT EI TQ++  + +K 
Subjt:  NLLNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKP

Query:  LWDSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRL------------------QALKG-NISMNLTSTKLASINVASTEGQKGGMHT----
        LWD  Q      +RSQ  Y +      RKG MKM +YL  MK L                  Q L G +   N    KL+     S    +  + T    
Subjt:  LWDSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRL------------------QALKG-NISMNLTSTKLASINVASTEGQKGGMHT----

Query:  ----SNPNNYNQN-SGFASNNSQTRGYSSN---RGNRYRS-RGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQPTVIQEPTSS
            +N  N   N +   +N S  RG SSN   RG+  R  RG          P CQ+CG   H  + C HR+DK    T   +N ++            
Subjt:  ----SNPNNYNQN-SGFASNNSQTRGYSSN---RGNRYRS-RGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQPTVIQEPTSS

Query:  NPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSKITQDNPVLV
        +  A +    S++D  WY DSGASNHV     K          +++VVGNG K+ I   GS+ +K    +L L DIL+VP++ KNL+SVSK+  DN +LV
Subjt:  NPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSKITQDNPVLV

Query:  EFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCKVDMWHK
        EF +  CFVK+K + +V L G L++GLY+L                                             +R+    ++V +S         WH+
Subjt:  EFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCKVDMWHK

Query:  RLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPLKTRADAFS
        RLGHP+ K+L++VL  C      +++ SFCEAC++GK H LPF  S S A   L+LVH+D+WGP PI ++ G+ YY+ F+DD+SR+TWI+PLK +++   
Subjt:  RLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPLKTRADAFS

Query:  VFTQFKAQAEKQYNTVLKTLRCDGGGEYKPIIQFAKE
         F QFK   E Q+N  +K ++CDGGGEYKP+ + A E
Subjt:  VFTQFKAQAEKQYNTVLKTLRCDGGGEYKPIIQFAKE

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]    2.2e-90    33.64
Query:  NLLNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKP
        N L    S+KLDR NY LWQ++V+PI++  +L+G++  K   PE  I    S K+       NPE++ W A +Q L+GWL NSMT  I TQ++  + +  
Subjt:  NLLNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKP

Query:  LWDSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRL------------------QALKG---------------------------------
        LWD  Q      +RSQ  Y +     TRKG MKM +YL  MK L                  Q L G                                 
Subjt:  LWDSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRL------------------QALKG---------------------------------

Query:  ----NISMNLTSTKLASINVASTEGQKGGMHTSNPNNYNQNSGFASNNSQTRGYSSNRGNRYRSRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKAN
            N   NLT    A+ NVA     +G       N +N N+ +  +N+  RG +       R RGR      + +  CQ+CG   H  + C +R+DK  
Subjt:  ----NISMNLTSTKLASINVASTEGQKGGMHTSNPNNYNQNSGFASNNSQTRGYSSNRGNRYRSRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKAN

Query:  SNTQDHTNQTSAQPTVIQEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDIL
        S +    N               +  A +    S++D  WY DSGASNHV     K          ++++VGNG K+ I   GS+ +K    +L L DIL
Subjt:  SNTQDHTNQTSAQPTVIQEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDIL

Query:  FVPHMKKNLISVSKITQDNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRR
        +VP + KNL+SVSK+  DN +LVEF +  CFVK+K + +  L G L++GLY+L E                                            +
Subjt:  FVPHMKKNLISVSKITQDNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRR

Query:  DNGVLINVIQSILLTCKVDMWHKRLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYI
        D+   +++ +S         WH++LGHP+ K+L+ VL  CN     ++  SFCEAC++GK H LPF  S S A  +L+LVH+D+WGP PI S+ G+ YY+
Subjt:  DNGVLINVIQSILLTCKVDMWHKRLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYI

Query:  AFLDDYSRYTWIFPLKTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEYKPIIQFAKE
         F+DD++R+TWI+PLK ++D    F QFK   E Q++  +KT++CDGGGEYKP+ + A E
Subjt:  AFLDDYSRYTWIFPLKTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEYKPIIQFAKE

PNX78574.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense]    7.7e-91    35.6
Query:  SIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKPLWDSIQE
        S+ LDR N+ LW+++V+PI++  +L+G++      PE  I    +  E  G K+ NP++  W A +Q ++GWL N+MT    +Q++  + +K LW+  Q 
Subjt:  SIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKPLWDSIQE

Query:  YFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRL------------------QALKG-NISMNLTSTKLA-SINVASTEGQKGGM-------HTSNP
             +RS+  Y R     TRKG  KM +YL  MK L                  Q L G +   N    KL+  IN++  + Q   +         ++ 
Subjt:  YFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRL------------------QALKG-NISMNLTSTKLA-SINVASTEGQKGGM-------HTSNP

Query:  NNYNQNSGF-ASNNSQTRGYSSN-----RGNRYR-SRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQPTVIQEPTSSNPTAL
        NN N+N+    +N +Q RG   N     RG+ +R +RG      P+N  ICQ+C K GHT + C HRYDK+      +T  + +   V ++ T +   A 
Subjt:  NNYNQNSGF-ASNNSQTRGYSSN-----RGNRYR-SRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQPTVIQEPTSSNPTAL

Query:  MPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSKITQDNPVLVEFHDV
        +    + QD  WY DSGASNHV     K          ++++VGNG K+ I   GS+ +K    NL L D+L+VP + KNL+SVSK+T DN ++VEF + 
Subjt:  MPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSKITQDNPVLVEFHDV

Query:  FCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCKVDMWHKRLGHP
         CFVK+K + +V L G L++GLY+          LS    +T                     P V  SV+                   + WH++LGHP
Subjt:  FCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCKVDMWHKRLGHP

Query:  SPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPLKTRADAFSVFTQF
        S  +L++VL ICN  +  ++   FCEAC+ GKSH LPF  S S A  VL+L+H+D+WGP PI S  G+ YY+ F+DD SR+TWI+PLK ++D    F QF
Subjt:  SPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPLKTRADAFSVFTQF

Query:  KAQAEKQYNTVLKTLRCDGGGEYKPIIQFAKE
        K   E Q+N  +K ++CDGGGE+KP+ + A E
Subjt:  KAQAEKQYNTVLKTLRCDGGGEYKPIIQFAKE

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]    1.0e-90    34.22
Query:  NLLNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKP
        N L    S+KLDR N+ LW+++V+P+++  K +G++      P+  +         D  +  NP+Y  W A +Q L+GWL NSMT +I TQV+  + +K 
Subjt:  NLLNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKP

Query:  LWDSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRL------------------QALKG-NISMNLTSTKLA----------SINVASTEGQ
        LWD  Q      +RS+  Y +     T K  MKM +YL  MK L                  Q L G +   N    KL+             + + E +
Subjt:  LWDSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRL------------------QALKG-NISMNLTSTKLA----------SINVASTEGQ

Query:  KGGMHTSNPNNYNQNSGFASNNSQ----------TRGYSSN--RGNRYRSRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQP
           ++  N  N N ++ FAS N             RG +S   RG R R+R    P     RPICQ+CGK GHT   C++R+DK+ +    +     +  
Subjt:  KGGMHTSNPNNYNQNSGFASNNSQ----------TRGYSSN--RGNRYRSRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQP

Query:  TVIQEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSK
          +  P               QD  WY DSGASNHV    G+L         ++++VGNG K+ I   GST    +  ++ L+++L+VP + KNL+SVSK
Subjt:  TVIQEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSK

Query:  ITQDNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILL
        +T DN  LVEF + +C+VK+K + +  L G+L++GLY+L    E                    PP   D       P    S++               
Subjt:  ITQDNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILL

Query:  TCKVDMWHKRLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFP
            ++WH++LGHP+ K+L +VL   N     ++  +FCEAC+FGK H LPF  S S A   LDL+H+D+WGP PI S   + YY+ FLDD+SR+TWIFP
Subjt:  TCKVDMWHKRLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFP

Query:  LKTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEYKPI
        LK +++    F QFK   E Q+N  +K +RCDGGGEYKP+
Subjt:  LKTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEYKPI

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]    4.0e-87    34.54
Query:  NLLNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKP
        N L  + S+KLDR NY LW+++V+P+++  K +G++      PE  +         D  K  NP++  W+A +Q L+GWL NSM  +I TQ++  + +K 
Subjt:  NLLNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKP

Query:  LWDSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRL------------------QALKG-NISMNLTSTKLA-SINVASTEGQKGGM-HTSN
        LWD  Q      ++S+  Y +     TRKG MKM EYL  MK L                  Q L G +   N    KL+  IN++  + Q   +   S 
Subjt:  LWDSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRL------------------QALKG-NISMNLTSTKLA-SINVASTEGQKGGM-HTSN

Query:  PNNYNQNSGFASNNSQTRGYSSN-RGNRYRSRGRYSPYTPNNRPI-------------CQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQPTVIQEP
         +  N  SG   N S      +  RGN++ SRG +     N R +             CQ+C   GHT V C +R+D++ +  ++++ +   Q       
Subjt:  PNNYNQNSGFASNNSQTRGYSSN-RGNRYRSRGRYSPYTPNNRPI-------------CQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQPTVIQEP

Query:  TSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSKITQDNP
           + +A +  P   QD  WY DSGASNHV     K          ++++VGNG K+ I   GST    +   L L D+L+VP + KNL+SVSK+T DN 
Subjt:  TSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSKITQDNP

Query:  VLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCKVDM
        + VEF    C VK+K + +  L G+L++GLY+          LS+V  ++                     P V  SV+                   + 
Subjt:  VLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCKVDM

Query:  WHKRLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPLKTRAD
        WH++LGHP+ K+L +VL  CN     ++  SFCEAC+FGK H LPF  S S     L L+HSD+WGP PI S  G+ YY+ F+DD+SR+TWIFPLK ++D
Subjt:  WHKRLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPLKTRAD

Query:  AFSVFTQFKAQAEKQYNTVLKTLRCDGGGEYKPI
            F QFK  AE Q+N  +K ++CDGGGEYK +
Subjt:  AFSVFTQFKAQAEKQYNTVLKTLRCDGGGEYKPI

TrEMBL top hits    e value    % identity    Alignment
A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment)    1.1e-90    33.64
Query:  NLLNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKP
        N L    S+KLDR NY LWQ++V+PI++  +L+G++  K   PE  I    S K+       NPE++ W A +Q L+GWL NSMT  I TQ++  + +  
Subjt:  NLLNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKP

Query:  LWDSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRL------------------QALKG---------------------------------
        LWD  Q      +RSQ  Y +     TRKG MKM +YL  MK L                  Q L G                                 
Subjt:  LWDSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRL------------------QALKG---------------------------------

Query:  ----NISMNLTSTKLASINVASTEGQKGGMHTSNPNNYNQNSGFASNNSQTRGYSSNRGNRYRSRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKAN
            N   NLT    A+ NVA     +G       N +N N+ +  +N+  RG +       R RGR      + +  CQ+CG   H  + C +R+DK  
Subjt:  ----NISMNLTSTKLASINVASTEGQKGGMHTSNPNNYNQNSGFASNNSQTRGYSSNRGNRYRSRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKAN

Query:  SNTQDHTNQTSAQPTVIQEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDIL
        S +    N               +  A +    S++D  WY DSGASNHV     K          ++++VGNG K+ I   GS+ +K    +L L DIL
Subjt:  SNTQDHTNQTSAQPTVIQEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDIL

Query:  FVPHMKKNLISVSKITQDNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRR
        +VP + KNL+SVSK+  DN +LVEF +  CFVK+K + +  L G L++GLY+L E                                            +
Subjt:  FVPHMKKNLISVSKITQDNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRR

Query:  DNGVLINVIQSILLTCKVDMWHKRLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYI
        D+   +++ +S         WH++LGHP+ K+L+ VL  CN     ++  SFCEAC++GK H LPF  S S A  +L+LVH+D+WGP PI S+ G+ YY+
Subjt:  DNGVLINVIQSILLTCKVDMWHKRLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYI

Query:  AFLDDYSRYTWIFPLKTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEYKPIIQFAKE
         F+DD++R+TWI+PLK ++D    F QFK   E Q++  +KT++CDGGGEYKP+ + A E
Subjt:  AFLDDYSRYTWIFPLKTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEYKPIIQFAKE

A0A2K3LJ49 Retrovirus-related Pol polyprotein from transposon TNT 1-94    3.7e-91    35.6
Query:  SIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKPLWDSIQE
        S+ LDR N+ LW+++V+PI++  +L+G++      PE  I    +  E  G K+ NP++  W A +Q ++GWL N+MT    +Q++  + +K LW+  Q 
Subjt:  SIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKPLWDSIQE

Query:  YFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRL------------------QALKG-NISMNLTSTKLA-SINVASTEGQKGGM-------HTSNP
             +RS+  Y R     TRKG  KM +YL  MK L                  Q L G +   N    KL+  IN++  + Q   +         ++ 
Subjt:  YFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRL------------------QALKG-NISMNLTSTKLA-SINVASTEGQKGGM-------HTSNP

Query:  NNYNQNSGF-ASNNSQTRGYSSN-----RGNRYR-SRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQPTVIQEPTSSNPTAL
        NN N+N+    +N +Q RG   N     RG+ +R +RG      P+N  ICQ+C K GHT + C HRYDK+      +T  + +   V ++ T +   A 
Subjt:  NNYNQNSGF-ASNNSQTRGYSSN-----RGNRYR-SRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQPTVIQEPTSSNPTAL

Query:  MPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSKITQDNPVLVEFHDV
        +    + QD  WY DSGASNHV     K          ++++VGNG K+ I   GS+ +K    NL L D+L+VP + KNL+SVSK+T DN ++VEF + 
Subjt:  MPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSKITQDNPVLVEFHDV

Query:  FCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCKVDMWHKRLGHP
         CFVK+K + +V L G L++GLY+          LS    +T                     P V  SV+                   + WH++LGHP
Subjt:  FCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCKVDMWHKRLGHP

Query:  SPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPLKTRADAFSVFTQF
        S  +L++VL ICN  +  ++   FCEAC+ GKSH LPF  S S A  VL+L+H+D+WGP PI S  G+ YY+ F+DD SR+TWI+PLK ++D    F QF
Subjt:  SPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPLKTRADAFSVFTQF

Query:  KAQAEKQYNTVLKTLRCDGGGEYKPIIQFAKE
        K   E Q+N  +K ++CDGGGE+KP+ + A E
Subjt:  KAQAEKQYNTVLKTLRCDGGGEYKPIIQFAKE

A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)    4.9e-91    34.22
Query:  NLLNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKP
        N L    S+KLDR N+ LW+++V+P+++  K +G++      P+  +         D  +  NP+Y  W A +Q L+GWL NSMT +I TQV+  + +K 
Subjt:  NLLNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKP

Query:  LWDSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRL------------------QALKG-NISMNLTSTKLA----------SINVASTEGQ
        LWD  Q      +RS+  Y +     T K  MKM +YL  MK L                  Q L G +   N    KL+             + + E +
Subjt:  LWDSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRL------------------QALKG-NISMNLTSTKLA----------SINVASTEGQ

Query:  KGGMHTSNPNNYNQNSGFASNNSQ----------TRGYSSN--RGNRYRSRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQP
           ++  N  N N ++ FAS N             RG +S   RG R R+R    P     RPICQ+CGK GHT   C++R+DK+ +    +     +  
Subjt:  KGGMHTSNPNNYNQNSGFASNNSQ----------TRGYSSN--RGNRYRSRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQP

Query:  TVIQEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSK
          +  P               QD  WY DSGASNHV    G+L         ++++VGNG K+ I   GST    +  ++ L+++L+VP + KNL+SVSK
Subjt:  TVIQEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSK

Query:  ITQDNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILL
        +T DN  LVEF + +C+VK+K + +  L G+L++GLY+L    E                    PP   D       P    S++               
Subjt:  ITQDNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILL

Query:  TCKVDMWHKRLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFP
            ++WH++LGHP+ K+L +VL   N     ++  +FCEAC+FGK H LPF  S S A   LDL+H+D+WGP PI S   + YY+ FLDD+SR+TWIFP
Subjt:  TCKVDMWHKRLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFP

Query:  LKTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEYKPI
        LK +++    F QFK   E Q+N  +K +RCDGGGEYKP+
Subjt:  LKTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEYKPI

A0A803P4G6 Uncharacterized protein    8.3e-91    35.77
Query:  PTFANLLNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKM--PNPEYDLWLAANQLLVGWLYNSMTPEITTQVMG
        P   + LNQ  S+KLDR N+ LW+ +V  I++ Y+++G LS   P P   +   P+    +G+++   NPE++ W+  +QLL+GWLY+SMT  I T+VMG
Subjt:  PTFANLLNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKM--PNPEYDLWLAANQLLVGWLYNSMTPEITTQVMG

Query:  HDEAKPLW-DSIQE---YFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTM-------KRLQALKGNISMNLTSTKLASINVASTEGQKGGMHTSNPNNY
           A  LW D   E     ++ S    +Y  +++Q   +         D +       +R+Q+L+G  +        A++ VA T  Q  G    + N  
Subjt:  HDEAKPLW-DSIQE---YFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTM-------KRLQALKGNISMNLTSTKLASINVASTEGQKGGMHTSNPNNY

Query:  NQNSGFASNNSQTRGYSSNRGNRYRSRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQPTVIQEPTSSNPTALMPYPESLQDP
        N N G              RG+R RSRGR   Y  N+RP CQ+CGK GH+  +C++RYD+   N   H           Q        A +  P+ L   
Subjt:  NQNSGFASNNSQTRGYSSNRGNRYRSRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQPTVIQEPTSSNPTALMPYPESLQDP

Query:  SWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLL-LKDILFVPHMKKNLISVSKITQDNPVLVEFHDVFCFVKEKNS
        +W++DSGASNH+ +    +S K        + VG+G+K+ I  +G+ ++K     LL LK +L VP + KNLISV K+T DN VL+EF+   C VK+K +
Subjt:  SWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLL-LKDILFVPHMKKNLISVSKITQDNPVLVEFHDVFCFVKEKNS

Query:  SEVCLIGKLENGLYRLLEEH----EALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCKVDMWHKRLGHPSPKIL
         +V L G L++GLY++   H     AL+       +T                   V+PTV       N  +   + S  L  K+D+WH+RLGHPS K+L
Subjt:  SEVCLIGKLENGLYRLLEEH----EALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCKVDMWHKRLGHPSPKIL

Query:  NQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPLKTRADAFSVFTQFKAQAE
         QVL   N     NE  +FC+AC++GKSH LPF  S +RA  VLDL+H+DLWGP P+ S+  + YYI F+D +SRYTW++PLK +++A   F QFK   E
Subjt:  NQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPLKTRADAFSVFTQFKAQAE

Query:  ---KQYNTVLKTLRC
           ++Y+T  K  +C
Subjt:  ---KQYNTVLKTLRC

A0A803PEH4 Uncharacterized protein    1.2e-100    37.04
Query:  LNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKPLW
        LNQ  S+KLDR NY LW+ +V  I++ ++L G+LS        +++ PP        ++ NPEY+ W+  +QLL+GWLY+SMT  I T+VMG   A  L 
Subjt:  LNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKPLW

Query:  DSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMK---RLQALKG----------NISMNLTSTKLASI--------------------------
         +++  +   S+S+ D  R ++Q TRKG+  M EYL   K    + AL G          N+   L +  L+ +                          
Subjt:  DSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMK---RLQALKG----------NISMNLTSTKLASI--------------------------

Query:  ---------NVASTEGQKGGMHTSNPNNYNQNSGFASNNSQTRG---YSSNRGNRYRSRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKA--NSNTQ
                 N A++   +  M  +  NN  +  GF S N+ T     +S++RG   R RGR       +RP CQ+ GK GHT  +C++R+D++   S+  
Subjt:  ---------NVASTEGQKGGMHTSNPNNYNQNSGFASNNSQTRG---YSSNRGNRYRSRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKA--NSNTQ

Query:  DHTNQTSAQPTVIQEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRN-LLLKDILFVP
        +  NQ  A        T++N +A +  PE L+  +W+ DSGASNH+ +D   L+ K       +VVVGNG+K+ I  +G+  +     N LLLKD+L VP
Subjt:  DHTNQTSAQPTVIQEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRN-LLLKDILFVP

Query:  HMKKNLISVSKITQDNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNG
         + KNL+SVSK+  DN VL+EF+  FC VK+K + +V L G L++ LY+L                  S F     P Q  Q  F    T+  SV  D+ 
Subjt:  HMKKNLISVSKITQDNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNG

Query:  VLINVIQSILLTCKVDMWHKRLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFL
        V  +   S+L++ ++D+ H+RLGHPS K+LN VL   N S   N   + C+AC++GK+H LPF  S++RA  VLDL+H+DLWGP PI S   + YYI F+
Subjt:  VLINVIQSILLTCKVDMWHKRLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFL

Query:  DDYSRYTWIFPLKTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEYKPIIQFAK
        DDYSRYTW++PLK ++DA + F QFKA  E Q+   +K+LR D GGEYKP +   +
Subjt:  DDYSRYTWIFPLKTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEYKPIIQFAK

SwissProt top hits    e value    % identity    Alignment
P04146 Copia protein    1.8e-13    24.29
Query:  CQLCGKMGHTTVICHHRYDKANSNTQDHTN--QTSAQPTVIQEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLS-LKGTTPSLSNVVVGNGT
        C  CG+ GH    C H     N+  +++    QT+    +       N T++M       +  + +DSGAS+H+  D    +      P L   V   G 
Subjt:  CQLCGKMGHTTVICHHRYDKANSNTQDHTN--QTSAQPTVIQEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLS-LKGTTPSLSNVVVGNGT

Query:  KVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSKITQDNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFC
         +     G   ++     + L+D+LF      NL+SV ++ Q+  + +EF        +K+   +      +NGL                         
Subjt:  KVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSKITQDNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFC

Query:  GPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCK----VDMWHKRLGHPSPKILNQVLH---ICNASSKNNESLS--FCEACKFGKSHRLPF
                          VV++    N V +   Q+  +  K      +WH+R GH S   L ++       + S  NN  LS   CE C  GK  RLPF
Subjt:  GPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCK----VDMWHKRLGHPSPKILNQVLH---ICNASSKNNESLS--FCEACKFGKSHRLPF

Query:  -TLSD-SRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPLKTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEY
          L D +     L +VHSD+ GP    +     Y++ F+D ++ Y   + +K ++D FS+F  F A++E  +N  +  L  D G EY
Subjt:  -TLSD-SRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPLKTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    8.7e-29    25.04
Query:  ILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKPLWDSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRLQAL
        +L    K+PD +K  +     W   ++     +   ++ ++   ++  D A+ +W  ++  +  ++ + + Y +  L       + M E  + +  L   
Subjt:  ILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKPLWDSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRLQAL

Query:  KGNISMNLTSTKLASINVASTEGQKG-GMHTSNPNNYN--------------------------------QNSGFA-SNNSQTRGY--SSNRGNRYRSRG
         G I      T+LA++ V   E  K   +  S P++Y+                                +N G A     + R Y  SSN   R  +RG
Subjt:  KGNISMNLTSTKLASINVASTEGQKG-GMHTSNPNNYN--------------------------------QNSGFA-SNNSQTRGY--SSNRGNRYRSRG

Query:  RYSPYTPNNRPICQLCGKMGHTTVIC-HHRYDKANSNTQDHTNQTSA----QPTVIQEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKG
        +    + +    C  C + GH    C + R  K  ++ Q + + T+A       V+           +  PES     W +D+ AS+H AT    L  + 
Subjt:  RYSPYTPNNRPICQLCGKMGHTTVIC-HHRYDKANSNTQDHTNQTSA----QPTVIQEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKG

Query:  TTPSLSNVVVGNGTKVPIKGVGSTYIKGRKR-NLLLKDILFVPHMKKNLISVSKITQDNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEAL
               V +GN +   I G+G   IK      L+LKD+  VP ++ NLIS   + +D      +   F   K + +    +I K   G+ R        
Subjt:  TTPSLSNVVVGNGTKVPIKGVGSTYIKGRKR-NLLLKDILFVPHMKKNLISVSKITQDNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEAL

Query:  VCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCKVDMWHKRLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGK
                RT ++ C      QG+                     +N  Q  +    VD+WHKR+GH S K L  +      S     ++  C+ C FGK
Subjt:  VCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCKVDMWHKRLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGK

Query:  SHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPLKTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEY
         HR+ F  S  R   +LDLV+SD+ GP  I S  G  Y++ F+DD SR  W++ LKT+   F VF +F A  E++    LK LR D GGEY
Subjt:  SHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPLKTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEY

Q12501 Transposon Ty2-OR2 Gag-Pol polyprotein    1.8e-05    21.16
Query:  NRLMLQQTRKGTMKMHEYLDTMKRLQALKGNISMNLTSTKLASINVASTEGQKGGMHTSNPNNYNQNSGFASNNSQTRGYSSNRGNRYRSRGRYSPYTPN
        +RL  Q   KG     +YL    R    K N+ +   S   A I +   E +   M+ + P+ Y Q+S +  N S+T   ++N     R+  R +   P 
Subjt:  NRLMLQQTRKGTMKMHEYLDTMKRLQALKGNISMNLTSTKLASINVASTEGQKGGMHTSNPNNYNQNSGFASNNSQTRGYSSNRGNRYRSRGRYSPYTPN

Query:  NRPICQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQPTVI---------QEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPS
                    H  +    ++ + N+   DH N+++     +         Q+   S PT  +   + L D    +DSGAS  +      L    T  S
Subjt:  NRPICQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQPTVI---------QEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPS

Query:  LSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSKITQDNPVLVEFHDVFCFVKE--KNSSEVCLIGKLENGLYRLLEEHEALVCL
          N+V      +PI  +G+ +   +         L  P++  +L+S+S++T  N          CF +   + S    L   +++G +  L +   +   
Subjt:  LSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSKITQDNPVLVEFHDVFCFVKE--KNSSEVCLIGKLENGLYRLLEEHEALVCL

Query:  SEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCKVDMWHKRLGHPSPKIL------NQVLHICNASSK-NNESLSFCEAC
                               P  ++   + +V +   V  N     L+       H+ LGH + + +      N V ++  +  + +N S   C  C
Subjt:  SEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCKVDMWHKRLGHPSPKIL------NQVLHICNASSK-NNESLSFCEAC

Query:  KFGKS----HRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPLKTRAD--AFSVFTQFKAQAEKQYNTVLKTLRCDGGGE
          GKS    H     L    +      +H+D++GP         +Y+I+F D+ +R+ W++PL  R +    +VFT   A  + Q+N  +  ++ D G E
Subjt:  KFGKS----HRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPLKTRAD--AFSVFTQFKAQAEKQYNTVLKTLRCDGGGE

Query:  Y
        Y
Subjt:  Y

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    9.9e-49    24.73
Query:  LNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKPLW
        +N     KL  TNYL+W   V  +   Y+L G L   T        +PP+    D     NP+Y  W   ++L+   +  +++  +   V     A  +W
Subjt:  LNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKPLW

Query:  DSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTM----------------------------KRLQALKGNISMNLTSTKLASIN----------
        +++++ +   S       R  L+Q  KGT  + +Y+  +                            +  + +   I+   T   L  I+          
Subjt:  DSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTM----------------------------KRLQALKGNISMNLTSTKLASIN----------

Query:  -----------VASTEGQKGGMHTSNPNNYNQNSGF--ASNNSQTRGYSSNRGNRYRSRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKANSNTQDH
                    A+    +    T+N NN N+N+ +   +NN+ ++ +  +  N + +  +  PY       CQ+CG  GH+   C        S  Q  
Subjt:  -----------VASTEGQKGGMHTSNPNNYNQNSGF--ASNNSQTRGYSSNRGNRYRSRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKANSNTQDH

Query:  TNQTSAQPTVIQEP---TSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVP
         +  ++Q    Q P   T   P A +         +W +DSGA++H+ +DF  LSL        +V+V +G+ +PI   GST +  + R L L +IL+VP
Subjt:  TNQTSAQPTVIQEP---TSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVP

Query:  HMKKNLISVSKITQDNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNG
        ++ KNLISV ++   N V VEF      VK+ N+    L GK ++ LY                            P+   Q                  
Subjt:  HMKKNLISVSKITQDNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNG

Query:  VLINVIQSILLTCKVDMWHKRLGHPSPKILNQVLHICNASSKN-NESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAF
          +++  S         WH RLGHP+P ILN V+   + S  N +     C  C   KS+++PF+ S   ++  L+ ++SD+W  +PI S   Y YY+ F
Subjt:  VLINVIQSILLTCKVDMWHKRLGHPSPKILNQVLHICNASSKN-NESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAF

Query:  LDDYSRYTWIFPLKTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEYKPIIQF
        +D ++RYTW++PLK ++     F  FK   E ++ T + T   D GGE+  + ++
Subjt:  LDDYSRYTWIFPLKTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEYKPIIQF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    8.4e-48    25.63
Query:  LNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKPLW
        +N     KL  TNYL+W   V  +   Y+L G L   TP       +PP+    D +   NP+Y  W   ++L+   +  +++  +   V     A  +W
Subjt:  LNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEPDGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKPLW

Query:  DSIQEYF----------------------------------DIQSRSQEDYNRLMLQQTRKGT----MKMHEYL-DTMKRLQALKGNISMNLTSTKLASI
        +++++ +                                   +     +DY  ++ Q   K T     ++HE L +   +L AL     + +T+  +   
Subjt:  DSIQEYF----------------------------------DIQSRSQEDYNRLMLQQTRKGT----MKMHEYL-DTMKRLQALKGNISMNLTSTKLASI

Query:  NVASTEGQKGGMHTSNPNNYNQNSGFASNNSQTRGYS-SNRGNRYRSRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQPTVI
        N  +           N NN   N  + +NN+++  +  S+ G+R  +R +  PY       CQ+C   GH+   C   +   ++  Q    Q S  P   
Subjt:  NVASTEGQKGGMHTSNPNNYNQNSGFASNNSQTRGYS-SNRGNRYRSRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQPTVI

Query:  QEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSKITQ
         +P  +N     PY  +    +W +DSGA++H+ +DF  LS         +V++ +G+ +PI   GS  +    R+L L  +L+VP++ KNLISV ++  
Subjt:  QEPTSSNPTALMPYPESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSKITQ

Query:  DNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCK
         N V VEF      VK+ N+    L GK ++ LY                            P+   Q                    +++  S      
Subjt:  DNPVLVEFHDVFCFVKEKNSSEVCLIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCK

Query:  VDMWHKRLGHPSPKILNQVL--HICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPL
           WH RLGHPS  ILN V+  H     + +++ LS C  C   KSH++PF+ S   +S  L+ ++SD+W  +PI S   Y YY+ F+D ++RYTW++PL
Subjt:  VDMWHKRLGHPSPKILNQVL--HICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPL

Query:  KTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEY
        K ++     F  FK+  E ++ T + TL  D GGE+
Subjt:  KTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEY

Arabidopsis top hits    e value    % identity
ATMG00300.1 Gag-Pol-related retrotransposon family protein    6.0e-09    40.3
Query:  MWHKRLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWG
        +WH RL H S + +  ++      S    SL FCE C +GK+HR+ F+         LD VHSDLWG
Subjt:  MWHKRLGHPSPKILNQVLHICNASSKNNESLSFCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWG
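
The % identity value reported for a hit can be checked against the middle line of its alignment. The following is a minimal Python sketch, assuming the usual BLAST-style convention that a letter on the middle line marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. The function name `percent_identity` is hypothetical and is not part of the database.

```python
def percent_identity(midline: str) -> float:
    """Return the percentage of alignment columns marked as identical.

    Assumes a BLAST-style middle line: letters are identities,
    '+' marks a conservative substitution, spaces are mismatches or gaps.
    """
    if not midline:
        raise ValueError("empty alignment middle line")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Middle line of the ATMG00300.1 alignment, copied from the record above.
MIDLINE = "+WH RL H S + +  ++      S    SL FCE C +GK+HR+ F+         LD VHSDLWG"

print(round(percent_identity(MIDLINE), 1))
```

For this alignment the middle line carries 27 letters over 67 columns, which agrees with the 40.3 listed for ATMG00300.1. For hits that span several alignment blocks, the middle lines of all blocks would need to be concatenated first.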


Sequences
CDS sequence
ATGACTGAAACCGAACACCCAAACCCAGAGCCCTCAATCGAAGCTTCAAAGAGCAACACGCCTTCTGGAATCACACCCAACAAAGGAAAGGAGATTATCGTTGAAAACAG
ACCGTTTGAAGTAAGATATGGAACCCAATCTTTGGGAACACCAACGTTTGCAAATCTCCTGAATCAGGTAACATCCATCAAGCTAGACCGAACAAATTACCTCCTCTGGC
AGAATATTGTTATTCCGATCCTCAAGAGCTACAAGCTTGAAGGCCACTTGTCTAGCAAAACTCCTGCGCCCGAAATGTCGATTATCCTGCCACCGTCCCAAAAGGAACCT
GATGGCCTGAAAATGCCCAATCCTGAATACGATCTGTGGCTGGCAGCAAACCAATTGTTGGTAGGATGGTTGTACAACTCGATGACACCAGAAATCACCACCCAAGTTAT
GGGACATGATGAAGCCAAACCTCTGTGGGACTCCATTCAGGAGTATTTCGATATTCAATCACGTTCACAAGAAGACTACAATCGCTTGATGCTTCAACAAACCAGAAAAG
GTACTATGAAGATGCATGAATATCTAGACACCATGAAAAGATTACAAGCATTAAAGGGAAATATCTCCATGAACCTGACATCAACCAAGCTTGCTTCAATAAATGTTGCA
AGTACAGAAGGACAAAAGGGTGGGATGCACACATCCAACCCAAACAATTACAATCAGAACAGTGGATTTGCTTCAAACAACAGTCAAACCAGAGGATACTCGTCAAATAG
AGGAAATAGGTATCGATCCAGGGGCAGGTACTCTCCTTATACTCCAAATAATAGACCAATCTGTCAACTTTGTGGAAAAATGGGCCACACTACAGTCATTTGTCACCACA
GATATGACAAGGCTAACTCCAATACACAAGATCACACAAATCAGACCTCTGCTCAGCCTACTGTCATACAAGAACCAACATCCAGTAATCCCACTGCCCTCATGCCCTAC
CCTGAATCTCTTCAAGACCCTTCGTGGTATATGGACAGCGGAGCTAGCAACCATGTTGCAACTGATTTCGGAAAACTCTCTCTAAAAGGTACCACCCCATCTCTGTCAAA
TGTGGTTGTGGGAAATGGAACAAAAGTTCCAATTAAAGGCGTTGGTTCCACATATATAAAAGGGAGAAAACGGAATTTATTGTTGAAAGACATTTTATTTGTGCCTCACA
TGAAAAAGAATCTAATAAGTGTCTCAAAAATCACTCAAGACAACCCGGTTCTAGTAGAGTTTCATGATGTGTTTTGTTTTGTAAAGGAAAAGAACTCCAGTGAGGTGTGC
TTGATTGGGAAGCTCGAGAATGGCCTCTACAGACTCTTAGAAGAACATGAAGCCTTAGTATGTCTAAGTGAAGTGAAGCAGAGAACTCAAAGTCAGTTTTGTGGACCTCC
ACCCCCTGTCCAAGGCGACCAACAACCATTTTTTGTTACCCCAACTGTTGTGCAATCTGTCCGACGTGATAATGGGGTTTTGATTAATGTCATTCAAAGTATTTTGTTAA
CTTGTAAAGTGGATATGTGGCATAAAAGACTAGGTCACCCCTCTCCCAAGATTTTAAATCAAGTTCTGCACATTTGTAATGCTTCTTCTAAGAACAATGAAAGTTTATCT
TTTTGTGAGGCATGTAAATTTGGTAAATCCCATCGGTTGCCCTTTACCTTATCTGATTCTCGAGCTTCTGGTGTGTTAGATTTAGTTCACTCGGATCTTTGGGGACCAAC
CCCTATTCGGTCTACACATGGTTATGCCTACTATATTGCCTTCTTGGATGATTATTCACGATACACTTGGATTTTCCCTTTGAAAACCAGAGCCGATGCTTTCTCAGTGT
TCACTCAGTTTAAGGCCCAAGCGGAAAAACAATATAATACCGTTTTAAAAACTCTTAGGTGCGATGGTGGTGGCGAATACAAACCGATTATTCAGTTTGCCAAAGAACAA
TGA
mRNA sequence
ATGACTGAAACCGAACACCCAAACCCAGAGCCCTCAATCGAAGCTTCAAAGAGCAACACGCCTTCTGGAATCACACCCAACAAAGGAAAGGAGATTATCGTTGAAAACAG
ACCGTTTGAAGTAAGATATGGAACCCAATCTTTGGGAACACCAACGTTTGCAAATCTCCTGAATCAGGTAACATCCATCAAGCTAGACCGAACAAATTACCTCCTCTGGC
AGAATATTGTTATTCCGATCCTCAAGAGCTACAAGCTTGAAGGCCACTTGTCTAGCAAAACTCCTGCGCCCGAAATGTCGATTATCCTGCCACCGTCCCAAAAGGAACCT
GATGGCCTGAAAATGCCCAATCCTGAATACGATCTGTGGCTGGCAGCAAACCAATTGTTGGTAGGATGGTTGTACAACTCGATGACACCAGAAATCACCACCCAAGTTAT
GGGACATGATGAAGCCAAACCTCTGTGGGACTCCATTCAGGAGTATTTCGATATTCAATCACGTTCACAAGAAGACTACAATCGCTTGATGCTTCAACAAACCAGAAAAG
GTACTATGAAGATGCATGAATATCTAGACACCATGAAAAGATTACAAGCATTAAAGGGAAATATCTCCATGAACCTGACATCAACCAAGCTTGCTTCAATAAATGTTGCA
AGTACAGAAGGACAAAAGGGTGGGATGCACACATCCAACCCAAACAATTACAATCAGAACAGTGGATTTGCTTCAAACAACAGTCAAACCAGAGGATACTCGTCAAATAG
AGGAAATAGGTATCGATCCAGGGGCAGGTACTCTCCTTATACTCCAAATAATAGACCAATCTGTCAACTTTGTGGAAAAATGGGCCACACTACAGTCATTTGTCACCACA
GATATGACAAGGCTAACTCCAATACACAAGATCACACAAATCAGACCTCTGCTCAGCCTACTGTCATACAAGAACCAACATCCAGTAATCCCACTGCCCTCATGCCCTAC
CCTGAATCTCTTCAAGACCCTTCGTGGTATATGGACAGCGGAGCTAGCAACCATGTTGCAACTGATTTCGGAAAACTCTCTCTAAAAGGTACCACCCCATCTCTGTCAAA
TGTGGTTGTGGGAAATGGAACAAAAGTTCCAATTAAAGGCGTTGGTTCCACATATATAAAAGGGAGAAAACGGAATTTATTGTTGAAAGACATTTTATTTGTGCCTCACA
TGAAAAAGAATCTAATAAGTGTCTCAAAAATCACTCAAGACAACCCGGTTCTAGTAGAGTTTCATGATGTGTTTTGTTTTGTAAAGGAAAAGAACTCCAGTGAGGTGTGC
TTGATTGGGAAGCTCGAGAATGGCCTCTACAGACTCTTAGAAGAACATGAAGCCTTAGTATGTCTAAGTGAAGTGAAGCAGAGAACTCAAAGTCAGTTTTGTGGACCTCC
ACCCCCTGTCCAAGGCGACCAACAACCATTTTTTGTTACCCCAACTGTTGTGCAATCTGTCCGACGTGATAATGGGGTTTTGATTAATGTCATTCAAAGTATTTTGTTAA
CTTGTAAAGTGGATATGTGGCATAAAAGACTAGGTCACCCCTCTCCCAAGATTTTAAATCAAGTTCTGCACATTTGTAATGCTTCTTCTAAGAACAATGAAAGTTTATCT
TTTTGTGAGGCATGTAAATTTGGTAAATCCCATCGGTTGCCCTTTACCTTATCTGATTCTCGAGCTTCTGGTGTGTTAGATTTAGTTCACTCGGATCTTTGGGGACCAAC
CCCTATTCGGTCTACACATGGTTATGCCTACTATATTGCCTTCTTGGATGATTATTCACGATACACTTGGATTTTCCCTTTGAAAACCAGAGCCGATGCTTTCTCAGTGT
TCACTCAGTTTAAGGCCCAAGCGGAAAAACAATATAATACCGTTTTAAAAACTCTTAGGTGCGATGGTGGTGGCGAATACAAACCGATTATTCAGTTTGCCAAAGAACAA
TGA
Protein sequence
MTETEHPNPEPSIEASKSNTPSGITPNKGKEIIVENRPFEVRYGTQSLGTPTFANLLNQVTSIKLDRTNYLLWQNIVIPILKSYKLEGHLSSKTPAPEMSIILPPSQKEP
DGLKMPNPEYDLWLAANQLLVGWLYNSMTPEITTQVMGHDEAKPLWDSIQEYFDIQSRSQEDYNRLMLQQTRKGTMKMHEYLDTMKRLQALKGNISMNLTSTKLASINVA
STEGQKGGMHTSNPNNYNQNSGFASNNSQTRGYSSNRGNRYRSRGRYSPYTPNNRPICQLCGKMGHTTVICHHRYDKANSNTQDHTNQTSAQPTVIQEPTSSNPTALMPY
PESLQDPSWYMDSGASNHVATDFGKLSLKGTTPSLSNVVVGNGTKVPIKGVGSTYIKGRKRNLLLKDILFVPHMKKNLISVSKITQDNPVLVEFHDVFCFVKEKNSSEVC
LIGKLENGLYRLLEEHEALVCLSEVKQRTQSQFCGPPPPVQGDQQPFFVTPTVVQSVRRDNGVLINVIQSILLTCKVDMWHKRLGHPSPKILNQVLHICNASSKNNESLS
FCEACKFGKSHRLPFTLSDSRASGVLDLVHSDLWGPTPIRSTHGYAYYIAFLDDYSRYTWIFPLKTRADAFSVFTQFKAQAEKQYNTVLKTLRCDGGGEYKPIIQFAKEQ
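
The protein sequence above should correspond to a translation of the CDS. The following is a minimal Python sketch of that check, assuming the standard genetic code applies to this nuclear gene. The helper `translate` and the two short test fragments (the first 48 and the last 15 nucleotides of the CDS) are illustrative choices, not something provided by the database.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # T.. codons
    "LLLLPPPPHHQQRRRR"   # C.. codons
    "IIIMTTTTNNKKSSRR"   # A.. codons
    "VVVVAAAADDEEGGGG"   # G.. codons
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the CDS of Lag0011808 shown above.
CDS_START = "ATGACTGAAACCGAACACCCAAACCCAGAGCCCTCAATCGAAGCTTCA"
CDS_END = "GCCAAAGAACAATGA"

print(translate(CDS_START))  # compare with the start of the protein sequence
print(translate(CDS_END))    # compare with the end of the protein sequence
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the concatenated protein lines would extend the same check to the whole record.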