CuGenDBv2

Lag0036141 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036141
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr3:40143048..40148741
RNA-Seq Expression: Lag0036141
Synteny: Lag0036141
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
                     GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR001584 - Integrase, catalytic core
                  IPR012337 - Ribonuclease H-like superfamily
                  IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits    e value    % identity    Alignment
GAU17915.1 hypothetical protein TSUD_330400, partial [Trifolium subterraneum]    2.7e-95    29.32
Query:  VKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVVDQLLLGWLYNSMTP
        VKL+R NY LWK++ L I+R  RL+G++ G K CP +F++            A  SS +                 NPE+E W   DQ LLGWL NSMT 
Subjt:  VKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVVDQLLLGWLYNSMTP

Query:  EVATQVMGYENAKDLWAAIQE-----------FLESSLEPKKIIFVRYFNRLVKV-----------------------LLGLDEEYNVVVAMIQGKIGIS
         +ATQ++  E +  LW                +L+S     +   ++  + L+K+                       L GLD EYN VV  +  +  + 
Subjt:  EVATQVMGYENAKDLWAAIQE-----------FLESSLEPKKIIFVRYFNRLVKV-----------------------LLGLDEEYNVVVAMIQGKIGIS

Query:  LSEMQAKLLVFEKRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGRNRGRGRWNGNNNNRLICQICGKNGYSALAFYQR
          ++QA+LL FE R+E  N+        + +++N       +S  R   F+ N     +N RG  G  RGRGR     +++  CQ+CG++ + A+  + R
Subjt:  LSEMQAKLLVFEKRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGRNRGRGRWNGNNNNRLICQICGKNGYSALAFYQR

Query:  FDKQFIGPSQNINRNISPGGNSFQNGGTSQMSPVAPQAFMTTQNTNPFVASLETVIDPNCKNLVS-VSKLAQDNNVYIEFHADSCLV-------KDIHTD
        FDK +   S + + N   G ++                F+ +QN+   V   +   D    N V+  +   QD     E H  + LV       + + T 
Subjt:  FDKQFIGPSQNINRNISPGGNSFQNGGTSQMSPVAPQAFMTTQNTNPFVASLETVIDPNCKNLVS-VSKLAQDNNVYIEFHADSCLV-------KDIHTD

Query:  KVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVWHRRLGHSMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAP
        + +LRG LKDGLYQL  +                           S  Y +V   K  WHR+LGH                 +     +L+HTD+WGPAP
Subjt:  KVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVWHRRLGHSMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAP

Query:  VMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFITMIKTQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVET
        ++S+ G++YYVHF+DDF+RF W+YPLK KSDT  A   F  M++ QFN+ +K +Q D GGEYK V +   + GIQ R+SCPYTS +NGRAERKHRH+ E 
Subjt:  VMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFITMIKTQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVET

Query:  SLTLLAQASMSLSYWWEAFTIATFLINGLPTPVLHGKSPIELLLKSKIECYGLTGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEM
         LTLLAQA M L+YWWEAF+ A +LIN LP+PV H +SP  LL K +                       P      P G    P               
Subjt:  SLTLLAQASMSLSYWWEAFTIATFLINGLPTPVLHGKSPIELLLKSKIECYGLTGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEM

Query:  NLSPTSELREDSWPDEEEKRGWAAGRWLTAERKEEENARRGSSAANTRGREKESPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLNNSQIFAY
         L P ++ +      +    G+                               S +HK ++C++S G+V ISRHV  NE  FPF  GF   LN       
Subjt:  NLSPTSELREDSWPDEEEKRGWAAGRWLTAERKEEENARRGSSAANTRGREKESPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLNNSQIFAY

Query:  TKSNPPLHNWLHEPQISCSNDLSSPFVPPSLITSTSPSTSSNPTVSPSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPDPVNHIPDPNISFSHQPSSPV
           +P  H  LH                       +  TSS+   S     T Q++   T         P   TV +  +  N+                
Subjt:  TKSNPPLHNWLHEPQISCSNDLSSPFVPPSLITSTSPSTSSNPTVSPSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPDPVNHIPDPNISFSHQPSSPV

Query:  TPYVASPSQRHSISACQPSASPSSSTPNNVVPSPSSPPILPSHPMITRVKTDIFKPKV-WLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQ
                                                 +H M TR K  I KPK+ ++ ++      +EP    +AL  P WK AM  EFQAL+ NQ
Subjt:  TPYVASPSQRHSISACQPSASPSSSTPNNVVPSPSSPPILPSHPMITRVKTDIFKPKV-WLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQ

Query:  TWCLVPLSPSQNVFGCKWVFRIKRNSDGSV
        TW L+P    +++   +WVF+IK  +DG++
Subjt:  TWCLVPLSPSQNVFGCKWVFRIKRNSDGSV

GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]    1.7e-105    29.94
Query:  TVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVVDQLLLGWLYNSMT
        +VKL+R NY LWK+L L ++R  +L+G++ G + CP +F+                +SS++   +NSA            +  W   DQ LLGW+ NSMT
Subjt:  TVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVVDQLLLGWLYNSMT

Query:  PEVATQVMGYENAKDLWAAIQEFLESSLEPKKIIFVRYFNR----------------------------------LVKVLLGLDEEYNVVVAMIQGKIGI
         E+ATQ++  E +K LW   Q    +    + I     F+                                   +++ L GLD EYN VV  +  +  +
Subjt:  PEVATQVMGYENAKDLWAAIQEFLESSLEPKKIIFVRYFNR----------------------------------LVKVLLGLDEEYNVVVAMIQGKIGI

Query:  SLSEMQAKLLVFEKRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGR--NRGRGRWNGNNNNRLICQICGKNGYSALAF
        S  ++QA+LL FE R+E  N+  + T      + N+A + D +  S            NNN RG   R    GRGR     N    CQ+CG + + A+  
Subjt:  SLSEMQAKLLVFEKRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGR--NRGRGRWNGNNNNRLICQICGKNGYSALAF

Query:  YQRFDKQFIGPSQNINRNISPGGNS--------------FQNGGTSQMSPVAPQ---------------------AFMTTQNTNPFVASLETVI-DPN-C
        + RFDK +   + +   +     N+              F +G ++ ++    +                     A + T ++     +L  ++  PN  
Subjt:  YQRFDKQFIGPSQNINRNISPGGNS--------------FQNGGTSQMSPVAPQ---------------------AFMTTQNTNPFVASLETVI-DPN-C

Query:  KNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVWHRRLGH---
        KNL+SVSKLA DNN+ +EF  + C VKD  T KV+L+G+LKDGLYQL                 S  K  P SAF++          K  WHRRLGH   
Subjt:  KNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVWHRRLGH---

Query:  ------------------------SMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFIT
                                + Q  K+    F    SHA  P +L+HTD+WGPAP+M++ G++YYVHF+DDFSRF W+YPLK KS+TV A   F  
Subjt:  ------------------------SMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFIT

Query:  MIKTQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPVLHGKSPIE
        + + QFN+ +KV+Q D GGEYK V +L  + GIQ R+SCPYTS +NGRAERKHRH+ E  LTLLAQA M L YWWEAF+ A +LIN LP+ V   +SP  
Subjt:  MIKTQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPVLHGKSPIE

Query:  LLLKSKIECYGLTGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWLTAERKEEENARRG
        L+L+ +                                                                  PD +  + +    +   +   +   +  
Subjt:  LLLKSKIECYGLTGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWLTAERKEEENARRG

Query:  SSAANTRGREKESPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLNNSQIFAYTKSNPPLHNWLHEPQISCSNDLSSPFVPPSLITSTSPSTSS
        ++     G    S +HK ++CL+S G++ ISRHV  NE  FPF  GF              +  PL   ++ P                  TS    T+ 
Subjt:  SSAANTRGREKESPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLNNSQIFAYTKSNPPLHNWLHEPQISCSNDLSSPFVPPSLITSTSPSTSS

Query:  NPTVSPSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPDPVNHIPDPNISFSHQPSSPVTPYVASPSQRHSISACQPSASPSSSTPNNVVPSPSSPPILP
        N     S P         TN       T     V+   +  N+ P  + + +H+ +  +T       Q+ S+      AS +++T               
Subjt:  NPTVSPSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPDPVNHIPDPNISFSHQPSSPVTPYVASPSQRHSISACQPSASPSSSTPNNVVPSPSSPPILP

Query:  SHPMITRVKTDIFKPKV-WLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGSV
        SH + TR K+ I KPK+ ++ ++ T   T EP+   +AL  P+WK AM  EF+AL+ N+TW LVP    +N+   KWVF+ K   DGS+
Subjt:  SHPMITRVKTDIFKPKV-WLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGSV

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]    3.3e-101    29.7
Query:  SSPPLNQLLNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVV
        +SP  N  L  I +VKL+R NY LWK+L L ++R  +L+G++ G   CP +F++                            S+  +  +NP++  W+  
Subjt:  SSPPLNQLLNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVV

Query:  DQLLLGWLYNSMTPEVATQVMGYENAKDLWAAIQEFLESSLEPKKIIFVRYFNR----------------------------------LVKVLLGLDEEY
        DQ LLGWL NSM  ++ATQ++  E +K LW   Q    +  + +       F+                                   +++ L GLD EY
Subjt:  DQLLLGWLYNSMTPEVATQVMGYENAKDLWAAIQEFLESSLEPKKIIFVRYFNR----------------------------------LVKVLLGLDEEY

Query:  NVVVAMIQGKIGISLSEMQAKLLVFEKRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGRNRGRGRWNGNNNNRLICQI
        N VV  +  +I +S  ++QA+LL FE RL+  N+    T      S N A K +     R   F+       +N RG  G  RG+GR +        CQ+
Subjt:  NVVVAMIQGKIGISLSEMQAKLLVFEKRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGRNRGRGRWNGNNNNRLICQI

Query:  CGKNGYSALAFYQRFDKQFIGPSQNINRNISPGGNS---------------FQNGGTSQMSPVAP--QAFMTTQNTN----------PFVASLET-----
        C   G+ A+    RFD+ + G + +   +   G +S               F +G  + ++      Q F      N            VAS  T     
Subjt:  CGKNGYSALAFYQRFDKQFIGPSQNINRNISPGGNS---------------FQNGGTSQMSPVAP--QAFMTTQNTN----------PFVASLET-----

Query:  ------VIDPNCKNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTR---ITGSASGSVGSNLKSANKSVPHSAFITSGIYANVL
               +    KNL+SVSKL  DNN+ +EF A+ C VKD  T + +L+G LKDGLYQL  +   +  S   S    L   N  V            NV 
Subjt:  ------VIDPNCKNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTR---ITGSASGSVGSNLKSANKSVPHSAFITSGIYANVL

Query:  VSKSVWHRRLGHSMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFITMIKTQFNESVKV
        +S S        + Q  K+    F    SH   P  LIH+D+WGPAP++S  G++YYVHF+DDFSRF W++PLK KSDT+ A   F  + + QFN+ +K+
Subjt:  VSKSVWHRRLGHSMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFITMIKTQFNESVKV

Query:  LQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPVLHGKSPIELLLKSKIECYGL
        +Q D GGEYK V ++  + GIQ R+SCPYTS +NGRAERKHRH+ E  LTLLAQA M L YWWEAF+ A +LIN LP+ V   +SP  L+ K +      
Subjt:  LQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPVLHGKSPIELLLKSKIECYGL

Query:  TGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWLTAERKEEENARRGSSAANTRGREKE
                         P      P G    P       +    H++    T              R    G                            
Subjt:  TGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWLTAERKEEENARRGSSAANTRGREKE

Query:  SPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLN-------NSQIFAYTKSNPPLHNWLHEPQISCSNDLSSPFVPPSLITSTSPSTSSNPTVS
        S +HK ++C++S G++ +SRHV  NE+ FPF  GF  + N       NS I   T S         EP  + ++D ++  +       +S +  +   V 
Subjt:  SPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLN-------NSQIFAYTKSNPPLHNWLHEPQISCSNDLSSPFVPPSLITSTSPSTSSNPTVS

Query:  PSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPDPVNHIPDPNISFSHQPSSPVTPYVASPSQRHSISACQPSASPSSSTPNNVVPSPSSPPILPSHPMI
         S  F      ++TN S+   I                  D ++    + +S +T  +   +Q+ + +                           +H M 
Subjt:  PSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPDPVNHIPDPNISFSHQPSSPVTPYVASPSQRHSISACQPSASPSSSTPNNVVPSPSSPPILPSHPMI

Query:  TRVKTDIFKPKV-WLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGSV
        TR K  I KPK+ ++ ++ TD   +EP  V +AL  P+WK AMD E++AL+ N TW LVP    +N+   KW+F+ K  SDGS+
Subjt:  TRVKTDIFKPKV-WLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGSV

PNX78574.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense]    3.0e-94    34.45
Query:  NQLLNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVVDQLLL
        N L ++I +V L+R N+ LWK+L L I+R  RL+G++ G K CP +F+++++                           A    INP++  W   DQ +L
Subjt:  NQLLNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVVDQLLL

Query:  GWLYNSMTPEVATQVMGYENAKDLWAAIQEFLESSLEPKKIIFVR-----------------------------------YFNRLVKVLLGLDEEYNVVV
        GWL N+MT   A+Q++  E +K LW   Q  L S+    ++I++R                                     + +++ L GLD +YN +V
Subjt:  GWLYNSMTPEVATQVMGYENAKDLWAAIQEFLESSLEPKKIIFVR-----------------------------------YFNRLVKVLLGLDEEYNVVV

Query:  AMIQGKIGISLSEMQAKLLVFEKRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGRNRGRGRWNGNNNNRLICQICGKN
          +  +I +S  ++QA+LL FE RL+  NS  +     +T   N+A K           F  N   +  + RG   RN   GR  G  +N  ICQ+C K+
Subjt:  AMIQGKIGISLSEMQAKLLVFEKRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGRNRGRGRWNGNNNNRLICQICGKN

Query:  GYSALAFYQRFDKQFIGPS---QNINRN------ISPGGNS------FQNGGTSQMSPVAPQAFMTTQNT--NPFVA--SLETVIDPN------------
        G++A+    R+DK + G S    N+ R       ++   NS      F +G ++ ++  A +    T+N+  N  +     +  ID +            
Subjt:  GYSALAFYQRFDKQFIGPS---QNINRN------ISPGGNS------FQNGGTSQMSPVAPQAFMTTQNT--NPFVA--SLETVIDPN------------

Query:  -------CKNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVWH
                KNL+SVSKL  DNN+ +EF  D C VKD  T KV+LRG+LKDGLYQL       ++GS  +N                 +Y +V   K  WH
Subjt:  -------CKNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVWH

Query:  RRLGH---------------------------SMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTV
        R+LGH                           + Q+ K     F    SHA    +LIHTD+WGPAP+ S  G++YYVHF+DD SRF W+YPLK KSDT+
Subjt:  RRLGH---------------------------SMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTV

Query:  AAITHFITMIKTQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPV
         A   F  M++ QFN+ +K++Q D GGE+K V ++  + GI+ R+SCPYTS +NGRAERKHRH+ E  LTLLAQA+MSL YWWEAF+ A +LIN LP+ V
Subjt:  AAITHFITMIKTQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPV

Query:  LHGKSPIELLLKSK
           +SP  L+ K +
Subjt:  LHGKSPIELLLKSK

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]    3.5e-106    29.87
Query:  LNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVVDQLLLGWL
        L    +VKL+R N+ LWK+L L ++R  + +G++ G K CP +F+       T++ N                     T  INP+Y+ W   DQ LLGWL
Subjt:  LNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVVDQLLLGWL

Query:  YNSMTPEVATQVMGYENAKDLWAAIQEFLESSLEPKKIIFVRYFNR----------------------------------LVKVLLGLDEEYNVVVAMIQ
         NSMT ++ATQV+  E +K LW   Q    +    + I     F+                                   +++ L GLD EYN VV  + 
Subjt:  YNSMTPEVATQVMGYENAKDLWAAIQEFLESSLEPKKIIFVRYFNR----------------------------------LVKVLLGLDEEYNVVVAMIQ

Query:  GKIGISLSEMQAKLLVFEKRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGRNRGRGRWNGNNNNRLICQICGKNGYSA
         +  IS  + QA+LL FE RL+  N   +        S N A+K +    S    F        +N RG  G   GRGR   +   R ICQICGK G++A
Subjt:  GKIGISLSEMQAKLLVFEKRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGRNRGRGRWNGNNNNRLICQICGKNGYSA

Query:  LAFYQRFDKQFI-----------------------------------------GPSQNINRNISPGGNSFQNGGTSQMSPVAPQAFMTTQNTNPFVASLE
           Y RFDK +                                          G  Q++N N   G NS   G   ++        + + +T     +L 
Subjt:  LAFYQRFDKQFI-----------------------------------------GPSQNINRNISPGGNSFQNGGTSQMSPVAPQAFMTTQNTNPFVASLE

Query:  TV--IDPNCKNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVW
         V  +    KNL+SVSKL  DNN  +EF  + C VKD  T K +L+G LKDGLYQL                 SANK  P     T+      +  K +W
Subjt:  TV--IDPNCKNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVW

Query:  HRRLGH---------------------------SMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDT
        HR+LGH                           + Q  K+    F    SHA  P DLIHTD+WGPAP++S   ++YYVHFLDDFSRF W++PLK KS+T
Subjt:  HRRLGH---------------------------SMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDT

Query:  VAAITHFITMIKTQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTP
        + A   F  +++ QFN+ +KV++ D GGEYK V +    +GIQ ++SCPYTS +NGRAERKHRH+ E  LTLLAQA M LSYWWEAF+ A +LIN LP+ 
Subjt:  VAAITHFITMIKTQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTP

Query:  VLHGKSPIELLLKSKIECYGLTGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWLTAER
        V   +SP  L+ K +                       P      P G    P                L P ++ +           G+          
Subjt:  VLHGKSPIELLLKSKIECYGLTGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWLTAER

Query:  KEEENARRGSSAANTRGREKESPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLNNSQIFAYTKSNPPLHNWLHEPQISCSNDLSSPFVPPSLI
                             S +HK ++C++S G+V +SRHV  NE+ FPF  GF  + N  ++                  ++    +  P  P  + 
Subjt:  KEEENARRGSSAANTRGREKESPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLNNSQIFAYTKSNPPLHNWLHEPQISCSNDLSSPFVPPSLI

Query:  TSTSPSTSSNPTVSPSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPDPVNHIPDPNISFSHQPSSPVTPYVASPSQRHSISACQPSASPSSSTPNNVVP
        T+ +   + N      P                  I    D  S+  D   H  + N S              +     S  A    +    S P     
Subjt:  TSTSPSTSSNPTVSPSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPDPVNHIPDPNISFSHQPSSPVTPYVASPSQRHSISACQPSASPSSSTPNNVVP

Query:  SPSSPPILPSHPMITRVKTDIFKPKV-WLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGSV
         P    I  +H M TR K  ++KPK+ ++ ++      +EP  V +AL  P W  AMD E++AL+ N+TW LVP    +NV   KW+F+ K  +DG++
Subjt:  SPSSPPILPSHPMITRVKTDIFKPKV-WLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGSV

TrEMBL top hits    e value    % identity    Alignment
A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)    1.7e-106    29.87
Query:  LNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVVDQLLLGWL
        L    +VKL+R N+ LWK+L L ++R  + +G++ G K CP +F+       T++ N                     T  INP+Y+ W   DQ LLGWL
Subjt:  LNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVVDQLLLGWL

Query:  YNSMTPEVATQVMGYENAKDLWAAIQEFLESSLEPKKIIFVRYFNR----------------------------------LVKVLLGLDEEYNVVVAMIQ
         NSMT ++ATQV+  E +K LW   Q    +    + I     F+                                   +++ L GLD EYN VV  + 
Subjt:  YNSMTPEVATQVMGYENAKDLWAAIQEFLESSLEPKKIIFVRYFNR----------------------------------LVKVLLGLDEEYNVVVAMIQ

Query:  GKIGISLSEMQAKLLVFEKRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGRNRGRGRWNGNNNNRLICQICGKNGYSA
         +  IS  + QA+LL FE RL+  N   +        S N A+K +    S    F        +N RG  G   GRGR   +   R ICQICGK G++A
Subjt:  GKIGISLSEMQAKLLVFEKRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGRNRGRGRWNGNNNNRLICQICGKNGYSA

Query:  LAFYQRFDKQFI-----------------------------------------GPSQNINRNISPGGNSFQNGGTSQMSPVAPQAFMTTQNTNPFVASLE
           Y RFDK +                                          G  Q++N N   G NS   G   ++        + + +T     +L 
Subjt:  LAFYQRFDKQFI-----------------------------------------GPSQNINRNISPGGNSFQNGGTSQMSPVAPQAFMTTQNTNPFVASLE

Query:  TV--IDPNCKNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVW
         V  +    KNL+SVSKL  DNN  +EF  + C VKD  T K +L+G LKDGLYQL                 SANK  P     T+      +  K +W
Subjt:  TV--IDPNCKNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVW

Query:  HRRLGH---------------------------SMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDT
        HR+LGH                           + Q  K+    F    SHA  P DLIHTD+WGPAP++S   ++YYVHFLDDFSRF W++PLK KS+T
Subjt:  HRRLGH---------------------------SMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDT

Query:  VAAITHFITMIKTQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTP
        + A   F  +++ QFN+ +KV++ D GGEYK V +    +GIQ ++SCPYTS +NGRAERKHRH+ E  LTLLAQA M LSYWWEAF+ A +LIN LP+ 
Subjt:  VAAITHFITMIKTQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTP

Query:  VLHGKSPIELLLKSKIECYGLTGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWLTAER
        V   +SP  L+ K +                       P      P G    P                L P ++ +           G+          
Subjt:  VLHGKSPIELLLKSKIECYGLTGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWLTAER

Query:  KEEENARRGSSAANTRGREKESPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLNNSQIFAYTKSNPPLHNWLHEPQISCSNDLSSPFVPPSLI
                             S +HK ++C++S G+V +SRHV  NE+ FPF  GF  + N  ++                  ++    +  P  P  + 
Subjt:  KEEENARRGSSAANTRGREKESPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLNNSQIFAYTKSNPPLHNWLHEPQISCSNDLSSPFVPPSLI

Query:  TSTSPSTSSNPTVSPSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPDPVNHIPDPNISFSHQPSSPVTPYVASPSQRHSISACQPSASPSSSTPNNVVP
        T+ +   + N      P                  I    D  S+  D   H  + N S              +     S  A    +    S P     
Subjt:  TSTSPSTSSNPTVSPSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPDPVNHIPDPNISFSHQPSSPVTPYVASPSQRHSISACQPSASPSSSTPNNVVP

Query:  SPSSPPILPSHPMITRVKTDIFKPKV-WLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGSV
         P    I  +H M TR K  ++KPK+ ++ ++      +EP  V +AL  P W  AMD E++AL+ N+TW LVP    +NV   KW+F+ K  +DG++
Subjt:  SPSSPPILPSHPMITRVKTDIFKPKV-WLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGSV

A0A2Z6MBG6 Integrase catalytic domain-containing protein    8.3e-106    29.94
Query:  TVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVVDQLLLGWLYNSMT
        +VKL+R NY LWK+L L ++R  +L+G++ G + CP +F+                +SS++   +NSA            +  W   DQ LLGW+ NSMT
Subjt:  TVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVVDQLLLGWLYNSMT

Query:  PEVATQVMGYENAKDLWAAIQEFLESSLEPKKIIFVRYFNR----------------------------------LVKVLLGLDEEYNVVVAMIQGKIGI
         E+ATQ++  E +K LW   Q    +    + I     F+                                   +++ L GLD EYN VV  +  +  +
Subjt:  PEVATQVMGYENAKDLWAAIQEFLESSLEPKKIIFVRYFNR----------------------------------LVKVLLGLDEEYNVVVAMIQGKIGI

Query:  SLSEMQAKLLVFEKRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGR--NRGRGRWNGNNNNRLICQICGKNGYSALAF
        S  ++QA+LL FE R+E  N+  + T      + N+A + D +  S            NNN RG   R    GRGR     N    CQ+CG + + A+  
Subjt:  SLSEMQAKLLVFEKRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGR--NRGRGRWNGNNNNRLICQICGKNGYSALAF

Query:  YQRFDKQFIGPSQNINRNISPGGNS--------------FQNGGTSQMSPVAPQ---------------------AFMTTQNTNPFVASLETVI-DPN-C
        + RFDK +   + +   +     N+              F +G ++ ++    +                     A + T ++     +L  ++  PN  
Subjt:  YQRFDKQFIGPSQNINRNISPGGNS--------------FQNGGTSQMSPVAPQ---------------------AFMTTQNTNPFVASLETVI-DPN-C

Query:  KNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVWHRRLGH---
        KNL+SVSKLA DNN+ +EF  + C VKD  T KV+L+G+LKDGLYQL                 S  K  P SAF++          K  WHRRLGH   
Subjt:  KNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVWHRRLGH---

Query:  ------------------------SMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFIT
                                + Q  K+    F    SHA  P +L+HTD+WGPAP+M++ G++YYVHF+DDFSRF W+YPLK KS+TV A   F  
Subjt:  ------------------------SMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFIT

Query:  MIKTQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPVLHGKSPIE
        + + QFN+ +KV+Q D GGEYK V +L  + GIQ R+SCPYTS +NGRAERKHRH+ E  LTLLAQA M L YWWEAF+ A +LIN LP+ V   +SP  
Subjt:  MIKTQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPVLHGKSPIE

Query:  LLLKSKIECYGLTGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWLTAERKEEENARRG
        L+L+ +                                                                  PD +  + +    +   +   +   +  
Subjt:  LLLKSKIECYGLTGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWLTAERKEEENARRG

Query:  SSAANTRGREKESPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLNNSQIFAYTKSNPPLHNWLHEPQISCSNDLSSPFVPPSLITSTSPSTSS
        ++     G    S +HK ++CL+S G++ ISRHV  NE  FPF  GF              +  PL   ++ P                  TS    T+ 
Subjt:  SSAANTRGREKESPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLNNSQIFAYTKSNPPLHNWLHEPQISCSNDLSSPFVPPSLITSTSPSTSS

Query:  NPTVSPSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPDPVNHIPDPNISFSHQPSSPVTPYVASPSQRHSISACQPSASPSSSTPNNVVPSPSSPPILP
        N     S P         TN       T     V+   +  N+ P  + + +H+ +  +T       Q+ S+      AS +++T               
Subjt:  NPTVSPSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPDPVNHIPDPNISFSHQPSSPVTPYVASPSQRHSISACQPSASPSSSTPNNVVPSPSSPPILP

Query:  SHPMITRVKTDIFKPKV-WLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGSV
        SH + TR K+ I KPK+ ++ ++ T   T EP+   +AL  P+WK AM  EF+AL+ N+TW LVP    +N+   KWVF+ K   DGS+
Subjt:  SHPMITRVKTDIFKPKV-WLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGSV

A0A2Z6P4D5 Integrase catalytic domain-containing protein    1.6e-101    29.7
Query:  SSPPLNQLLNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVV
        +SP  N  L  I +VKL+R NY LWK+L L ++R  +L+G++ G   CP +F++                            S+  +  +NP++  W+  
Subjt:  SSPPLNQLLNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVV

Query:  DQLLLGWLYNSMTPEVATQVMGYENAKDLWAAIQEFLESSLEPKKIIFVRYFNR----------------------------------LVKVLLGLDEEY
        DQ LLGWL NSM  ++ATQ++  E +K LW   Q    +  + +       F+                                   +++ L GLD EY
Subjt:  DQLLLGWLYNSMTPEVATQVMGYENAKDLWAAIQEFLESSLEPKKIIFVRYFNR----------------------------------LVKVLLGLDEEY

Query:  NVVVAMIQGKIGISLSEMQAKLLVFEKRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGRNRGRGRWNGNNNNRLICQI
        N VV  +  +I +S  ++QA+LL FE RL+  N+    T      S N A K +     R   F+       +N RG  G  RG+GR +        CQ+
Subjt:  NVVVAMIQGKIGISLSEMQAKLLVFEKRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGRNRGRGRWNGNNNNRLICQI

Query:  CGKNGYSALAFYQRFDKQFIGPSQNINRNISPGGNS---------------FQNGGTSQMSPVAP--QAFMTTQNTN----------PFVASLET-----
        C   G+ A+    RFD+ + G + +   +   G +S               F +G  + ++      Q F      N            VAS  T     
Subjt:  CGKNGYSALAFYQRFDKQFIGPSQNINRNISPGGNS---------------FQNGGTSQMSPVAP--QAFMTTQNTN----------PFVASLET-----

Query:  ------VIDPNCKNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTR---ITGSASGSVGSNLKSANKSVPHSAFITSGIYANVL
               +    KNL+SVSKL  DNN+ +EF A+ C VKD  T + +L+G LKDGLYQL  +   +  S   S    L   N  V            NV 
Subjt:  ------VIDPNCKNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTR---ITGSASGSVGSNLKSANKSVPHSAFITSGIYANVL

Query:  VSKSVWHRRLGHSMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFITMIKTQFNESVKV
        +S S        + Q  K+    F    SH   P  LIH+D+WGPAP++S  G++YYVHF+DDFSRF W++PLK KSDT+ A   F  + + QFN+ +K+
Subjt:  VSKSVWHRRLGHSMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFITMIKTQFNESVKV

Query:  LQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPVLHGKSPIELLLKSKIECYGL
        +Q D GGEYK V ++  + GIQ R+SCPYTS +NGRAERKHRH+ E  LTLLAQA M L YWWEAF+ A +LIN LP+ V   +SP  L+ K +      
Subjt:  LQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPVLHGKSPIELLLKSKIECYGL

Query:  TGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWLTAERKEEENARRGSSAANTRGREKE
                         P      P G    P       +    H++    T              R    G                            
Subjt:  TGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWLTAERKEEENARRGSSAANTRGREKE

Query:  SPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLN-------NSQIFAYTKSNPPLHNWLHEPQISCSNDLSSPFVPPSLITSTSPSTSSNPTVS
        S +HK ++C++S G++ +SRHV  NE+ FPF  GF  + N       NS I   T S         EP  + ++D ++  +       +S +  +   V 
Subjt:  SPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLN-------NSQIFAYTKSNPPLHNWLHEPQISCSNDLSSPFVPPSLITSTSPSTSSNPTVS

Query:  PSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPDPVNHIPDPNISFSHQPSSPVTPYVASPSQRHSISACQPSASPSSSTPNNVVPSPSSPPILPSHPMI
         S  F      ++TN S+   I                  D ++    + +S +T  +   +Q+ + +                           +H M 
Subjt:  PSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPDPVNHIPDPNISFSHQPSSPVTPYVASPSQRHSISACQPSASPSSSTPNNVVPSPSSPPILPSHPMI

Query:  TRVKTDIFKPKV-WLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGSV
        TR K  I KPK+ ++ ++ TD   +EP  V +AL  P+WK AMD E++AL+ N TW LVP    +N+   KW+F+ K  SDGS+
Subjt:  TRVKTDIFKPKV-WLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGSV

A0A803PEH4 Uncharacterized protein    8.9e-100    30.36
Query:  SSSSVNMAASASLPIFSSPPLNQLLNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSS
        +SSS N   ++ LP   +PP    LNQ  ++KL+R NY LWK +   I+R +RL G+LSG   CP +F+    V  T VT                    
Subjt:  SSSSVNMAASASLPIFSSPPLNQLLNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSS

Query:  AVTLTINPEYESWLVVDQLLLGWLYNSMTPEVATQVMGYENAKDLWAAIQEFLESSLEPK--------------KIIFVRYFNR----------------
              NPEYE+W++ DQLL+GWLY+SMT  +AT+VMG  +A +L   ++    +  + K                +   Y  +                
Subjt:  AVTLTINPEYESWLVVDQLLLGWLYNSMTPEVATQVMGYENAKDLWAAIQEFLESSLEPK--------------KIIFVRYFNR----------------

Query:  ----LVKVLLGLDEEYNVVVAMIQGKIGISLSEMQAKLLVFEKRLE-IQN-SQRSSTTFGSTISVNMATKGDGQS---GSRQKNFSINRQQYNNNQRGGG
            +  VL GLD EY  +V  I+ +   +  E+Q  LL F+ ++E +QN +  S+    S+   NMA K +      G + +N S N     +N RG  
Subjt:  ----LVKVLLGLDEEYNVVVAMIQGKIGISLSEMQAKLLVFEKRLE-IQN-SQRSSTTFGSTISVNMATKGDGQS---GSRQKNFSINRQQYNNNQRGGG

Query:  GRNRGRGRWNGNNNNRLICQICGKNGYSALAFYQRFDKQFIGPSQNINRNISPGGNS-------------------FQNGGTSQMSPVAPQAFMTTQNTN
         R RGRGR  G + +R  CQ+ GK G++A   Y RFD+ ++G   N   N +  G +                   F + G S      P      Q+ N
Subjt:  GRNRGRGRWNGNNNNRLICQICGKNGYSALAFYQRFDKQFIGPSQNINRNISPGGNS-------------------FQNGGTSQMSPVAPQAFMTTQNTN

Query:  ----------------------------PFVASLETVIDPN-CKNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASG
                                     ++   + ++ P   KNLVSVSKLA DNNV IEF+++ CLVKD  T KV+L GVLKD LYQL +  T S+  
Subjt:  ----------------------------PFVASLETVIDPN-CKNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASG

Query:  SVGSNLKSANKSVPHSAFITSGIYANVLVSKSVWHRRLGH-----------------SMQIWKVSCSSFSKLKSHAIAPF-----------DLIHTDLWG
           SN  SA      S    S   + ++    V HRRLGH                 S    K  C +    K+HA+ PF           DLIHTDLWG
Subjt:  SVGSNLKSANKSVPHSAFITSGIYANVLVSKSVWHRRLGH-----------------SMQIWKVSCSSFSKLKSHAIAPF-----------DLIHTDLWG

Query:  PAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFITMIKTQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHL
        PAP+ S   + YY+HF+DD+SR+ W+YPLKLKSD +AA   F  +++ QF + +K L+SD+GGEYK    L   +GI+ +  CP+TS +NGRA+RKHRH 
Subjt:  PAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFITMIKTQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHL

Query:  VETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPVLHGKSPIELLLKSKIECYGLTGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRN
        VE  LTLLAQA+                      P L      +    S I+C  L                                            
Subjt:  VETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPVLHGKSPIELLLKSKIECYGLTGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRN

Query:  HEMNLSPTSELREDSWPDEEEKRGWAAGRWLTAERKEEENARRGSSAANTRGREKESPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLNNSQI
                               G+                               S ++K ++CLS  G++ IS+ V  NE  FPF TGF ++ N  Q 
Subjt:  HEMNLSPTSELREDSWPDEEEKRGWAAGRWLTAERKEEENARRGSSAANTRGREKESPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLNNSQI

Query:  FAYTKSNPPLHNWLHEPQISCSNDLSSPFVPPSLITSTSPSTSSNPTVSPSPPFTPQSAQLSTNPSTRPFITPPPDTVS-LIPDPVNHIPDPNISFSHQP
                                       P +I STS S       SPSP     S+  S++  T P  +P   + S     P+ H   P+   SH  
Subjt:  FAYTKSNPPLHNWLHEPQISCSNDLSSPFVPPSLITSTSPSTSSNPTVSPSPPFTPQSAQLSTNPSTRPFITPPPDTVS-LIPDPVNHIPDPNISFSHQP

Query:  SSPVTPYVASPSQRHSISACQPSASPSSSTPNNVVPSPSSPPILPSHPMITRVKTDIFKPKVWLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALL
         S    Y       H++        P+ +     +P+  + P LPSHPMITR K  IFKP+ ++S   ++ +  EP  VV+A+  P W  AM++EF AL 
Subjt:  SSPVTPYVASPSQRHSISACQPSASPSSSTPNNVVPSPSSPPILPSHPMITRVKTDIFKPKVWLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALL

Query:  RNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGSV
           T  LVP S + N+ G KWV+RIK N+DG+V
Subjt:  RNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGSV

A0A803PM38 Uncharacterized protein | 8.9e-108 | 31.68
Query:  PPLNQLLNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVVDQ
        P     LNQ   +KL+R N+ LW+ +   I+R +RL+G+L G  P P +FLS+        T+  GS SS                 +NP +E W+V DQ
Subjt:  PPLNQLLNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVVDQ

Query:  LLLGWLYNSMTPEVATQVMGYENAKDLWAAIQEFLESSLEPK------KIIFVR--------YF-------------------NRLV-KVLLGLDEEYNV
        LLLGWLY SMT  +A +VMG +++  LW A++E   +  + K      KI   R        Y                    N+LV  VL GLD EY  
Subjt:  LLLGWLYNSMTPEVATQVMGYENAKDLWAAIQEFLESSLEPK------KIIFVR--------YF-------------------NRLV-KVLLGLDEEYNV

Query:  VVAMIQGKIGISLSEMQAKLLVFEKRLEIQNSQRSS---TTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGG-----GGRNRGRGRWNGNNNN
        +V +I+ +   +  ++Q  LL  + ++E  +S   S   T      S ++A KG             NR  +NNN RGG     G  NR RGR    +  
Subjt:  VVAMIQGKIGISLSEMQAKLLVFEKRLEIQNSQRSS---TTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGG-----GGRNRGRGRWNGNNNN

Query:  RLICQICGKNGYSALAFYQRFDKQFIGPSQNINRNISPGGNSFQNGGTSQMSPVAPQAF---------MTTQNTNPFVASLETVIDPNCKNLVSVSKLAQ
        R  CQ+CGK G+SA   Y R      G S +I   I+      +  G  +++                + T + +P +      +    KNL+S+SKL  
Subjt:  RLICQICGKNGYSALAFYQRFDKQFIGPSQNINRNISPGGNSFQNGGTSQMSPVAPQAF---------MTTQNTNPFVASLETVIDPNCKNLVSVSKLAQ

Query:  DNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANV--------LVS-KSVWHRRLGH-----
        DNNV +EF +D C VKD  T +VVL+G LKDGLYQ       +++ S+ SN +S +     S  + S + +NV        L S K  WHRRLGH     
Subjt:  DNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANV--------LVS-KSVWHRRLGH-----

Query:  ---------------------SMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFITMIK
                             + Q+ K     F      A AP +L+HTD+WGP+P+MS   +RYY+HF+DDFSR+ W+YPLK KS+ +AA   F  +++
Subjt:  ---------------------SMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFITMIK

Query:  TQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPVLHGKSPIELLL
         QFN  VK +Q+D GGEY+   +  + +GI  +  CP+TS +NGRAERKHRH+VE  LTLLAQA +   YWW+AF  A +LIN LPTPVL  K+P E+L 
Subjt:  TQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPVLHGKSPIELLL

Query:  KSKIECYGLTGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWLTAERKEEENARRGSSA
        K +                       P  +     G    P       R  +NH+     T                                       
Subjt:  KSKIECYGLTGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWLTAERKEEENARRGSSA

Query:  ANTRGREKESPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLNNSQIFAYTKSNPPLHNWLHEPQISCSNDLSSPFVPPSLITSTSPSTSSNPT
         N    +K    HK ++CLSS G++ ISR V  NE +FPF +GF   LN ++    T  +  +  W     ++  +   + F        T       PT
Subjt:  ANTRGREKESPAHKRHRCLSSDGKVIISRHVKLNESDFPFATGFGSSLNNSQIFAYTKSNPPLHNWLHEPQISCSNDLSSPFVPPSLITSTSPSTSSNPT

Query:  VSPSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPD--PVNHIPDPNISFSHQPSSPVTPYVASPSQRHSISACQPSASPSSSTPNNVVPSPSSPPILPS
         S   P       LST            DT  +I D   ++ I D  I                  Q+H+ +    SA+    T        +   ++ +
Subjt:  VSPSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPD--PVNHIPDPNISFSHQPSSPVTPYVASPSQRHSISACQPSASPSSSTPNNVVPSPSSPPILPS

Query:  HPMITRVKTDIFKPKVWLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGS
        HPMITR K  IFKPK +L+ +    ++ EP  + +AL    W  AM  E  AL RN TW LVP  P  ++   KWV++ KRN+DGS
Subjt:  HPMITRVKTDIFKPKVWLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGS

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | 9.4e-22 | 34.52
Query:  KLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFITMIKTQFNESVKVLQSDNGGEY--KKVHQLCAQNGIQS
        K K+H   P  ++H+D+ GP   ++ D   Y+V F+D F+ +   Y +K KSD  +    F+   +  FN  V  L  DNG EY   ++ Q C + GI  
Subjt:  KLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFITMIKTQFNESVKVLQSDNGGEY--KKVHQLCAQNGIQS

Query:  RLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPVL--HGKSPIEL
         L+ P+T   NG +ER  R + E + T+++ A +  S+W EA   AT+LIN +P+  L    K+P E+
Subjt:  RLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPVL--HGKSPIEL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 3.0e-20 | 24.85
Query:  NRLVKVLLGLDEEY-NVVVAMIQGKIGISLSEMQAKLLVFEK-RLEIQNSQRSSTTFGSTISVNMATKGDGQSGSR--QKNFSINRQQ--YNNNQRGGGG
        ++ + +L  L   Y N+   ++ GK  I L ++ + LL+ EK R + +N  ++  T G   S   ++   G+SG+R   KN S +R +  YN NQ G   
Subjt:  NRLVKVLLGLDEEY-NVVVAMIQGKIGISLSEMQAKLLVFEK-RLEIQNSQRSSTTFGSTISVNMATKGDGQSGSR--QKNFSINRQQ--YNNNQRGGGG

Query:  RN-----RGRGRWNGNNNNRLICQICGKNGYSALAFYQRFDKQFIGPSQNINRNISPGGN-SFQNGGTSQMSPV-------APQAFMTTQNTNPFVASLE
        R+     +G+G  +G  N+     +   N    L         FI   +       P          +   +PV           F T +  N   + + 
Subjt:  RN-----RGRGRWNGNNNNRLICQICGKNGYSALAFYQRFDKQFIGPSQNINRNISPGGN-SFQNGGTSQMSPV-------APQAFMTTQNTNPFVASLE

Query:  TVIDPNCKNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVWHR
         + D   K  V  + + +D     +   +  L+  I  D+        +  ++L       A G     L   N  +           A   +S  +WH+
Subjt:  TVIDPNCKNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVWHR

Query:  RLGHSMQ-----IWKVSCSSFSK------------LKSHAIA----------PFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVA
        R+GH  +     + K S  S++K             K H ++            DL+++D+ GP  + S  G +Y+V F+DD SR +WVY LK K     
Subjt:  RLGHSMQ-----IWKVSCSSFSK------------LKSHAIA----------PFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVA

Query:  AITHFITMIKTQFNESVKVLQSDNGGEY--KKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTP
            F  +++ +    +K L+SDNGGEY  ++  + C+ +GI+   + P T   NG AER +R +VE   ++L  A +  S+W EA   A +LIN  P+ 
Subjt:  AITHFITMIKTQFNESVKVLQSDNGGEY--KKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTP

Query:  VLHGKSP
         L  + P
Subjt:  VLHGKSP

Q07791 Transposon Ty2-DR3 Gag-Pol polyprotein | 9.1e-17 | 31.35
Query:  LVSKSVWHRRLGHSMQIWKVSCSSFSKLK-SHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPL--KLKSDTVAAITHFITMIKTQFNE
        L+ KS  HR +              S+LK   +  PF  +HTD++GP   +      Y++ F D+ +RF WVYPL  + +   +   T  +  IK QFN 
Subjt:  LVSKSVWHRRLGHSMQIWKVSCSSFSKLK-SHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPL--KLKSDTVAAITHFITMIKTQFNE

Query:  SVKVLQSDNGGEY--KKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTP
         V V+Q D G EY  K +H+     GI +  +    S  +G AER +R L+    TLL  + +    W+ A   +T + N L +P
Subjt:  SVKVLQSDNGGEY--KKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 8.1e-50 | 25.22
Query:  MAASA-SLPIFSSPPLNQLLNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLT
        MAA A  L + ++  LN  ++ +T  KL   NYL+W      +   Y L G L G    P   + T      N          + +    SA   A++++
Subjt:  MAASA-SLPIFSSPPLNQLLNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLT

Query:  INPEY-------ESWLVVDQLLLGWLYNSMTPEVATQVMGYENAKDLWAAIQEFLESSLEPKKIIF--VRYFNRLVKVLLGLDEEYNVVVAMIQGK-IGI
        + P         + W  + ++     Y  +T ++ TQ+  +           + L +  +   ++   + +  ++ +VL  L EEY  V+  I  K    
Subjt:  INPEY-------ESWLVVDQLLLGWLYNSMTPEVATQVMGYENAKDLWAAIQEFLESSLEPKKIIF--VRYFNRLVKVLLGLDEEYNVVVAMIQGK-IGI

Query:  SLSEMQAKLLVFE-KRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGRNRGRGRWNGNNNNRL----ICQICGKNGYSA
        +L+E+  +LL  E K L + ++     T  +    N  T  +  +G+R      NR    NN        +    ++ NNN        CQICG  G+SA
Subjt:  SLSEMQAKLLVFE-KRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGRNRGRGRWNGNNNNRL----ICQICGKNGYSA

Query:  LAFYQ----------------------RFDKQFIGPSQNINRNISPGG-----NSFQNGGTSQMSPVAPQAFMTTQNTNPFVASLETVID----------
            Q                      R +     P  + N  +  G      + F N    Q         +   +T P   +  T +           
Subjt:  LAFYQ----------------------RFDKQFIGPSQNINRNISPGG-----NSFQNGGTSQMSPVAPQAFMTTQNTNPFVASLETVID----------

Query:  ----PNC-KNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVWH
            PN  KNL+SV +L   N V +EF   S  VKD++T   +L+G  KD LY+        AS    S   S +    HS+                WH
Subjt:  ----PNC-KNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVWH

Query:  RRLGH-------------SMQIWK-----VSCSS----------FSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDT
         RLGH             S+ +       +SCS           FS+   ++  P + I++D+W  +P++S D YRYYV F+D F+R+ W+YPLK KS  
Subjt:  RRLGH-------------SMQIWK-----VSCSS----------FSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDT

Query:  VAAITHFITMIKTQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTP
              F  +++ +F   +    SDNGGE+  + +  +Q+GI    S P+T   NG +ERKHRH+VET LTLL+ AS+  +YW  AF +A +LIN LPTP
Subjt:  VAAITHFITMIKTQFNESVKVLQSDNGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTP

Query:  VLHGKSPIELLLKSKIECYGLTGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWLTA--
        +L  +SP + L                                                           SP          D+    G A   WL    
Subjt:  VLHGKSPIELLLKSKIECYGLTGRETSGKEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWLTA--

Query:  ERKEEENARRGSSAANTRGREKESPAHKRHRCLS-SDGKVIISRHVKLNESDFPFATGFGS-------SLNNSQIFAYTKSNPPLHNWLHEPQISCSNDL
        + K ++ +R+             S     + CL     ++ ISRHV+ +E+ FPF+    +          +S +++   + P     L  P  S  +  
Subjt:  ERKEEENARRGSSAANTRGREKESPAHKRHRCLS-SDGKVIISRHVKLNESDFPFATGFGS-------SLNNSQIFAYTKSNPPLHNWLHEPQISCSNDL

Query:  SSPFVPPSLITSTSPSTSSNPTVS-----PSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPDPVNHIPDPNISFSHQPSSPVTPYVASPSQRHSISACQ
        ++P   PS     S  +SSN   S     PS P      Q    P+T+P  T      S      N   +     +   S+P     +SPS   S S+  
Subjt:  SSPFVPPSLITSTSPSTSSNPTVS-----PSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPDPVNHIPDPNISFSHQPSSPVTPYVASPSQRHSISACQ

Query:  PSASPSS-----STPNNVVPSPSSPPILPSHPMITRVKTDIFKPKVWLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQ-N
         S +P S       P   + + ++   L +H M TR K  I KP    S++ +  +  EP   + AL    W+ AM  E  A + N TW LVP  PS   
Subjt:  PSASPSS-----STPNNVVPSPSSPPILPSHPMITRVKTDIFKPKVWLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQ-N

Query:  VFGCKWVFRIKRNSDGSV
        + GC+W+F  K NSDGS+
Subjt:  VFGCKWVFRIKRNSDGSV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 6.0e-61 | 27.4
Query:  LNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVVDQLLLGWL
        +N     KL   NYL+W      +   Y L G L G  P P   + T                 +AVP             +NP+Y  W   D+L+   +
Subjt:  LNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLTINPEYESWLVVDQLLLGWL

Query:  YNSMTPEVATQVMGYENAKDLWAAIQEFL--ESSLEPKKIIFVRYFNRLV-------------KVLLGLDEEYNVVVAMIQGK-IGISLSEMQAKLLVFE
          +++  V   V     A  +W  +++     S     ++ F+  F++L              +VL  L ++Y  V+  I  K    SL+E+  +L+  E
Subjt:  YNSMTPEVATQVMGYENAKDLWAAIQEFL--ESSLEPKKIIFVRYFNRLV-------------KVLLGLDEEYNVVVAMIQGK-IGISLSEMQAKLLVFE

Query:  KRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGRNRGRGRWNGNNNNRLI---CQICGKNGYSA-----LAFYQRFDKQ
         +L   NS          I+ N+ T  +  +   Q N   NR   NNN R    +    G  + N   +     CQIC   G+SA     L  +Q    Q
Subjt:  KRLEIQNSQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGRNRGRGRWNGNNNNRLI---CQICGKNGYSA-----LAFYQRFDKQ

Query:  ----------------FIGPSQNINRNISPGG------NSFQNGGTSQMSPVAPQAFMTTQNTNPFV----ASLET----------VIDPNC-KNLVSVS
                         +    N N  +   G      + F N    Q         +   +T P      ASL T          +  PN  KNL+SV 
Subjt:  ----------------FIGPSQNINRNISPGG------NSFQNGGTSQMSPVAPQAFMTTQNTNPFV----ASLET----------VIDPNC-KNLVSVS

Query:  KLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVWHRRLGHSMQIWK----
        +L   N V +EF   S  VKD++T   +L+G  KD LY+        AS    S   S      HS++ +   + ++ +  SV      HS+ +      
Subjt:  KLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSGIYANVLVSKSVWHRRLGHSMQIWK----

Query:  -VSCSSFSKLKSHAI----------APFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFITMIKTQFNESVKVLQSDNGG
         +SCS     KSH +           P + I++D+W  +P++S D YRYYV F+D F+R+ W+YPLK KS        F ++++ +F   +  L SDNGG
Subjt:  -VSCSSFSKLKSHAI----------APFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFITMIKTQFNESVKVLQSDNGG

Query:  EYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPVLHGKSPIELLLKSKIECYGLTGRETSG
        E+  +    +Q+GI    S P+T   NG +ERKHRH+VE  LTLL+ AS+  +YW  AF++A +LIN LPTP+L  +SP + L                 
Subjt:  EYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPVLHGKSPIELLLKSKIECYGLTGRETSG

Query:  KEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWL------TAERKEEENARRGSSAANTRGREKES
                                  PP        N+E                + +  G A   WL        E K ++ A  G S   +       
Subjt:  KEEGIRGRDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWL------TAERKEEENARRGSSAANTRGREKES

Query:  PAHKRHRCLS-SDGKVIISRHVKLNESDFPFA-TGFGSSLNNSQIFAYTKSNPPLHNWLHE-----PQISCSNDL--SSPFVP--PSLITSTSPSTSSNP
             + CL    G++  SRHV+ +E  FPF+ T FG S +  Q  + +  N P H  L       P   C      +SP  P  PS + +T  S+S+ P
Subjt:  PAHKRHRCLS-SDGKVIISRHVKLNESDFPFA-TGFGSSLNNSQIFAYTKSNPPLHNWLHE-----PQISCSNDL--SSPFVP--PSLITSTSPSTSSNP

Query:  TVSPSPPFTPQSAQLSTN---PSTRPFITPPPDTVSLI---PDPVNHIPDPNISFSHQPSSPV-TPYVASPSQRHSISACQPSASPSSSTPNNVVPSPSS
        + S S P + +    S N   P+ +P  T   ++ S I   P+P +  P+     S  P SP+ +P++ +PS   SIS     +S S+STP  + P   +
Subjt:  TVSPSPPFTPQSAQLSTN---PSTRPFITPPPDTVSLI---PDPVNHIPDPNISFSHQPSSPV-TPYVASPSQRHSISACQPSASPSSSTPNNVVPSPSS

Query:  PPILP--------SHPMITRVKTDIFKPKVWLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLV-PLSPSQNVFGCKWVFRIKRNSDG
        PPI+         +H M TR K  I KP    S + +  +  EP   + A+    W+ AM  E  A + N TW LV P  PS  + GC+W+F  K NSDG
Subjt:  PPILP--------SHPMITRVKTDIFKPKVWLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLV-PLSPSQNVFGCKWVFRIKRNSDG

Query:  SV
        S+
Subjt:  SV

Arabidopsis top hits | e value | % identity | Alignment
ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | 4.3e-14 | 45.88
Query:  MITRVKTDIFKPKVWLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGSV
        M+TR K  I K     S++ T    +EP  V+ AL  P W  AM  E  AL RN+TW LVP   +QN+ GCKWVF+ K +SDG++
Subjt:  MITRVKTDIFKPKVWLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLRNQTWCLVPLSPSQNVFGCKWVFRIKRNSDGSV


Sequences
CDS sequence
ATGGCCAATGTCTTGTCTTCTTCCTCAGTCAACATGGCTGCGTCTGCAAGCTTACCAATTTTTAGTAGCCCTCCTTTAAATCAGCTCTTGAATCAAATAACCACAGTCAA
GTTGGAAAGAGGTAATTATTTGTTGTGGAAAAATTTGGCTCTTCTCATCCTTCGAAGTTACCGTTTGGAAGGGCACTTGTCTGGGGACAAGCCTTGCCCCTCCAAATTTC
TTTCTACTTCTCAGGTGATCACTACAAATGTTACAAATGAAGCGGGATCGTCAAGCTCTGAAGCTGTCCCGCTAGAAAACTCAGCCCAATCCTCAGCTGTGACTCTTACT
ATCAATCCAGAATATGAAAGTTGGTTGGTAGTGGATCAACTCCTTCTTGGTTGGCTTTACAACTCAATGACGCCTGAGGTGGCAACCCAAGTTATGGGGTATGAAAATGC
TAAAGACCTTTGGGCTGCTATTCAAGAGTTTTTGGAATCCAGCCTCGAGCCAAAGAAGATTATCTTTGTCAGGTATTTCAACAGACTCGTAAAGGTCCTCTTAGGACTCG
ATGAAGAATATAATGTCGTAGTAGCCATGATCCAAGGAAAAATTGGAATATCCTTGTCCGAGATGCAAGCCAAACTGCTGGTTTTTGAGAAAAGGTTGGAAATTCAGAAC
TCACAAAGGTCCTCTACAACCTTTGGCTCAACAATCTCTGTAAACATGGCAACAAAAGGAGATGGTCAATCTGGGTCGAGGCAGAAAAACTTTTCTATAAATCGACAACA
GTACAACAACAATCAACGTGGAGGAGGAGGTAGAAATCGAGGGCGAGGGCGCTGGAATGGCAATAATAACAACCGACTCATCTGTCAAATTTGTGGCAAAAATGGGTATT
CTGCACTAGCTTTCTACCAGAGGTTTGACAAACAATTTATTGGTCCTAGTCAGAATATTAATCGGAATATTAGTCCTGGTGGAAACAGCTTCCAAAATGGGGGCACAAGT
CAAATGAGTCCAGTAGCTCCTCAGGCCTTCATGACTACTCAAAATACCAATCCGTTCGTCGCCAGTCTAGAAACCGTTATTGATCCAAATTGCAAAAATCTTGTTAGTGT
CTCAAAGCTTGCCCAAGATAACAATGTCTATATTGAATTTCACGCTGACTCTTGTCTTGTTAAGGACATTCACACGGATAAAGTGGTGCTGAGGGGAGTTCTTAAAGATG
GTTTGTATCAGCTTGGAACGAGAATCACTGGTAGTGCTTCAGGTTCAGTAGGTAGTAACTTGAAGTCGGCTAATAAATCAGTTCCTCACTCTGCCTTTATTACCTCAGGC
ATCTATGCTAATGTATTGGTGTCCAAATCAGTTTGGCATAGAAGGCTTGGGCACTCCATGCAAATTTGGAAAGTCTCATGCTCTTCCTTTTCCAAACTCAAATCTCATGC
TATTGCTCCCTTTGACTTGATTCACACTGATCTGTGGGGTCCAGCTCCAGTTATGTCTACAGATGGTTATCGTTATTACGTACATTTTCTTGATGATTTTAGCCGTTTTG
TTTGGGTGTATCCACTCAAATTGAAGAGCGACACAGTTGCAGCAATTACCCATTTTATTACTATGATAAAAACTCAGTTTAACGAATCCGTTAAAGTCTTGCAGTCTGAT
AATGGAGGAGAATACAAAAAAGTGCACCAGCTTTGTGCCCAGAATGGTATTCAGTCTCGCCTCTCTTGTCCCTATACTTCGGCCAAAAATGGGAGAGCAGAACGCAAACA
TCGACATCTTGTTGAGACAAGCTTAACCCTACTTGCTCAAGCTTCAATGTCGCTCTCTTATTGGTGGGAAGCGTTCACAATTGCTACATTTCTTATTAATGGGCTACCTA
CTCCAGTTCTACACGGGAAGTCTCCCATAGAGTTATTACTGAAATCGAAAATTGAATGTTATGGACTTACGGGAAGAGAGACATCGGGCAAGGAAGAAGGGATTCGTGGG
AGAGACACGCCAAAGAAGAGGAGAAACTCACCTATTGGCCGAATAATGAAGCCGTCGCCGCCCACTAAGCTGGAGCGACGCCGTCGGAATCACGAGATGAACCTGTCGCC
GACGTCGGAGCTGCGGGAGGATAGCTGGCCGGACGAAGAAGAAAAACGGGGATGGGCTGCTGGGAGGTGGCTCACGGCTGAACGAAAAGAAGAAGAAAATGCAAGGAGAG
GGAGTTCGGCGGCTAACACGAGAGGAAGGGAGAAAGAGAGCCCAGCCCACAAACGCCATAGGTGCCTCAGTTCAGATGGTAAAGTAATTATTTCTCGCCATGTGAAGCTT
AATGAATCTGATTTTCCCTTCGCTACAGGTTTTGGGTCGTCCCTCAATAATTCCCAAATATTTGCCTATACTAAGTCCAACCCACCATTACACAACTGGCTTCATGAACC
TCAAATAAGCTGCTCAAATGATTTATCCAGTCCGTTTGTACCCCCCAGCCTGATTACATCTACCAGCCCATCTACGTCATCCAACCCAACTGTGTCTCCCAGCCCACCCT
TTACTCCCCAGTCTGCACAGTTGTCCACTAACCCTTCTACTCGTCCATTCATCACTCCACCACCCGACACAGTTAGTCTTATACCCGACCCAGTTAATCATATACCTGAT
CCAAATATCTCATTTAGTCATCAACCCTCTTCTCCTGTCACGCCATATGTCGCATCACCATCTCAACGTCACTCTATTTCTGCGTGCCAACCTTCAGCCTCACCTTCCTC
TTCAACTCCAAACAATGTTGTTCCCTCACCCTCATCTCCTCCAATTTTGCCTAGCCATCCAATGATTACACGTGTGAAGACCGACATCTTTAAGCCAAAAGTGTGGCTCT
CTGTCTCTCCGACTGATTGGTCTACTCGCGAACCCAGTCGTGTTGTTGATGCCTTGGTAACTCCGGTATGGAAAGCCGCTATGGATGTTGAGTTTCAGGCACTTCTACGG
AATCAAACATGGTGTCTAGTTCCACTTTCACCGTCTCAAAATGTTTTTGGGTGCAAATGGGTTTTCCGGATCAAGAGGAATTCAGATGGTTCAGTCTAG
mRNA sequence
ATGGCCAATGTCTTGTCTTCTTCCTCAGTCAACATGGCTGCGTCTGCAAGCTTACCAATTTTTAGTAGCCCTCCTTTAAATCAGCTCTTGAATCAAATAACCACAGTCAA
GTTGGAAAGAGGTAATTATTTGTTGTGGAAAAATTTGGCTCTTCTCATCCTTCGAAGTTACCGTTTGGAAGGGCACTTGTCTGGGGACAAGCCTTGCCCCTCCAAATTTC
TTTCTACTTCTCAGGTGATCACTACAAATGTTACAAATGAAGCGGGATCGTCAAGCTCTGAAGCTGTCCCGCTAGAAAACTCAGCCCAATCCTCAGCTGTGACTCTTACT
ATCAATCCAGAATATGAAAGTTGGTTGGTAGTGGATCAACTCCTTCTTGGTTGGCTTTACAACTCAATGACGCCTGAGGTGGCAACCCAAGTTATGGGGTATGAAAATGC
TAAAGACCTTTGGGCTGCTATTCAAGAGTTTTTGGAATCCAGCCTCGAGCCAAAGAAGATTATCTTTGTCAGGTATTTCAACAGACTCGTAAAGGTCCTCTTAGGACTCG
ATGAAGAATATAATGTCGTAGTAGCCATGATCCAAGGAAAAATTGGAATATCCTTGTCCGAGATGCAAGCCAAACTGCTGGTTTTTGAGAAAAGGTTGGAAATTCAGAAC
TCACAAAGGTCCTCTACAACCTTTGGCTCAACAATCTCTGTAAACATGGCAACAAAAGGAGATGGTCAATCTGGGTCGAGGCAGAAAAACTTTTCTATAAATCGACAACA
GTACAACAACAATCAACGTGGAGGAGGAGGTAGAAATCGAGGGCGAGGGCGCTGGAATGGCAATAATAACAACCGACTCATCTGTCAAATTTGTGGCAAAAATGGGTATT
CTGCACTAGCTTTCTACCAGAGGTTTGACAAACAATTTATTGGTCCTAGTCAGAATATTAATCGGAATATTAGTCCTGGTGGAAACAGCTTCCAAAATGGGGGCACAAGT
CAAATGAGTCCAGTAGCTCCTCAGGCCTTCATGACTACTCAAAATACCAATCCGTTCGTCGCCAGTCTAGAAACCGTTATTGATCCAAATTGCAAAAATCTTGTTAGTGT
CTCAAAGCTTGCCCAAGATAACAATGTCTATATTGAATTTCACGCTGACTCTTGTCTTGTTAAGGACATTCACACGGATAAAGTGGTGCTGAGGGGAGTTCTTAAAGATG
GTTTGTATCAGCTTGGAACGAGAATCACTGGTAGTGCTTCAGGTTCAGTAGGTAGTAACTTGAAGTCGGCTAATAAATCAGTTCCTCACTCTGCCTTTATTACCTCAGGC
ATCTATGCTAATGTATTGGTGTCCAAATCAGTTTGGCATAGAAGGCTTGGGCACTCCATGCAAATTTGGAAAGTCTCATGCTCTTCCTTTTCCAAACTCAAATCTCATGC
TATTGCTCCCTTTGACTTGATTCACACTGATCTGTGGGGTCCAGCTCCAGTTATGTCTACAGATGGTTATCGTTATTACGTACATTTTCTTGATGATTTTAGCCGTTTTG
TTTGGGTGTATCCACTCAAATTGAAGAGCGACACAGTTGCAGCAATTACCCATTTTATTACTATGATAAAAACTCAGTTTAACGAATCCGTTAAAGTCTTGCAGTCTGAT
AATGGAGGAGAATACAAAAAAGTGCACCAGCTTTGTGCCCAGAATGGTATTCAGTCTCGCCTCTCTTGTCCCTATACTTCGGCCAAAAATGGGAGAGCAGAACGCAAACA
TCGACATCTTGTTGAGACAAGCTTAACCCTACTTGCTCAAGCTTCAATGTCGCTCTCTTATTGGTGGGAAGCGTTCACAATTGCTACATTTCTTATTAATGGGCTACCTA
CTCCAGTTCTACACGGGAAGTCTCCCATAGAGTTATTACTGAAATCGAAAATTGAATGTTATGGACTTACGGGAAGAGAGACATCGGGCAAGGAAGAAGGGATTCGTGGG
AGAGACACGCCAAAGAAGAGGAGAAACTCACCTATTGGCCGAATAATGAAGCCGTCGCCGCCCACTAAGCTGGAGCGACGCCGTCGGAATCACGAGATGAACCTGTCGCC
GACGTCGGAGCTGCGGGAGGATAGCTGGCCGGACGAAGAAGAAAAACGGGGATGGGCTGCTGGGAGGTGGCTCACGGCTGAACGAAAAGAAGAAGAAAATGCAAGGAGAG
GGAGTTCGGCGGCTAACACGAGAGGAAGGGAGAAAGAGAGCCCAGCCCACAAACGCCATAGGTGCCTCAGTTCAGATGGTAAAGTAATTATTTCTCGCCATGTGAAGCTT
AATGAATCTGATTTTCCCTTCGCTACAGGTTTTGGGTCGTCCCTCAATAATTCCCAAATATTTGCCTATACTAAGTCCAACCCACCATTACACAACTGGCTTCATGAACC
TCAAATAAGCTGCTCAAATGATTTATCCAGTCCGTTTGTACCCCCCAGCCTGATTACATCTACCAGCCCATCTACGTCATCCAACCCAACTGTGTCTCCCAGCCCACCCT
TTACTCCCCAGTCTGCACAGTTGTCCACTAACCCTTCTACTCGTCCATTCATCACTCCACCACCCGACACAGTTAGTCTTATACCCGACCCAGTTAATCATATACCTGAT
CCAAATATCTCATTTAGTCATCAACCCTCTTCTCCTGTCACGCCATATGTCGCATCACCATCTCAACGTCACTCTATTTCTGCGTGCCAACCTTCAGCCTCACCTTCCTC
TTCAACTCCAAACAATGTTGTTCCCTCACCCTCATCTCCTCCAATTTTGCCTAGCCATCCAATGATTACACGTGTGAAGACCGACATCTTTAAGCCAAAAGTGTGGCTCT
CTGTCTCTCCGACTGATTGGTCTACTCGCGAACCCAGTCGTGTTGTTGATGCCTTGGTAACTCCGGTATGGAAAGCCGCTATGGATGTTGAGTTTCAGGCACTTCTACGG
AATCAAACATGGTGTCTAGTTCCACTTTCACCGTCTCAAAATGTTTTTGGGTGCAAATGGGTTTTCCGGATCAAGAGGAATTCAGATGGTTCAGTCTAG
Protein sequence
MANVLSSSSVNMAASASLPIFSSPPLNQLLNQITTVKLERGNYLLWKNLALLILRSYRLEGHLSGDKPCPSKFLSTSQVITTNVTNEAGSSSSEAVPLENSAQSSAVTLT
INPEYESWLVVDQLLLGWLYNSMTPEVATQVMGYENAKDLWAAIQEFLESSLEPKKIIFVRYFNRLVKVLLGLDEEYNVVVAMIQGKIGISLSEMQAKLLVFEKRLEIQN
SQRSSTTFGSTISVNMATKGDGQSGSRQKNFSINRQQYNNNQRGGGGRNRGRGRWNGNNNNRLICQICGKNGYSALAFYQRFDKQFIGPSQNINRNISPGGNSFQNGGTS
QMSPVAPQAFMTTQNTNPFVASLETVIDPNCKNLVSVSKLAQDNNVYIEFHADSCLVKDIHTDKVVLRGVLKDGLYQLGTRITGSASGSVGSNLKSANKSVPHSAFITSG
IYANVLVSKSVWHRRLGHSMQIWKVSCSSFSKLKSHAIAPFDLIHTDLWGPAPVMSTDGYRYYVHFLDDFSRFVWVYPLKLKSDTVAAITHFITMIKTQFNESVKVLQSD
NGGEYKKVHQLCAQNGIQSRLSCPYTSAKNGRAERKHRHLVETSLTLLAQASMSLSYWWEAFTIATFLINGLPTPVLHGKSPIELLLKSKIECYGLTGRETSGKEEGIRG
RDTPKKRRNSPIGRIMKPSPPTKLERRRRNHEMNLSPTSELREDSWPDEEEKRGWAAGRWLTAERKEEENARRGSSAANTRGREKESPAHKRHRCLSSDGKVIISRHVKL
NESDFPFATGFGSSLNNSQIFAYTKSNPPLHNWLHEPQISCSNDLSSPFVPPSLITSTSPSTSSNPTVSPSPPFTPQSAQLSTNPSTRPFITPPPDTVSLIPDPVNHIPD
PNISFSHQPSSPVTPYVASPSQRHSISACQPSASPSSSTPNNVVPSPSSPPILPSHPMITRVKTDIFKPKVWLSVSPTDWSTREPSRVVDALVTPVWKAAMDVEFQALLR
NQTWCLVPLSPSQNVFGCKWVFRIKRNSDGSV