; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018246 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018246
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr5:20285026..20289504
RNA-Seq ExpressionLag0018246
SyntenyLag0018246
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN61322.1 hypothetical protein VITISV_012106 [Vitis vinifera]2.3e-30542.3Show/hide
Query:  STPRSISTPSSSAANIVSSSFG-----HPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEP---NPSYIDWTTVD
        STP   S+ S S  +  SS+         L+  L VKLD  NY+LWR  +  ++     + ++ GT   P           K L P   NP+++ W   D
Subjt:  STPRSISTPSSSAANIVSSSFG-----HPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEP---NPSYIDWTTVD

Query:  QALLGWLFGSMSPAIAADVVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYM
        + +L W++ S++P I A ++   TS   W ALE ++ ++S+AR+ QLR  LQ+TKKG+M M++++  +K AA+NL   G P+S  D    +L GL SDY 
Subjt:  QALLGWLFGSMSPAIAADVVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYM

Query:  PIVCTIEDKDIS-TWQELSSILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQ
         +V  I  +D   + + + S+L+ FE  L + S       +  ++A+   S  N    G++F+ GRG          Q  + +NN    RGRGRG  N Q
Subjt:  PIVCTIEDKDIS-TWQELSSILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQ

Query:  --RSNNS---KPTCQLCGKYGHSAPYCYQRFEETF-----------NNPH-------AASSSNNQGYQG-----------IADGGNFATKSKYTGNKCLT
          R N+S   KP CQLCGK+GH+A  CY RF+ +F           NN +        AS+SNN   +              + GN  + S YTG   +T
Subjt:  --RSNNS---KPTCQLCGKYGHSAPYCYQRFEETF-----------NNPH-------AASSSNNQGYQG-----------IADGGNFATKSKYTGNKCLT

Query:  VGNGNKLDIEHTGSSSVACGNKTLKLNNMLHVPHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHV
        +GNG  L I + GS  +     + +L  + HVP I  NL+S+A+  + N+  ++F+S   FVKD  ++ V+ +G L++GLY+            P   ++
Subjt:  VGNGNKLDIEHTGSSSVACGNKTLKLNNMLHVPHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHV

Query:  AVSHSNNSCCPSLSSPKSVVESKL-LWHRRLGHASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTS
            S N+     S   S VE+K  LWH RLGHAS  IV  V+ +CN+      K   C  CQL KSHRLP   SNF    PLELVY D+WGP+ + STS
Subjt:  AVSHSNNSCCPSLSSPKSVVESKL-LWHRRLGHASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTS

Query:  GFRFYISFVDNFTRLTHIFPLKHKGEALSAFHQYKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSML
        G R++I FVD+++R T  + L+ K +AL  F  +K  +EN+F+  IK LQ+D GGEFRSFTS L+  GI  R SCP+ S QNG VERKHRH+VE GL++L
Subjt:  GFRFYISFVDNFTRLTHIFPLKHKGEALSAFHQYKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSML

Query:  AQSSLSLQYWWEAFSSAVFLINRLPSPVLHNLSPWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLS-STGRIYISR
        + +SL ++YW  AF +  FLINR+PS VL   SP+   F + PD+   R FGC C+P +RPY +HK Q+ S +C+FLGYS NHKG+ CL  +TGR+YI+ 
Subjt:  AQSSLSLQYWWEAFSSAVFLINRLPSPVLHNLSPWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLS-STGRIYISR

Query:  HVVFNETEFPFQHGFVSSSSSAVQPSNILVHCLPFPHASNISSSLPSPAP-AAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELS
        HVVF+E+ FP      SSSS+           +  P       S P   P +  + A ID  S+  + SP    +P        SS+ LDT        S
Subjt:  HVVFNETEFPFQHGFVSSSSSAVQPSNILVHCLPFPHASNISSSLPSPAP-AAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELS

Query:  SSSPQSVDASVSVSSASLPSSPHVDQSAPAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLS-PHQLRRHFSPLIGRVRWIQRIQALRKNDTWSLVP
        SSSP     ++ +S  S+P  P +   AP      M TR+  GI + K      + +D+  + +S P  L++ F            I AL +N TW LV 
Subjt:  SSSPQSVDASVSVSSASLPSSPHVDQSAPAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLS-PHQLRRHFSPLIGRVRWIQRIQALRKNDTWSLVP

Query:  PPSHVNLVGCKWVFKVKRQSDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDP
         P +VN++GCKWV+K+K + DG  +R KARLVAKG++Q  G+D+ ETFSPVVKA TIRIILT+ ++  W +RQLD++NAFLNG LEE V+M QP GY DP
Subjt:  PPSHVNLVGCKWVFKVKRQSDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDP

Query:  SCPTFVCKLNKALYGLKQAPRAWYDRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVE
          P  VC+L KALYGLKQAPRAW+ RL S LL+WGF  S+ DSS+F        +IVL+YV DIL+TG  S  I+  +++L+ +FAL+D+G LS++LG+E
Subjt:  SCPTFVCKLNKALYGLKQAPRAWYDRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVE

Query:  V------------------------FRTADA------------FDGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRY
        V                        F T  A            FDGDPM++ T YRSV+GALQY+T TRPDI+F VN+  QF+QQPT  HW  VKRILRY
Subjt:  V------------------------FRTADA------------FDGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRY

Query:  LKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWC
        L+GTM+ GL    SS+L++  FTDADW   LDDRRS + + V+LGG+LVSWSS KQ VV+RSSAESEYR L   TAE++W+Q+LL EL +   + P++W 
Subjt:  LKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWC

Query:  DNTSAISLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSPSRLRGD
        DN SA  +A+NPV H+RTKH+E+D+HF+RD+V+  KI +++VP+ EQ  D LTK L+ S F S++S+L +   P  LRGD
Subjt:  DNTSAISLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSPSRLRGD

GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]0.0e+0042.83Show/hide
Query:  ANIVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIAADVVS
        A+   S+  + L + ++VKLD NNY LW+ +VL ++RG  +DGY+LGT+  P EF   +T +      N ++++W   DQ LLGW+  SM+  IA  ++ 
Subjt:  ANIVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIAADVVS

Query:  FTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDKDISTWQELSSIL
          TS+++W   + + GA +++++  L+    + +KG MKM ++L  MK   + LKLAGNP+S SDL    L GLDS+Y P+V  + D+   +W +L + L
Subjt:  FTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDKDISTWQELSSIL

Query:  ITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYGHSAPYC
        +TFE  +        +N L +L  + T +  N+S+       G+ + NN  G        SN+RG   GRGRG+       + K  CQ+CG   H A  C
Subjt:  ITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYGHSAPYC

Query:  YQRFEETF--------------NNPHAASSSNNQGYQGIADGG----------NFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACGNKTLKLNNMLHV
        + RF++T+              +N   AS ++ + Y    D G           F   +++ G   L VGNG KL I  TGSS +    K+L L+++L+V
Subjt:  YQRFEETF--------------NNPHAASSSNNQGYQGIADGG----------NFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACGNKTLKLNNMLHV

Query:  PHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVVESKLLWHRRLGH
        P+I +NL+S+++L A N++ V+F+   CFVKDK + +V+L+G LKDGLYQL                            +  +P + V  K  WHRRLGH
Subjt:  PHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVVESKLLWHRRLGH

Query:  ASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPLKHKGEALSAFHQ
         + K++D V+ SC +K   +D   FC+ACQ GK H LPF  S+     PLELV+ D+WGP+P+ ++SGF++Y+ FVD+F+R T I+PLK K E + AF Q
Subjt:  ASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPLKHKGEALSAFHQ

Query:  YKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLINRLPSPVLHNLS
        +K L EN+F K IKV+Q D GGE++         GIQFR SCP+TSQQNG  ERKHRHI E GL++LAQ+ + L YWWEAFS+AV+LINRLPS V  N S
Subjt:  YKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLINRLPSPVLHNLS

Query:  PWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSSTGRIYISRHVVFNETEFPFQHGFVSSSSSAVQPSNILVHCLP
        P+     + PD+  L+TFGCAC+PCL+PY  HK Q+H+++CVFLGYS +HKGYKCL+S GRI+ISRHV+FNE  FPF  GF+++ S      N+     P
Subjt:  PWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSSTGRIYISRHVVFNETEFPFQHGFVSSSSSAVQPSNILVHCLP

Query:  FPHASNISSSLPSPAPAAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELSSSSPQ-----SVDASVSVSSASLPSSPHVDQSAPA
           A N+                ID +S+P                LE  + A +T    SQ+++S + Q     S D +    +  +     V +++  
Subjt:  FPHASNISSSLPSPAPAAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELSSSSPQ-----SVDASVSVSSASLPSSPHVDQSAPA

Query:  IGT-HPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWIQRIQALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQSDGCFDRCKAR
          T H + TR+ SGI +PK    ++   +     + P   +   S  + +    +  +AL  N TW LVP  +  N+V  KWVFK K + DG  +R KAR
Subjt:  IGT-HPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWIQRIQALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQSDGCFDRCKAR

Query:  LVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQAPRAWYDRLRST
        LVAKGF Q  GID+ ETFSPV+KA T+RIIL++ V   W VRQLDINNAFLNG L+E VFM QP+G+VD + P  +CKL+KA+YGLKQAPRAW+D L++ 
Subjt:  LVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQAPRAWYDRLRST

Query:  LLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFRTADAF----------------------
        LL WGF N+K+DSS+F       I  +LIYV DI++TG   +F+  F+ +LND F+LKD+G L Y+LG+EV R A                         
Subjt:  LLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFRTADAF----------------------

Query:  -------------DGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSL
                     +G+ + +PT++R  +G LQYLTHT PDI+F VN+LSQ++  P+  HW G+KRILRYL+GT+ Y L +  S+ L +  F+DADWA S+
Subjt:  -------------DGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSL

Query:  DDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWCDNTSAISLARNPVLHSRTKHVELDIHFVRDK
        DDR+S++  CVFLG +L+SWSS+KQ VV+RSS ESEYR+LA   AE+ WI+SLLTEL L     PI+WCDN SA +LA NPVLH+R+KH+E+D+H++RD+
Subjt:  DDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWCDNTSAISLARNPVLHSRTKHVELDIHFVRDK

Query:  VLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSP
        VL+ ++ V YVP+++Q+AD LTKPLS + FS +R KLGV+LSP
Subjt:  VLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSP

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]0.0e+0042.82Show/hide
Query:  SSAANIVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIAAD
        SSAAN   S   + L ++++VKLD +NY LW+ +VL+++RG  +DGY+LGT   P +F   VT   K  + NP + DW   DQALLGWL  SM+  IA  
Subjt:  SSAANIVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIAAD

Query:  VVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDKDISTWQELS
        ++   TS+++W   + + GA +K+R+  L+    NT+KG MKM E+L  MK  ++ LKLAG+PIS SDL    L GLD++Y P+V  + D+   +W ++ 
Subjt:  VVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDKDISTWQELS

Query:  SILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYGHSA
        + L+ FE  L +++  S +     L A   F+ + +          RGN+ N  GN +++SNF   RG   GRG+G     R +N+K  CQ+C   GH A
Subjt:  SILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYGHSA

Query:  PYCYQRFEETFNNPHAASSSNNQG--------------YQGIADGG----------NFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACGNKTLKLNNM
          C  RF+  +   + ++ ++ QG              Y+   D G           F   +++ G   L VGNG KL I  +GS+ +      L L+++
Subjt:  PYCYQRFEETFNNPHAASSSNNQG--------------YQGIADGG----------NFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACGNKTLKLNNM

Query:  LHVPHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVVESKLLWHRR
        L+VP I +NL+S+++LTA N++ V+F++  C VKDK + + +L+G LKDGLYQL                     SN   C  +S  +S       WHR+
Subjt:  LHVPHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVVESKLLWHRR

Query:  LGHASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPLKHKGEALSA
        LGH + K++D V++ CN+K + +D+  FC+ACQ GK H LPF  S+     PL L++ D+WGP+P+ S SGF++Y+ F+D+F+R T IFPLK K + + A
Subjt:  LGHASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPLKHKGEALSA

Query:  FHQYKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLINRLPSPVLH
        F Q+K L EN+F K IK++Q D GGE+++        GIQFR SCP+TSQQNG  ERKHRH+ E+GL++LAQ+ + L+YWWEAFS+AV+LINRLPS V  
Subjt:  FHQYKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLINRLPSPVLH

Query:  NLSPWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSSTGRIYISRHVVFNETEFPFQHGFVSSSSSAVQPSNILVH
        N SP+   F + PD++ L+ FGCAC+PCL+PY  HK QFH+++CVF+GYS +HKGYKC++S GRI++SRHV+FNE  FPF  GF+ + +    P   L  
Subjt:  NLSPWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSSTGRIYISRHVVFNETEFPFQHGFVSSSSSAVQPSNILVH

Query:  CLPFPHASNISSSLPSPAPAAATQADI--DRSSVPCAPSPAVDVLPNHFNDLELSSAA--LDTAVVPSQEL-SSSSPQSVDASVSVSSASLPSSPHVDQS
                N S  LP+ +  A TQ  I  D ++     + +++   N+ N+ ++ S+   ++T    +Q++ + +S  S D + S  + ++      D S
Subjt:  CLPFPHASNISSSLPSPAPAAATQADI--DRSSVPCAPSPAVDVLPNHFNDLELSSAA--LDTAVVPSQEL-SSSSPQSVDASVSVSSASLPSSPHVDQS

Query:  APAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWIQRI----QALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQSDGCF
             TH M+TR+  GI +PK     ++  D      S  +  +     +GR  W + +    +AL  N TW+LVP     N++  KW+FK K +SDG  
Subjt:  APAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWIQRI----QALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQSDGCF

Query:  DRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQAPRAWY
        +R KARLVAKGF Q  G+DF ETFSPVVK+ T+RIILT+ V   W VRQLDINNAFLNG+L+E VFM QP+GY+D + P  +CKL+KA+YGLKQAPRAWY
Subjt:  DRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQAPRAWY

Query:  DRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFR---------------------
        D LRSTL+ WGF N+K D+S+FF   +     +LIYV DI++TG   +F+  F ++LN  ++LKD+G L Y+LGVEV R                     
Subjt:  DRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFR---------------------

Query:  ------------TADAF--DGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDA
                    T   F  +G+ MSNPTLYR  +GALQYLT+TRPDI+F VN+LSQ++  PT  HW G+KRILRYL+GT  + L +  S++L +  F DA
Subjt:  ------------TADAF--DGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDA

Query:  DWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQ-------------------SLLTELSLSSASTPIIWCDNTSAI
        DWA S DDR+S    CVFLG +LVSW+S+KQ VV+RSS ESEYRSLA   AEV                       +LL EL L     P++WCDN SA 
Subjt:  DWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQ-------------------SLLTELSLSSASTPIIWCDNTSAI

Query:  SLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSPS
        +LA NPV+H+R+KH+E+D+H++RD+VLE K+ + YVP+++Q+AD LTKPL  + F+ MR KLGV +SPS
Subjt:  SLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSPS

RVW64314.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.7e-30541.28Show/hide
Query:  PSSSAANIVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIA
        PSS+ A        H     L +KLD NNY+LWR  +  ++     + ++ G K  P +     T +G   E NP ++ W   D+ +L W++ S++P I 
Subjt:  PSSSAANIVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIA

Query:  ADVVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDK-DISTWQ
          +V + +S   W ALE+++ A+S+ARV QLR   Q T+KG++ M+E++  +K  A+NL   G P++  D    +L GL +DY  IV ++  + D  +  
Subjt:  ADVVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDK-DISTWQ

Query:  ELSSILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYG
         + SIL+T E    R S  ++V E   ++A+L   +    N            N +   Q +QS F+  RG + GR        +S+  +P CQLCGK+G
Subjt:  ELSSILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYG

Query:  HSAPYCYQRFEETF------------NNPHA--------ASSSNNQGYQGIADGG----------NFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACG
        H+   CY RF+  F            N P+A        AS S         D G            +    Y GN  + VGNG  L I HTG++     
Subjt:  HSAPYCYQRFEETF------------NNPHA--------ASSSNNQGYQGIADGG----------NFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACG

Query:  NKTLKLNNMLHVPHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVV
        +KT +L  +LHVP I  NL+S+++  A N+ + +F+  F FVKD+ +++++L+G+L+ GLY+     +     SPA+F   VS S +      SS  S+ 
Subjt:  NKTLKLNNMLHVPHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVV

Query:  ESKLLWHRRLGHASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPL
         +  LWH RLGH +  I+  ++ SCN+    + K   C ACQ  KSH+LPF+ S  R  HPL L++ DLWGP+ + ST+G R++I FVD+F+R + I+PL
Subjt:  ESKLLWHRRLGHASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPL

Query:  KHKGEALSAFHQYKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLI
          K +ALS F ++K+LVEN+F   I+ L++D GGEF++F+S L   GI+ + SCP+T +QNG  ERK RHI+E GL++LA +SL  ++W  AF +A+FLI
Subjt:  KHKGEALSAFHQYKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLI

Query:  NRLPSPVLHNLSPWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSS-TGRIYISRHVVFNETEFPFQHGFVSSSSS
        NRLP+ VL+  SP++  F + P++   + FGC C+P +RPY  +K  + SS+CVFLGYS NHKGY CL+  TGR+Y++RHVVF+ET FPFQ     SSS 
Subjt:  NRLPSPVLHNLSPWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSS-TGRIYISRHVVFNETEFPFQHGFVSSSSS

Query:  AVQPSNILVHCLPFPHASNISSSLPSPAPAAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELSSSSPQSVDASVSVSSASLPSSP
           P+   + C   P +S  S + PS +    T       S+P                        D   VP  ++S+S P   +              
Subjt:  AVQPSNILVHCLPFPHASNISSSLPSPAPAAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELSSSSPQSVDASVSVSSASLPSSP

Query:  HVDQSAPAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWI----QRIQALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQ
                   HPM TRA +GI + K +  F S +               F+  +    W+    +   AL++N+TW LVPPPS+ N++GCKWV+K+K +
Subjt:  HVDQSAPAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWI----QRIQALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQ

Query:  SDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQA
         DG  DR KARLVA+GF Q  G+D+ ETFSPVVKA TIRIIL + ++  W V QLD+ NAFL+G LEE VFM+QP G+++   P+ VCKLNKALYGLKQA
Subjt:  SDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQA

Query:  PRAWYDRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFRTADAF-----------
        PRAWY++L ++LL WGF  S+ADSS+F   ++  ++I+LIYV DIL+TG  S  ++ F++RLN  FAL+D+G ++Y+LG+EV R+   F           
Subjt:  PRAWYDRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFRTADAF-----------

Query:  -------------------------DGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSL
                                 DG+P S+ TLYRS +GALQYLT TRPDISF VN+  QF+  PT  HW  VKRILRYLKGT+ YG+ + +S+SL +
Subjt:  -------------------------DGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSL

Query:  FAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWCDNTSAISLARNPVLHSRTK
          +TDADWA   DDRRS   + +FLG +LVSWSS KQ VV+RSSAESEYR+LA  T+E++WIQ +L EL LSS+S P++WCDN SA  LA NPV H+RTK
Subjt:  FAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWCDNTSAISLARNPVLHSRTK

Query:  HVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSPSRLRGDVRTKVYD
        H+E+D+HF+RD VL K++ ++Y+PS+EQ+AD  TK +S S F S R+KL V+ SP  LRGD R  + D
Subjt:  HVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSPSRLRGDVRTKVYD

RVX06084.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]3.3e-30441.14Show/hide
Query:  PSSSAANIVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIA
        PSS+ A        H     L +KLD NNY+LWR  +  ++     + ++ G K  P +     T +G   E NP ++ W   D+ +L W++ S++P I 
Subjt:  PSSSAANIVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIA

Query:  ADVVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDK-DISTWQ
          +V + +S   W ALE+++ A+S+ARV QLR   Q T+KG++ M+E++  +K  A+NL   G P++  D    +L GL +DY  IV ++  + D  +  
Subjt:  ADVVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDK-DISTWQ

Query:  ELSSILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYG
         + SIL+T E    R S  ++V E   ++A+L   +    N            N +   Q +QS F+  RG + GR        +S+  +P CQLCGK+G
Subjt:  ELSSILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYG

Query:  HSAPYCYQRFEETF------------NNPHA--------ASSSNNQGYQGIADGG----------NFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACG
        H+   CY RF+  F            N P+A        AS S         D G            +    Y GN  + VGNG  L I HTG++     
Subjt:  HSAPYCYQRFEETF------------NNPHA--------ASSSNNQGYQGIADGG----------NFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACG

Query:  NKTLKLNNMLHVPHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVV
        +KT +L  +LHVP I  NL+S+++  A N+ + +F+  F FVKD+ +++++L+G+L+ GLY+     +     SPA+F   VS S +      SS  S+ 
Subjt:  NKTLKLNNMLHVPHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVV

Query:  ESKLLWHRRLGHASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPL
         +  LWH RLGH ++ I+  ++ SCN+    + K   C ACQ  KSH+LPF+ S  R  HPL L++ DLWGP+ + ST+G R++I FVD+F+R + I+PL
Subjt:  ESKLLWHRRLGHASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPL

Query:  KHKGEALSAFHQYKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLI
          K +ALS F ++K+LVEN+F   I+ L++D GGEF++F+S L   GI+ + SCP+T +QNG  ERK RHI+E GL++LA +SL  ++W  AF + +FLI
Subjt:  KHKGEALSAFHQYKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLI

Query:  NRLPSPVLHNLSPWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSS-TGRIYISRHVVFNETEFPFQHGFVSSSSS
        NRLP+ VL+  SP++  F + P++   + FGC C+P +RPY  +K  + SS+CVFLGYS NHKGY CL+  TGR+Y++RHVVF+ET FPFQ     SSS 
Subjt:  NRLPSPVLHNLSPWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSS-TGRIYISRHVVFNETEFPFQHGFVSSSSS

Query:  AVQPSNILVHCLPFPHASNISSSLPSPAPAAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELSSSSPQSVDASVSVSSASLPSSP
           P+   + C   P +S  S + PS +    T       S+P                        D   VP  ++S+S P   +              
Subjt:  AVQPSNILVHCLPFPHASNISSSLPSPAPAAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELSSSSPQSVDASVSVSSASLPSSP

Query:  HVDQSAPAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWI----QRIQALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQ
                   HPM TRA +GI + K +  F S +               F+  +    W+    +   AL++N+TW LVPPPS+ N++GCKWV+K+K +
Subjt:  HVDQSAPAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWI----QRIQALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQ

Query:  SDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQA
         DG  DR KARLVA+GF Q  G+D+ ETFSPVVKA TIRIIL + ++  W V QLD+ NAFL+G LEE VFM+QP G+++   P+ VCKLNKALYGLKQA
Subjt:  SDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQA

Query:  PRAWYDRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFRTADAF-----------
        PRAWY++L ++LL WGF  S+ADSS+F   ++  ++I+LIYV DIL+TG  S  ++ F++RLN  FAL+D+G ++Y+LG+EV  +   F           
Subjt:  PRAWYDRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFRTADAF-----------

Query:  -------------------------DGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSL
                                 DG+P S+ TLYRS +GALQYLT TRPDISF VN+  QF+  PT  HW  VKRILRYLKGT+ YG+ + +S+SL +
Subjt:  -------------------------DGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSL

Query:  FAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWCDNTSAISLARNPVLHSRTK
          +TDADWA   DDRRS   + +FLG +LVSWSS KQ VV+RSSAESEYR+LA  T+E++WIQ +L EL LSS+S P++WCDN SA  LA NPV H+RTK
Subjt:  FAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWCDNTSAISLARNPVLHSRTK

Query:  HVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSPSRLRGDVRTKVYD
        H+E+D+HF+RD VL K++ ++Y+PS+EQ+AD  TK +S S F S R+KL V+ SP  LRGD R  + D
Subjt:  HVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSPSRLRGDVRTKVYD

TrEMBL top hitse value%identityAlignment
A0A2Z6MBG6 Integrase catalytic domain-containing protein0.0e+0042.83Show/hide
Query:  ANIVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIAADVVS
        A+   S+  + L + ++VKLD NNY LW+ +VL ++RG  +DGY+LGT+  P EF   +T +      N ++++W   DQ LLGW+  SM+  IA  ++ 
Subjt:  ANIVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIAADVVS

Query:  FTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDKDISTWQELSSIL
          TS+++W   + + GA +++++  L+    + +KG MKM ++L  MK   + LKLAGNP+S SDL    L GLDS+Y P+V  + D+   +W +L + L
Subjt:  FTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDKDISTWQELSSIL

Query:  ITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYGHSAPYC
        +TFE  +        +N L +L  + T +  N+S+       G+ + NN  G        SN+RG   GRGRG+       + K  CQ+CG   H A  C
Subjt:  ITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYGHSAPYC

Query:  YQRFEETF--------------NNPHAASSSNNQGYQGIADGG----------NFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACGNKTLKLNNMLHV
        + RF++T+              +N   AS ++ + Y    D G           F   +++ G   L VGNG KL I  TGSS +    K+L L+++L+V
Subjt:  YQRFEETF--------------NNPHAASSSNNQGYQGIADGG----------NFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACGNKTLKLNNMLHV

Query:  PHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVVESKLLWHRRLGH
        P+I +NL+S+++L A N++ V+F+   CFVKDK + +V+L+G LKDGLYQL                            +  +P + V  K  WHRRLGH
Subjt:  PHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVVESKLLWHRRLGH

Query:  ASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPLKHKGEALSAFHQ
         + K++D V+ SC +K   +D   FC+ACQ GK H LPF  S+     PLELV+ D+WGP+P+ ++SGF++Y+ FVD+F+R T I+PLK K E + AF Q
Subjt:  ASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPLKHKGEALSAFHQ

Query:  YKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLINRLPSPVLHNLS
        +K L EN+F K IKV+Q D GGE++         GIQFR SCP+TSQQNG  ERKHRHI E GL++LAQ+ + L YWWEAFS+AV+LINRLPS V  N S
Subjt:  YKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLINRLPSPVLHNLS

Query:  PWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSSTGRIYISRHVVFNETEFPFQHGFVSSSSSAVQPSNILVHCLP
        P+     + PD+  L+TFGCAC+PCL+PY  HK Q+H+++CVFLGYS +HKGYKCL+S GRI+ISRHV+FNE  FPF  GF+++ S      N+     P
Subjt:  PWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSSTGRIYISRHVVFNETEFPFQHGFVSSSSSAVQPSNILVHCLP

Query:  FPHASNISSSLPSPAPAAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELSSSSPQ-----SVDASVSVSSASLPSSPHVDQSAPA
           A N+                ID +S+P                LE  + A +T    SQ+++S + Q     S D +    +  +     V +++  
Subjt:  FPHASNISSSLPSPAPAAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELSSSSPQ-----SVDASVSVSSASLPSSPHVDQSAPA

Query:  IGT-HPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWIQRIQALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQSDGCFDRCKAR
          T H + TR+ SGI +PK    ++   +     + P   +   S  + +    +  +AL  N TW LVP  +  N+V  KWVFK K + DG  +R KAR
Subjt:  IGT-HPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWIQRIQALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQSDGCFDRCKAR

Query:  LVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQAPRAWYDRLRST
        LVAKGF Q  GID+ ETFSPV+KA T+RIIL++ V   W VRQLDINNAFLNG L+E VFM QP+G+VD + P  +CKL+KA+YGLKQAPRAW+D L++ 
Subjt:  LVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQAPRAWYDRLRST

Query:  LLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFRTADAF----------------------
        LL WGF N+K+DSS+F       I  +LIYV DI++TG   +F+  F+ +LND F+LKD+G L Y+LG+EV R A                         
Subjt:  LLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFRTADAF----------------------

Query:  -------------DGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSL
                     +G+ + +PT++R  +G LQYLTHT PDI+F VN+LSQ++  P+  HW G+KRILRYL+GT+ Y L +  S+ L +  F+DADWA S+
Subjt:  -------------DGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSL

Query:  DDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWCDNTSAISLARNPVLHSRTKHVELDIHFVRDK
        DDR+S++  CVFLG +L+SWSS+KQ VV+RSS ESEYR+LA   AE+ WI+SLLTEL L     PI+WCDN SA +LA NPVLH+R+KH+E+D+H++RD+
Subjt:  DDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWCDNTSAISLARNPVLHSRTKHVELDIHFVRDK

Query:  VLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSP
        VL+ ++ V YVP+++Q+AD LTKPLS + FS +R KLGV+LSP
Subjt:  VLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSP

A0A2Z6P4D5 Integrase catalytic domain-containing protein0.0e+0042.82Show/hide
Query:  SSAANIVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIAAD
        SSAAN   S   + L ++++VKLD +NY LW+ +VL+++RG  +DGY+LGT   P +F   VT   K  + NP + DW   DQALLGWL  SM+  IA  
Subjt:  SSAANIVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIAAD

Query:  VVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDKDISTWQELS
        ++   TS+++W   + + GA +K+R+  L+    NT+KG MKM E+L  MK  ++ LKLAG+PIS SDL    L GLD++Y P+V  + D+   +W ++ 
Subjt:  VVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDKDISTWQELS

Query:  SILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYGHSA
        + L+ FE  L +++  S +     L A   F+ + +          RGN+ N  GN +++SNF   RG   GRG+G     R +N+K  CQ+C   GH A
Subjt:  SILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYGHSA

Query:  PYCYQRFEETFNNPHAASSSNNQG--------------YQGIADGG----------NFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACGNKTLKLNNM
          C  RF+  +   + ++ ++ QG              Y+   D G           F   +++ G   L VGNG KL I  +GS+ +      L L+++
Subjt:  PYCYQRFEETFNNPHAASSSNNQG--------------YQGIADGG----------NFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACGNKTLKLNNM

Query:  LHVPHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVVESKLLWHRR
        L+VP I +NL+S+++LTA N++ V+F++  C VKDK + + +L+G LKDGLYQL                     SN   C  +S  +S       WHR+
Subjt:  LHVPHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVVESKLLWHRR

Query:  LGHASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPLKHKGEALSA
        LGH + K++D V++ CN+K + +D+  FC+ACQ GK H LPF  S+     PL L++ D+WGP+P+ S SGF++Y+ F+D+F+R T IFPLK K + + A
Subjt:  LGHASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPLKHKGEALSA

Query:  FHQYKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLINRLPSPVLH
        F Q+K L EN+F K IK++Q D GGE+++        GIQFR SCP+TSQQNG  ERKHRH+ E+GL++LAQ+ + L+YWWEAFS+AV+LINRLPS V  
Subjt:  FHQYKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLINRLPSPVLH

Query:  NLSPWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSSTGRIYISRHVVFNETEFPFQHGFVSSSSSAVQPSNILVH
        N SP+   F + PD++ L+ FGCAC+PCL+PY  HK QFH+++CVF+GYS +HKGYKC++S GRI++SRHV+FNE  FPF  GF+ + +    P   L  
Subjt:  NLSPWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSSTGRIYISRHVVFNETEFPFQHGFVSSSSSAVQPSNILVH

Query:  CLPFPHASNISSSLPSPAPAAATQADI--DRSSVPCAPSPAVDVLPNHFNDLELSSAA--LDTAVVPSQEL-SSSSPQSVDASVSVSSASLPSSPHVDQS
                N S  LP+ +  A TQ  I  D ++     + +++   N+ N+ ++ S+   ++T    +Q++ + +S  S D + S  + ++      D S
Subjt:  CLPFPHASNISSSLPSPAPAAATQADI--DRSSVPCAPSPAVDVLPNHFNDLELSSAA--LDTAVVPSQEL-SSSSPQSVDASVSVSSASLPSSPHVDQS

Query:  APAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWIQRI----QALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQSDGCF
             TH M+TR+  GI +PK     ++  D      S  +  +     +GR  W + +    +AL  N TW+LVP     N++  KW+FK K +SDG  
Subjt:  APAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWIQRI----QALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQSDGCF

Query:  DRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQAPRAWY
        +R KARLVAKGF Q  G+DF ETFSPVVK+ T+RIILT+ V   W VRQLDINNAFLNG+L+E VFM QP+GY+D + P  +CKL+KA+YGLKQAPRAWY
Subjt:  DRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQAPRAWY

Query:  DRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFR---------------------
        D LRSTL+ WGF N+K D+S+FF   +     +LIYV DI++TG   +F+  F ++LN  ++LKD+G L Y+LGVEV R                     
Subjt:  DRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFR---------------------

Query:  ------------TADAF--DGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDA
                    T   F  +G+ MSNPTLYR  +GALQYLT+TRPDI+F VN+LSQ++  PT  HW G+KRILRYL+GT  + L +  S++L +  F DA
Subjt:  ------------TADAF--DGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDA

Query:  DWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQ-------------------SLLTELSLSSASTPIIWCDNTSAI
        DWA S DDR+S    CVFLG +LVSW+S+KQ VV+RSS ESEYRSLA   AEV                       +LL EL L     P++WCDN SA 
Subjt:  DWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQ-------------------SLLTELSLSSASTPIIWCDNTSAI

Query:  SLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSPS
        +LA NPV+H+R+KH+E+D+H++RD+VLE K+ + YVP+++Q+AD LTKPL  + F+ MR KLGV +SPS
Subjt:  SLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSPS

A0A438FWJ3 Retrovirus-related Pol polyprotein from transposon TNT 1-948.4e-30641.28Show/hide
Query:  PSSSAANIVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIA
        PSS+ A        H     L +KLD NNY+LWR  +  ++     + ++ G K  P +     T +G   E NP ++ W   D+ +L W++ S++P I 
Subjt:  PSSSAANIVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIA

Query:  ADVVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDK-DISTWQ
          +V + +S   W ALE+++ A+S+ARV QLR   Q T+KG++ M+E++  +K  A+NL   G P++  D    +L GL +DY  IV ++  + D  +  
Subjt:  ADVVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDK-DISTWQ

Query:  ELSSILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYG
         + SIL+T E    R S  ++V E   ++A+L   +    N            N +   Q +QS F+  RG + GR        +S+  +P CQLCGK+G
Subjt:  ELSSILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYG

Query:  HSAPYCYQRFEETF------------NNPHA--------ASSSNNQGYQGIADGG----------NFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACG
        H+   CY RF+  F            N P+A        AS S         D G            +    Y GN  + VGNG  L I HTG++     
Subjt:  HSAPYCYQRFEETF------------NNPHA--------ASSSNNQGYQGIADGG----------NFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACG

Query:  NKTLKLNNMLHVPHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVV
        +KT +L  +LHVP I  NL+S+++  A N+ + +F+  F FVKD+ +++++L+G+L+ GLY+     +     SPA+F   VS S +      SS  S+ 
Subjt:  NKTLKLNNMLHVPHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVV

Query:  ESKLLWHRRLGHASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPL
         +  LWH RLGH +  I+  ++ SCN+    + K   C ACQ  KSH+LPF+ S  R  HPL L++ DLWGP+ + ST+G R++I FVD+F+R + I+PL
Subjt:  ESKLLWHRRLGHASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPL

Query:  KHKGEALSAFHQYKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLI
          K +ALS F ++K+LVEN+F   I+ L++D GGEF++F+S L   GI+ + SCP+T +QNG  ERK RHI+E GL++LA +SL  ++W  AF +A+FLI
Subjt:  KHKGEALSAFHQYKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLI

Query:  NRLPSPVLHNLSPWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSS-TGRIYISRHVVFNETEFPFQHGFVSSSSS
        NRLP+ VL+  SP++  F + P++   + FGC C+P +RPY  +K  + SS+CVFLGYS NHKGY CL+  TGR+Y++RHVVF+ET FPFQ     SSS 
Subjt:  NRLPSPVLHNLSPWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSS-TGRIYISRHVVFNETEFPFQHGFVSSSSS

Query:  AVQPSNILVHCLPFPHASNISSSLPSPAPAAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELSSSSPQSVDASVSVSSASLPSSP
           P+   + C   P +S  S + PS +    T       S+P                        D   VP  ++S+S P   +              
Subjt:  AVQPSNILVHCLPFPHASNISSSLPSPAPAAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELSSSSPQSVDASVSVSSASLPSSP

Query:  HVDQSAPAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWI----QRIQALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQ
                   HPM TRA +GI + K +  F S +               F+  +    W+    +   AL++N+TW LVPPPS+ N++GCKWV+K+K +
Subjt:  HVDQSAPAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWI----QRIQALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQ

Query:  SDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQA
         DG  DR KARLVA+GF Q  G+D+ ETFSPVVKA TIRIIL + ++  W V QLD+ NAFL+G LEE VFM+QP G+++   P+ VCKLNKALYGLKQA
Subjt:  SDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQA

Query:  PRAWYDRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFRTADAF-----------
        PRAWY++L ++LL WGF  S+ADSS+F   ++  ++I+LIYV DIL+TG  S  ++ F++RLN  FAL+D+G ++Y+LG+EV R+   F           
Subjt:  PRAWYDRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFRTADAF-----------

Query:  -------------------------DGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSL
                                 DG+P S+ TLYRS +GALQYLT TRPDISF VN+  QF+  PT  HW  VKRILRYLKGT+ YG+ + +S+SL +
Subjt:  -------------------------DGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSL

Query:  FAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWCDNTSAISLARNPVLHSRTK
          +TDADWA   DDRRS   + +FLG +LVSWSS KQ VV+RSSAESEYR+LA  T+E++WIQ +L EL LSS+S P++WCDN SA  LA NPV H+RTK
Subjt:  FAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWCDNTSAISLARNPVLHSRTK

Query:  HVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSPSRLRGDVRTKVYD
        H+E+D+HF+RD VL K++ ++Y+PS+EQ+AD  TK +S S F S R+KL V+ SP  LRGD R  + D
Subjt:  HVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSPSRLRGDVRTKVYD

A0A803PM38 Uncharacterized protein0.0e+0044.39Show/hide
Query:  SISTPSSSAANIVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQL---EPNPSYIDWTTVDQALLGWLFG
        S+      A NIV   FG  L+    +KLD NN+ LWR MV AI+RG  +DGY+ GT  +P EF       G      + NP++  W   DQ LLGWL+G
Subjt:  SISTPSSSAANIVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQL---EPNPSYIDWTTVDQALLGWLFG

Query:  SMSPAIAADVVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDK
        SM+  IA +V+   +S  +W ALE+++GA SKA++++ R  +Q  +KG + M ++L   +Q A+ L LAG P   + L S VL GLD +Y+P+V  IE +
Subjt:  SMSPAIAADVVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDK

Query:  DISTWQELSSILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRG-NSRGRGRGRNNFQRSNNSKPTC
          +TWQ+L  +L++ +  + R    S  ++L  +  + + S    +N G      RGN NN       +   SNNRG N+R RGRG     R++  +PTC
Subjt:  DISTWQELSSILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRG-NSRGRGRGRNNFQRSNNSKPTC

Query:  QLCGKYGHSAPYCYQRFEETFNNPHAASSSNNQGYQGIADGGNFATKSKYTGNKCLTVGNGNKLDIEHTGSSSV-ACGNKTLKLNNMLHVPHIKRNLVSI
        Q+CGKYGHSA +CY R        H  S  N               K +Y G + +TV NGN+L I H G  S+       L L  +LHVP I +NL+SI
Subjt:  QLCGKYGHSAPYCYQRFEETFNNPHAASSSNNQGYQGIADGGNFATKSKYTGNKCLTVGNGNKLDIEHTGSSSV-ACGNKTLKLNNMLHVPHIKRNLVSI

Query:  ARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELP------SIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVVESKLLWHRRLGHASEK
        ++LT+ N+V V+F S  CFVKDK + +V+L+G LKDGLYQ + P      S  +S   P SF   V  +  S      + + +   K  WHRRLGH S +
Subjt:  ARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELP------SIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVVESKLLWHRRLGHASEK

Query:  IVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPLKHKGEALSAFHQYKAL
        ++D V+   N+K N+N  + FCDACQLGKSH LPF  +  R   PLELV+ D+WGPSP+ S + FR+YI F+D+F+R T I+PLK K EAL+AF Q+K L
Subjt:  IVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPLKHKGEALSAFHQYKAL

Query:  VENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLINRLPSPVLHNLSPWEK
        VEN+F   +K +QTDWGGE++ F       GI F+H CPHTS QNG  ERKHRHIVEMGL++LAQ+ +  +YWW+AF +AV+LINRLP+PVL   +P+E 
Subjt:  VENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLINRLPSPVLHNLSPWEK

Query:  AFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSSTGRIYISRHVVFNETEFPFQHGFVSSSSSAVQPSNILVHCLPFPHA
         F Q PD+ FL+ FG +CFPCLR YQ+HKFQFHS+KCV LGYS  HKGYKCLSSTGR+YISR V+FNE EFPF+ GF++++     P ++LV   PF  A
Subjt:  AFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSSTGRIYISRHVVFNETEFPFQHGFVSSSSSAVQPSNILVHCLPFPHA

Query:  SNISSSLPS-----PAPAAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELSSSSPQSVDASVSVSSASLPSSPHVDQSA------
        S+  +S  S      +     Q D      P       D+     ND        D  +     +   S   +      ++    + P +D SA      
Subjt:  SNISSSLPS-----PAPAAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELSSSSPQSVDASVSVSSASLPSSPHVDQSA------

Query:  PAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWIQRIQALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQSDGCFDRCKA
          + THPM TRA +GIF+PK    +L+    I     P  +                + AL +N TW LVP   H++++  KWV+K KR +DG F R KA
Subjt:  PAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWIQRIQALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQSDGCFDRCKA

Query:  RLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQAPRAWYDRLRS
        RLVAKGF Q PG+DF ETFSPV+KA T+RI+L++ V + W VRQLDINNAFLNG + ED++MKQP G+ D + P  VCKL K++YGL+QAPRAW+D+L++
Subjt:  RLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQAPRAWYDRLRS

Query:  TLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFRTADAFDGDPMSNPTLYRSVLGALQYL
        TL  W F NSKADSS+FF   S+ I++VLIYV DI+ITG+ S  +  F+++LN  FALKD+G L Y+LG+EV R A    G  +S P     +L  +  +
Subjt:  TLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFRTADAFDGDPMSNPTLYRSVLGALQYL

Query:  THTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAE
                                        L+     M  G  L      SL   T+       +DRRS+A  CV+LG +L+SWSS+KQ VV+RSS E
Subjt:  THTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAE

Query:  SEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWCDNTSAISLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMR
        SEYR+LA   AE+ W+QSLL EL     +TPIIWCDN  A +LA NPV H+RTKH+E+DIHFVRDK++EKK++VRY+PSSEQ+AD LTK L+      + 
Subjt:  SEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWCDNTSAISLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMR

Query:  SKLGVMLSPSRLRGDVR
        SKLGV+  P  LRG+VR
Subjt:  SKLGVMLSPSRLRGDVR

A5BFR8 Integrase catalytic domain-containing protein1.1e-30542.3Show/hide
Query:  STPRSISTPSSSAANIVSSSFG-----HPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEP---NPSYIDWTTVD
        STP   S+ S S  +  SS+         L+  L VKLD  NY+LWR  +  ++     + ++ GT   P           K L P   NP+++ W   D
Subjt:  STPRSISTPSSSAANIVSSSFG-----HPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEP---NPSYIDWTTVD

Query:  QALLGWLFGSMSPAIAADVVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYM
        + +L W++ S++P I A ++   TS   W ALE ++ ++S+AR+ QLR  LQ+TKKG+M M++++  +K AA+NL   G P+S  D    +L GL SDY 
Subjt:  QALLGWLFGSMSPAIAADVVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYM

Query:  PIVCTIEDKDIS-TWQELSSILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQ
         +V  I  +D   + + + S+L+ FE  L + S       +  ++A+   S  N    G++F+ GRG          Q  + +NN    RGRGRG  N Q
Subjt:  PIVCTIEDKDIS-TWQELSSILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQ

Query:  --RSNNS---KPTCQLCGKYGHSAPYCYQRFEETF-----------NNPH-------AASSSNNQGYQG-----------IADGGNFATKSKYTGNKCLT
          R N+S   KP CQLCGK+GH+A  CY RF+ +F           NN +        AS+SNN   +              + GN  + S YTG   +T
Subjt:  --RSNNS---KPTCQLCGKYGHSAPYCYQRFEETF-----------NNPH-------AASSSNNQGYQG-----------IADGGNFATKSKYTGNKCLT

Query:  VGNGNKLDIEHTGSSSVACGNKTLKLNNMLHVPHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHV
        +GNG  L I + GS  +     + +L  + HVP I  NL+S+A+  + N+  ++F+S   FVKD  ++ V+ +G L++GLY+            P   ++
Subjt:  VGNGNKLDIEHTGSSSVACGNKTLKLNNMLHVPHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHV

Query:  AVSHSNNSCCPSLSSPKSVVESKL-LWHRRLGHASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTS
            S N+     S   S VE+K  LWH RLGHAS  IV  V+ +CN+      K   C  CQL KSHRLP   SNF    PLELVY D+WGP+ + STS
Subjt:  AVSHSNNSCCPSLSSPKSVVESKL-LWHRRLGHASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTS

Query:  GFRFYISFVDNFTRLTHIFPLKHKGEALSAFHQYKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSML
        G R++I FVD+++R T  + L+ K +AL  F  +K  +EN+F+  IK LQ+D GGEFRSFTS L+  GI  R SCP+ S QNG VERKHRH+VE GL++L
Subjt:  GFRFYISFVDNFTRLTHIFPLKHKGEALSAFHQYKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSML

Query:  AQSSLSLQYWWEAFSSAVFLINRLPSPVLHNLSPWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLS-STGRIYISR
        + +SL ++YW  AF +  FLINR+PS VL   SP+   F + PD+   R FGC C+P +RPY +HK Q+ S +C+FLGYS NHKG+ CL  +TGR+YI+ 
Subjt:  AQSSLSLQYWWEAFSSAVFLINRLPSPVLHNLSPWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLS-STGRIYISR

Query:  HVVFNETEFPFQHGFVSSSSSAVQPSNILVHCLPFPHASNISSSLPSPAP-AAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELS
        HVVF+E+ FP      SSSS+           +  P       S P   P +  + A ID  S+  + SP    +P        SS+ LDT        S
Subjt:  HVVFNETEFPFQHGFVSSSSSAVQPSNILVHCLPFPHASNISSSLPSPAP-AAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELS

Query:  SSSPQSVDASVSVSSASLPSSPHVDQSAPAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLS-PHQLRRHFSPLIGRVRWIQRIQALRKNDTWSLVP
        SSSP     ++ +S  S+P  P +   AP      M TR+  GI + K      + +D+  + +S P  L++ F            I AL +N TW LV 
Subjt:  SSSPQSVDASVSVSSASLPSSPHVDQSAPAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLS-PHQLRRHFSPLIGRVRWIQRIQALRKNDTWSLVP

Query:  PPSHVNLVGCKWVFKVKRQSDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDP
         P +VN++GCKWV+K+K + DG  +R KARLVAKG++Q  G+D+ ETFSPVVKA TIRIILT+ ++  W +RQLD++NAFLNG LEE V+M QP GY DP
Subjt:  PPSHVNLVGCKWVFKVKRQSDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDP

Query:  SCPTFVCKLNKALYGLKQAPRAWYDRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVE
          P  VC+L KALYGLKQAPRAW+ RL S LL+WGF  S+ DSS+F        +IVL+YV DIL+TG  S  I+  +++L+ +FAL+D+G LS++LG+E
Subjt:  SCPTFVCKLNKALYGLKQAPRAWYDRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVE

Query:  V------------------------FRTADA------------FDGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRY
        V                        F T  A            FDGDPM++ T YRSV+GALQY+T TRPDI+F VN+  QF+QQPT  HW  VKRILRY
Subjt:  V------------------------FRTADA------------FDGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRY

Query:  LKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWC
        L+GTM+ GL    SS+L++  FTDADW   LDDRRS + + V+LGG+LVSWSS KQ VV+RSSAESEYR L   TAE++W+Q+LL EL +   + P++W 
Subjt:  LKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWC

Query:  DNTSAISLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSPSRLRGD
        DN SA  +A+NPV H+RTKH+E+D+HF+RD+V+  KI +++VP+ EQ  D LTK L+ S F S++S+L +   P  LRGD
Subjt:  DNTSAISLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSPSRLRGD

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.0e-10826.38Show/hide
Query:  DENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIAADVVSFTTSREVWKALEKVYGATSK
        D   Y +W+  + A+L  Q+V   V G                  L PN     W   ++     +   +S +      S  T+R++ + L+ VY   S 
Subjt:  DENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIAADVVSFTTSREVWKALEKVYGATSK

Query:  ARVNQLRGVLQNTK-KGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDKDISTWQELSSILITFEGTLARYSVPSNVNEL
        A    LR  L + K    M ++    I  +    L  AG  I   D  S++L+ L S Y  I+  IE     T  E +  L   +  L            
Subjt:  ARVNQLRGVLQNTK-KGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDKDISTWQELSSILITFEGTLARYSVPSNVNEL

Query:  PDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYGHSAPYCYQRFEETFNNPHAASSSNN
          L   +     +   S K  +    N NN          + NN   +R   + +  F+ ++  K  C  CG+ GH    C+  ++   NN     +  N
Subjt:  PDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYGHSAPYCYQRFEETFNNPHAASSSNN

Query:  QGYQGIADGGNFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACGNKTLKLNNMLHVPHIKRNLV-----------SIARLTAYNDVYVQFNSGFCFVKD
        +     A     A   K   N  +    G  LD   +G+S     +++L  +++  VP +K  +             I RL   +++ ++      F K+
Subjt:  QGYQGIADGGNFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACGNKTLKLNNMLHVPHIKRNLV-----------SIARLTAYNDVYVQFNSGFCFVKD

Query:  KTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVVESK-----LLWHRRLGHASE-KIVDA----VIRSCNLKFNLNDK
             + ++   + G+  +E        +S     V  +    +  P ++     + +K      LWH R GH S+ K+++     +    +L  NL   
Subjt:  KTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVVESK-----LLWHRRLGHASE-KIVDA----VIRSCNLKFNLNDK

Query:  VPFCDACQLGKSHRLPFSHSNFRTH--HPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPLKHKGEALSAFHQYKALVENRFEKSIKVLQTDW
           C+ C  GK  RLPF     +TH   PL +V+ D+ GP    +     +++ FVD FT     + +K+K +  S F  + A  E  F   +  L  D 
Subjt:  VPFCDACQLGKSHRLPFSHSNFRTH--HPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPLKHKGEALSAFHQYKALVENRFEKSIKVLQTDW

Query:  GGEFRSFTSPLKLF----GIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLINRLPSPVLHNLS--PWEKAFNQVPDFSF
        G E+ S  + ++ F    GI +  + PHT Q NG+ ER  R I E   +M++ + L   +W EA  +A +LINR+PS  L + S  P+E   N+ P    
Subjt:  GGEFRSFTSPLKLF----GIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLINRLPSPVLHNLS--PWEKAFNQVPDFSF

Query:  LRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSSTG-RIYISRHVVFNETEF------PFQHGFVSSSSSAVQPSNILVHCLPFPHASN-
        LR FG   +  ++  Q  KF   S K +F+GY  N  G+K   +   +  ++R VV +ET         F+  F+  S  +   +        FP+ S  
Subjt:  LRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSSTG-RIYISRHVVFNETEF------PFQHGFVSSSSSAVQPSNILVHCLPFPHASN-

Query:  -ISSSLPSPAPAAAT---QADIDRSSVPCAPSPAVDVL----PNHFNDLELSSAALDTAVVPSQELSSSSPQSVDASVSVSSAS-LPSSPHVDQSAPAIG
         I +  P+ +          D   S     P+ +  ++    PN   + +      D+       L+ S  +  D  ++ S  S  P+     ++A  + 
Subjt:  -ISSSLPSPAPAAAT---QADIDRSSVPCAPSPAVDVL----PNHFNDLELSSAALDTAVVPSQELSSSSPQSVDASVSVSSAS-LPSSPHVDQSAPAIG

Query:  THPMQT-RANSGIFRPKEWGAFLSSVDMI----------LLLLSPHQLRRHFSPLIGRVR-------WIQRI----QALRKNDTWSLVPPPSHVNLVGCK
           +     N GI         L +   I           ++L+ H +          ++       W + I     A + N+TW++   P + N+V  +
Subjt:  THPMQT-RANSGIFRPKEWGAFLSSVDMI----------LLLLSPHQLRRHFSPLIGRVR-------WIQRI----QALRKNDTWSLVPPPSHVNLVGCK

Query:  WVFKVKRQSDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPT-FVCKLN
        WVF VK    G   R KARLVA+GF Q   ID+ ETF+PV +  + R IL++V+     V Q+D+  AFLNG L+E+++M+ P G    SC +  VCKLN
Subjt:  WVFKVKRQSDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPT-FVCKLN

Query:  KALYGLKQAPRAWYDRLRSTLLEWGFLNSKADSSVFFF--SNSTMIVIVLIYVHDILI-TGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFRTADA
        KA+YGLKQA R W++     L E  F+NS  D  ++     N    + VL+YV D++I TGD +  +N F   L + F + D+  + +++G+ +    D 
Subjt:  KALYGLKQAPRAWYDRLRSTLLEWGFLNSKADSSVFFF--SNSTMIVIVLIYVHDILI-TGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFRTADA

Query:  F------------------DGDPMSNP-----------------TLYRSVLGALQY-LTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEY
                           + + +S P                 T  RS++G L Y +  TRPD++  VN LS++  +     W  +KR+LRYLKGT++ 
Subjt:  F------------------DGDPMSNP-----------------TLYRSVLGALQY-LTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEY

Query:  GLFLPRSSSL--SLFAFTDADWACSLDDRRSIAAHCV-FLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWCDNTS
         L   ++ +    +  + D+DWA S  DR+S   +       +L+ W++K+Q  VA SS E+EY +L     E LW++ LLT +++   +   I+ DN  
Subjt:  GLFLPRSSSL--SLFAFTDADWACSLDDRRSIAAHCV-FLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPIIWCDNTS

Query:  AISLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVM
         IS+A NP  H R KH+++  HF R++V    I + Y+P+  QLAD  TKPL  + F  +R KLG++
Subjt:  AISLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.9e-12327.82Show/hide
Query:  DWTTVDQALLGWLFGSMSPAIAADVVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVG
        DW  +D+     +   +S  +  +++   T+R +W  LE +Y   SK   N+L                    +K+    L ++     LS L   V  G
Subjt:  DWTTVDQALLGWLFGSMSPAIAADVVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVG

Query:  LDSDYMPIVCTIEDKDISTWQELSSILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRG
        L +    +   IE++D +    L+S+  +++             EL D+ + L  + + +     Q   G+       G  +Q+S  SNN G S  RG+ 
Subjt:  LDSDYMPIVCTIEDKDISTWQELSSILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRG

Query:  RNNFQRSNNSKPTCQLCGKYGHSAPYCYQRFEETFNNP-----HAASSSNNQGYQGIADGGNFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACGNKTL
        +N   RS +    C  C + GH        F+    NP       +   N+     +    +          +C+ +       +  T +S  A   + L
Subjt:  RNNFQRSNNSKPTCQLCGKYGHSAPYCYQRFEETFNNP-----HAASSSNNQGYQGIADGGNFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACGNKTL

Query:  KLNNML-HVPHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGL------YQLELPSIQKSPVSPASFHVA-------VSHSNNSCC
            +      +K    S +++    D+ ++ N G   V         LR  L  G+      Y+    + QK  ++  S  +A       +  +N   C
Subjt:  KLNNML-HVPHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGL------YQLELPSIQKSPVSPASFHVA-------VSHSNNSCC

Query:  PSLSSPKSVVESKLLWHRRLGHASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDN
            +      S  LWH+R+GH SEK +  + +   + +     V  CD C  GK HR+ F  S+ R  + L+LVY D+ GP  + S  G +++++F+D+
Subjt:  PSLSSPKSVVESKLLWHRRLGHASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDN

Query:  FTRLTHIFPLKHKGEALSAFHQYKALVENRFEKSIKVLQTDWGGEF--RSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQY
         +R   ++ LK K +    F ++ ALVE    + +K L++D GGE+  R F       GI+   + P T Q NG+ ER +R IVE   SML  + L   +
Subjt:  FTRLTHIFPLKHKGEALSAFHQYKALVENRFEKSIKVLQTDWGGEF--RSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQY

Query:  WWEAFSSAVFLINRLPSPVLHNLSPWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSSTGRIYI-SRHVVFNETEF
        W EA  +A +LINR PS  L    P     N+   +S L+ FGC  F  +   Q  K    S  C+F+GY     GY+      +  I SR VVF E+E 
Subjt:  WWEAFSSAVFLINRLPSPVLHNLSPWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSSTGRIYI-SRHVVFNETEF

Query:  PFQHGFVSSSSSAVQPSNILVHCLPFPHASNISSSLPSPAPAAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELSSSSPQSVDAS
               ++  S    + I+ + +  P  SN  +S  S     + Q +         P   ++         E     ++    P+Q      P      
Subjt:  PFQHGFVSSSSSAVQPSNILVHCLPFPHASNISSSLPSPAPAAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELSSSSPQSVDAS

Query:  VSVSSASLPSSPHV---DQSAPAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWIQRIQALRKNDTWSLVPPPSHVNLV
          V S   PS+ +V   D   P                          S+  +L     +QL +            + +++L+KN T+ LV  P     +
Subjt:  VSVSSASLPSSPHV---DQSAPAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWIQRIQALRKNDTWSLVPPPSHVNLV

Query:  GCKWVFKVKRQSDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCK
         CKWVFK+K+  D    R KARLV KGF Q  GIDF E FSPVVK  +IR IL++  +    V QLD+  AFL+G LEE+++M+QP+G+        VCK
Subjt:  GCKWVFKVKRQSDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCK

Query:  LNKALYGLKQAPRAWYDRLRSTLLEWGFLNSKADSSVFF--FSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFRTA-
        LNK+LYGLKQAPR WY +  S +    +L + +D  V+F  FS +  I I+L+YV D+LI G     I +    L+  F +KD+G     LG+++ R   
Subjt:  LNKALYGLKQAPRAWYDRLRSTLLEWGFLNSKADSSVFF--FSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFRTA-

Query:  -------------------DAFDGDPMSNPTL------------------------YRSVLGALQY-LTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRI
                           +  +  P+S P                          Y S +G+L Y +  TRPDI+  V  +S+FL+ P + HW  VK I
Subjt:  -------------------DAFDGDPMSNPTL------------------------YRSVLGALQY-LTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRI

Query:  LRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPI
        LRYL+GT    L    S  + L  +TDAD A  +D+R+S   +     G  +SW SK Q  VA S+ E+EY +   T  E++W++  L EL L      +
Subjt:  LRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELSLSSASTPI

Query:  IWCDNTSAISLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGV
        ++CD+ SAI L++N + H+RTKH+++  H++R+ V ++ + V  + ++E  AD LTK +  + F   +  +G+
Subjt:  IWCDNTSAISLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGV

P92519 Uncharacterized mitochondrial protein AtMg008102.4e-3939.01Show/hide
Query:  VLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGV------------------EVFRTADAFDGDPMS-----------------NPTLYRS
        +L+YV DIL+TG  +  +N  + +L+  F++KD+G + Y+LG+                  ++   A   D  PMS                 +P+ +RS
Subjt:  VLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGV------------------EVFRTADAFDGDPMS-----------------NPTLYRS

Query:  VLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQA
        ++GALQYLT TRPDIS+ VN + Q + +PT   +  +KR+LRY+KGT+ +GL++ ++S L++ AF D+DWA     RRS    C FLG +++SWS+K+Q 
Subjt:  VLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQA

Query:  VVARSSAESEYRSLAHTTAEVLW
         V+RSS E+EYR+LA T AE+ W
Subjt:  VVARSSAESEYRSLAHTTAEVLW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.1e-24936.85Show/hide
Query:  KLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIAADVVSFTTSREVWKALEKVYGAT
        KL   NYL+W   V A+  G  + G++ G+   P     + T+   ++  NP Y  W   D+ +   + G++S ++   V   TT+ ++W+ L K+Y   
Subjt:  KLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIAADVVSFTTSREVWKALEKVYGAT

Query:  SKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDKDI-STWQELSSILITFEGTLARYSVPSNVN
        S   V QLR  L+   KGT  + +++  +    + L L G P+   +    VL  L  +Y P++  I  KD   T  E+   L+  E  +   S  + + 
Subjt:  SKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDKDI-STWQELSSILITFEGTLARYSVPSNVN

Query:  ELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNN-SKP---TCQLCGKYGHSAPYC--YQRFEETFNNP
           +  +H   +  N +N+        GN+NN+  N+       NN  NS+   +   NF  +NN SKP    CQ+CG  GHSA  C   Q F  + N+ 
Subjt:  ELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNN-SKP---TCQLCGKYGHSAPYC--YQRFEETFNNP

Query:  HAAS-----------------SSNN------QGYQGIADGGNFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACGNKTLKLNNMLHVPHIKRNLVSIAR
           S                 SSNN        +   +D  N +    YTG   + V +G+ + I HTGS+S++  ++ L L+N+L+VP+I +NL+S+ R
Subjt:  HAAS-----------------SSNN------QGYQGIADGGNFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACGNKTLKLNNMLHVPHIKRNLVSIAR

Query:  LTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSP-ASFHVAVSHSNNSCCPSLSSPKSVVESKLLWHRRLGHASEKIVDAVIR
        L   N V V+F      VKD  +   +L+G  KD LY  E P     PVS  AS     +HS+                   WH RLGH +  I+++VI 
Subjt:  LTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSP-ASFHVAVSHSNNSCCPSLSSPKSVVESKLLWHRRLGHASEKIVDAVIR

Query:  SCNLK-FNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPLKHKGEALSAFHQYKALVENRFE
        + +L   N + K   C  C + KS+++PFS S   +  PLE +Y D+W  SP+ S   +R+Y+ FVD+FTR T ++PLK K +    F  +K L+ENRF+
Subjt:  SCNLK-FNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPLKHKGEALSAFHQYKALVENRFE

Query:  KSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLINRLPSPVLHNLSPWEKAFNQVP
          I    +D GGEF +        GI    S PHT + NG+ ERKHRHIVE GL++L+ +S+   YW  AF+ AV+LINRLP+P+L   SP++K F   P
Subjt:  KSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLINRLPSPVLHNLSPWEKAFNQVP

Query:  DFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLS-STGRIYISRHVVFNETEFPFQHGFVSSSSSAVQPSNILVHCLPFPHASNISS
        ++  LR FGCAC+P LRPY  HK    S +CVFLGYS     Y CL   T R+YISRHV F+E  FPF +    ++ S VQ       C+  PH +  + 
Subjt:  DFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLS-STGRIYISRHVVFNETEFPFQHGFVSSSSSAVQPSNILVHCLPFPHASNISS

Query:  SLPSPAPAAA---------------------TQADID---RSSVPCAPSPAV---------------------DVLPNHFNDLELSSAALDTAVVPSQEL
        +   PAP+ +                     + +++D    SS P +P P                           +  N    S + L  ++    + 
Subjt:  SLPSPAPAAA---------------------TQADID---RSSVPCAPSPAV---------------------DVLPNHFNDLELSSAALDTAVVPSQEL

Query:  SSSSPQSVDASVSVSSASLPSSPHVDQSAP-----------AIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWI----Q
        SSSSP    ++ S S++  P S  +    P            + TH M TRA +GI +P           + + L +  + R     L    RW      
Subjt:  SSSSPQSVDASVSVSSASLPSSPHVDQSAP-----------AIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWI----Q

Query:  RIQALRKNDTWSLV-PPPSHVNLVGCKWVFKVKRQSDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGR
         I A   N TW LV PPPSHV +VGC+W+F  K  SDG  +R KARLVAKG++Q PG+D+ ETFSPV+K+ +IRI+L + V R WP+RQLD+NNAFL G 
Subjt:  RIQALRKNDTWSLV-PPPSHVNLVGCKWVFKVKRQSDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGR

Query:  LEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQAPRAWYDRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDI
        L +DV+M QP G++D   P +VCKL KALYGLKQAPRAWY  LR+ LL  GF+NS +D+S+F       IV +L+YV DILITG+    ++  +  L+  
Subjt:  LEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQAPRAWYDRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDI

Query:  FALKDMGLLSYYLGVEVFRT------------------------------------ADAFDGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQ
        F++KD   L Y+LG+E  R                                        + G  +++PT YR ++G+LQYL  TRPDIS+ VNRLSQF+ 
Subjt:  FALKDMGLLSYYLGVEVFRT------------------------------------ADAFDGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQFLQ

Query:  QPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSL
         PTE H   +KRILRYL GT  +G+FL + ++LSL A++DADWA   DD  S   + V+LG   +SWSSKKQ  V RSS E+EYRS+A+T++E+ WI SL
Subjt:  QPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSL

Query:  LTELSLSSASTPIIWCDNTSAISLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSP
        LTEL +     P+I+CDN  A  L  NPV HSR KH+ +D HF+R++V    + V +V + +QLAD LTKPLS ++F +  SK+GV   P
Subjt:  LTELSLSSASTPIIWCDNTSAISLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.2e-24636.6Show/hide
Query:  KLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIAADVVSFTTSREVWKALEKVYGAT
        KL   NYL+W   V A+  G  + G++ G+   P         T      NP Y  W   D+ +   + G++S ++   V   TT+ ++W+ L K+Y   
Subjt:  KLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSPAIAADVVSFTTSREVWKALEKVYGAT

Query:  SKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDKDI-STWQELSSILITFEGTLARYSVPSNVN
        S   V QLR + +                    + L L G P+   +    VL  L  DY P++  I  KD   +  E+   LI  E  L    +  N  
Subjt:  SKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDKDI-STWQELSSILITFEGTLARYSVPSNVN

Query:  ELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKP-----TCQLCGKYGHSAPYCYQ--RFEETFNN
        E+  + A++   R   +N          NQNN+G N+   +N  NNR NS    +  ++  RS+N +P      CQ+C   GHSA  C Q  +F+ T N 
Subjt:  ELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKP-----TCQLCGKYGHSAPYCYQ--RFEETFNN

Query:  PHAAS-----------------SSNN------QGYQGIADGGNFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACGNKTLKLNNMLHVPHIKRNLVSIA
          + S                 ++NN        +   +D  N +    YTG   + + +G+ + I HTGS+S+   +++L LN +L+VP+I +NL+S+ 
Subjt:  PHAAS-----------------SSNN------QGYQGIADGGNFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACGNKTLKLNNMLHVPHIKRNLVSIA

Query:  RLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVVESKLLWHRRLGHASEKIVDAVIR
        RL   N V V+F      VKD  +   +L+G  KD LY+  + S Q           AVS   + C  +  S          WH RLGH S  I+++VI 
Subjt:  RLTAYNDVYVQFNSGFCFVKDKTSRRVMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVVESKLLWHRRLGHASEKIVDAVIR

Query:  SCNLK-FNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPLKHKGEALSAFHQYKALVENRFE
        + +L   N + K+  C  C + KSH++PFS+S   +  PLE +Y D+W  SP+ S   +R+Y+ FVD+FTR T ++PLK K +    F  +K+LVENRF+
Subjt:  SCNLK-FNLNDKVPFCDACQLGKSHRLPFSHSNFRTHHPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPLKHKGEALSAFHQYKALVENRFE

Query:  KSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLINRLPSPVLHNLSPWEKAFNQVP
          I  L +D GGEF      L   GI    S PHT + NG+ ERKHRHIVEMGL++L+ +S+   YW  AFS AV+LINRLP+P+L   SP++K F Q P
Subjt:  KSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHRHIVEMGLSMLAQSSLSLQYWWEAFSSAVFLINRLPSPVLHNLSPWEKAFNQVP

Query:  DFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLS-STGRIYISRHVVFNETEFPFQHGFVSSSSSAVQ--------PSNILVHCLPF
        ++  L+ FGCAC+P LRPY  HK +  S +C F+GYS     Y CL   TGR+Y SRHV F+E  FPF       S+S  Q        PS+  +   P 
Subjt:  DFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLS-STGRIYISRHVVFNETEFPFQHGFVSSSSSAVQ--------PSNILVHCLPF

Query:  ---------PHASNISSSLPSPAPAAATQADIDR----------SSVPCAPS---PAVDVLPNHFNDLELSSAALD--------------TAVVPSQELS
                 PH         SP+P   TQ               SS P APS   P     P+   +   +S  L+               + +P   +S
Subjt:  ---------PHASNISSSLPSPAPAAATQADIDR----------SSVPCAPS---PAVDVLPNHFNDLELSSAALD--------------TAVVPSQELS

Query:  S-------------SSPQSVDASVSVSSASLPSSPHVDQSAPA-IGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWIQ--
        S             +SP S   S       LP+ P +  +A A + TH M TRA  GI +P +  ++ +S       L+ +   R     +   RW Q  
Subjt:  S-------------SSPQSVDASVSVSSASLPSSPHVDQSAPA-IGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWIQ--

Query:  --RIQALRKNDTWSLV-PPPSHVNLVGCKWVFKVKRQSDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLN
           I A   N TW LV PPP  V +VGC+W+F  K  SDG  +R KARLVAKG++Q PG+D+ ETFSPV+K+ +IRI+L + V R WP+RQLD+NNAFL 
Subjt:  --RIQALRKNDTWSLV-PPPSHVNLVGCKWVFKVKRQSDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLN

Query:  GRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQAPRAWYDRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLN
        G L ++V+M QP G+VD   P +VC+L KA+YGLKQAPRAWY  LR+ LL  GF+NS +D+S+F       I+ +L+YV DILITG+ +  +   +  L+
Subjt:  GRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQAPRAWYDRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLN

Query:  DIFALKDMGLLSYYLGVEVFRTADAF------------------------------------DGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQF
          F++K+   L Y+LG+E  R                                          G  + +PT YR ++G+LQYL  TRPD+S+ VNRLSQ+
Subjt:  DIFALKDMGLLSYYLGVEVFRTADAF------------------------------------DGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQF

Query:  LQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQ
        +  PT+ HW+ +KR+LRYL GT ++G+FL + ++LSL A++DADWA   DD  S   + V+LG   +SWSSKKQ  V RSS E+EYRS+A+T++E+ WI 
Subjt:  LQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQ

Query:  SLLTELSLSSASTPIIWCDNTSAISLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSPSRLRGDVR
        SLLTEL +  +  P+I+CDN  A  L  NPV HSR KH+ LD HF+R++V    + V +V + +QLAD LTKPLS  +F +   K+GV+  P    G +R
Subjt:  SLLTELSLSSASTPIIWCDNTSAISLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSPSRLRGDVR

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.2e-0622.79Show/hide
Query:  LDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSP-AIAADVVSFTTSREVWKALEKVYGAT
        ++E+NY  WR + L      +V G++ GT                 L  N + ++W   D  +   L+G+++P       V+ +TSR++W  ++  +   
Subjt:  LDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALLGWLFGSMSP-AIAADVVSFTTSREVWKALEKVYGAT

Query:  SKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDKD-ISTWQELSSILITFEGTLARYSVPSNVN
          AR  +L   L+    G M++ ++   MK+ A++L+    P++  +L  YVL GL+  +  I+  I+ +    ++ + +++L   E  L R   P+   
Subjt:  SKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDKD-ISTWQELSSILITFEGTLARYSVPSNVN

Query:  ELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYGHSAPYCYQRFEETFNNP
          P    H + S     +     +    N    GGNQ             RGRGRG N F+          +      + P  YQ   + +N+P
Subjt:  ELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYGHSAPYCYQRFEETFNNP

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.5e-9740.34Show/hide
Query:  IQALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQSDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLE
        I A+    TW +   P +   +GCKWV+K+K  SDG  +R KARLVAKG+ Q  GIDF ETFSPV K  ++++IL +     + + QLDI+NAFLNG L+
Subjt:  IQALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQSDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLE

Query:  EDVFMKQPDGYV----DPSCPTFVCKLNKALYGLKQAPRAWYDRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLN
        E+++MK P GY     D   P  VC L K++YGLKQA R W+ +   TL+ +GF+ S +D + F    +T+ + VL+YV DI+I  +    +++  S+L 
Subjt:  EDVFMKQPDGYV----DPSCPTFVCKLNKALYGLKQAPRAWYDRLRSTLLEWGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLN

Query:  DIFALKDMGLLSYYLGVEVFRTA------------------------------------DAFDGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQF
          F L+D+G L Y+LG+E+ R+A                                     A  G    +   YR ++G L YL  TR DISF VN+LSQF
Subjt:  DIFALKDMGLLSYYLGVEVFRTA------------------------------------DAFDGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRLSQF

Query:  LQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQ
         + P   H   V +IL Y+KGT+  GLF    + + L  F+DA +    D RRS   +C+FLG SL+SW SKKQ VV++SSAE+EYR+L+  T E++W+ 
Subjt:  LQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQ

Query:  SLLTELSLSSASTPIIWCDNTSAISLARNPVLHSRTKHVELDIHFVRDK-VLEKKIDVRYVPSSEQLADGLTKPLS
            EL L  +   +++CDNT+AI +A N V H RTKH+E D H VR++ V +  +   +    EQ  DG T+ LS
Subjt:  SLLTELSLSSASTPIIWCDNTSAISLARNPVLHSRTKHVELDIHFVRDK-VLEKKIDVRYVPSSEQLADGLTKPLS

ATMG00240.1 Gag-Pol-related retrotransposon family protein4.6e-1448.72Show/hide
Query:  YLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHC
        YLT TRPD++F VNRLSQF           V ++L Y+KGT+  GLF   +S L L AF D+DWA   D RRS+   C
Subjt:  YLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHC

ATMG00810.1 DNA/RNA polymerases superfamily protein1.7e-4039.01Show/hide
Query:  VLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGV------------------EVFRTADAFDGDPMS-----------------NPTLYRS
        +L+YV DIL+TG  +  +N  + +L+  F++KD+G + Y+LG+                  ++   A   D  PMS                 +P+ +RS
Subjt:  VLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGV------------------EVFRTADAFDGDPMS-----------------NPTLYRS

Query:  VLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQA
        ++GALQYLT TRPDIS+ VN + Q + +PT   +  +KR+LRY+KGT+ +GL++ ++S L++ AF D+DWA     RRS    C FLG +++SWS+K+Q 
Subjt:  VLGALQYLTHTRPDISFIVNRLSQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQA

Query:  VVARSSAESEYRSLAHTTAEVLW
         V+RSS E+EYR+LA T AE+ W
Subjt:  VVARSSAESEYRSLAHTTAEVLW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.4e-2361.18Show/hide
Query:  WIQRIQ----ALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQSDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTM
        W Q +Q    AL +N TW LVPPP + N++GCKWVFK K  SDG  DR KARLVAKGFHQ  GI F ET+SPVV+  TIR IL +
Subjt:  WIQRIQ----ALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQSDGCFDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGATGGAAATGTGTCTTCCTCCTCTGCTGCAGCTATTTCTACTCCAAGGTCTATTTCTACTCCAAGTTCGTCAGCTGCAAATATCGTGAGTTCTTCTTTTGGTCA
TCCGCTTAGCACAGTTCTCACAGTGAAGCTTGATGAGAACAATTACTTGCTATGGAGAGGTATGGTGCTTGCTATTCTTCGAGGTCAGAATGTTGATGGGTATGTGTTAG
GTACGAAAGCTCAGCCGTCAGAGTTCAAAGAGGTTGTCACCGAAACTGGTAAGCAACTTGAACCTAATCCTTCTTACATAGATTGGACAACAGTCGATCAAGCCTTGTTA
GGATGGCTATTTGGGTCCATGTCTCCTGCAATTGCTGCAGATGTGGTGAGTTTTACAACATCTAGAGAAGTGTGGAAGGCTCTAGAGAAGGTCTATGGTGCGACAAGTAA
GGCTAGGGTTAATCAGCTAAGAGGAGTCTTACAAAACACCAAGAAAGGCACAATGAAGATGGTAGAGTTCTTAGCAATAATGAAACAAGCAGCTGAGAATCTGAAATTGG
CGGGTAATCCTATCTCTCTTAGTGATTTAACATCTTATGTGCTTGTTGGTCTTGATTCAGATTATATGCCTATTGTGTGTACGATTGAGGACAAGGACATCAGCACTTGG
CAAGAGCTATCCTCTATTCTCATTACATTTGAAGGTACTCTGGCACGTTATTCTGTTCCCTCAAATGTAAATGAATTGCCGGATCTTGCAGCTCATCTTACTTTCAGTCG
CCAAAATCAGAGTAATTCTGGCAAGCAGTTTAGTTTTGGGAGAGGTAATCAAAACAATCAAGGAGGTAATCAGTTTCAGCAGTCCAATTTTTCGAATAACAGAGGGAATA
GTCGTGGCAGAGGACGCGGTCGAAACAATTTCCAACGGAGTAACAATTCTAAACCGACTTGTCAGTTGTGTGGAAAATATGGTCACTCTGCACCATACTGCTACCAACGT
TTTGAGGAAACTTTCAACAACCCGCATGCTGCAAGCAGTTCTAACAACCAGGGCTACCAAGGAATCGCTGATGGTGGTAATTTTGCTACAAAATCTAAGTATACTGGTAA
CAAATGTTTAACTGTTGGAAATGGCAATAAGCTTGATATTGAACATACTGGTAGTAGCTCTGTTGCTTGTGGCAACAAGACTTTAAAGCTTAATAATATGCTTCATGTGC
CTCACATTAAGAGAAATTTAGTAAGCATAGCTCGCCTAACTGCTTATAATGATGTTTATGTTCAGTTTAACTCAGGTTTTTGTTTTGTGAAGGACAAAACTTCAAGGAGG
GTGATGCTGCGCGGAACACTTAAAGATGGTCTTTATCAGCTTGAGCTCCCCTCAATTCAAAAGTCTCCTGTTAGTCCAGCTTCCTTTCATGTTGCTGTTTCTCATTCCAA
TAACTCATGTTGTCCGAGTCTATCTAGTCCTAAGTCTGTTGTAGAGTCTAAGCTGTTGTGGCATCGTCGTCTTGGACATGCGTCTGAGAAAATTGTTGATGCTGTGATCC
GTTCTTGTAATTTGAAGTTTAATCTTAATGACAAAGTACCTTTTTGTGATGCTTGTCAGCTTGGAAAGTCTCACCGGTTGCCTTTTTCTCATTCTAATTTTCGCACACAC
CACCCTCTAGAACTTGTGTATTGTGATCTTTGGGGTCCCTCTCCTGTTGCCTCCACCTCCGGTTTTCGTTTTTACATTAGTTTTGTTGATAATTTCACCAGGCTAACTCA
CATATTTCCCCTCAAACATAAAGGTGAAGCTCTTTCGGCCTTTCACCAATATAAGGCCCTTGTTGAAAACCGTTTTGAAAAATCCATTAAAGTTTTACAAACTGATTGGG
GTGGGGAGTTTCGTTCGTTTACTTCACCCTTAAAGCTTTTTGGGATACAATTTCGTCACTCATGTCCCCACACTAGTCAACAAAACGGTATTGTAGAGCGTAAGCATAGA
CATATTGTTGAAATGGGATTGTCTATGCTTGCTCAATCCTCTTTATCCTTGCAATATTGGTGGGAGGCTTTCTCTTCAGCCGTTTTTCTCATCAATAGATTACCTTCACC
TGTTCTTCACAACCTCTCTCCGTGGGAAAAAGCCTTTAATCAAGTTCCTGATTTTTCCTTTCTCCGAACCTTTGGGTGTGCCTGCTTCCCTTGTCTTCGTCCTTATCAGT
CTCACAAATTCCAATTCCATAGCTCTAAATGTGTGTTTTTGGGCTATAGTTTCAATCACAAGGGCTATAAATGTTTAAGTTCAACCGGTCGCATCTATATCTCCCGTCAT
GTTGTTTTTAATGAAACAGAATTTCCTTTCCAACATGGTTTCGTGTCATCCTCTTCCTCTGCTGTCCAGCCCTCGAATATTCTTGTTCATTGTTTACCTTTTCCTCATGC
TTCAAATATTTCCTCTTCTTTACCTTCTCCTGCCCCGGCTGCTGCCACTCAAGCTGATATTGATCGCTCTTCTGTGCCCTGTGCTCCTTCACCAGCCGTGGATGTTTTGC
CTAATCATTTCAATGACTTGGAGCTATCCTCTGCTGCCTTGGATACTGCTGTGGTTCCCTCTCAGGAGTTGTCTTCCTCCTCTCCGCAGTCAGTTGATGCTTCTGTCTCT
GTTTCGTCAGCTTCTTTACCATCCTCTCCTCATGTGGATCAATCAGCTCCTGCCATTGGAACACATCCAATGCAAACTCGTGCAAATAGTGGAATATTTCGGCCCAAGGA
GTGGGGTGCTTTTCTTTCCTCGGTTGACATGATCCTCCTTCTTTTGAGCCCACATCAGTTAAGGAGGCACTTCAGTCCTCTCATTGGAAGGGTGCGATGGATTCAGAGAA
TACAAGCCTTACGGAAAAATGATACATGGTCTCTTGTGCCACCTCCCTCTCATGTCAATCTTGTTGGATGTAAGTGGGTATTTAAAGTTAAAAGACAGTCTGACGGTTGC
TTTGACAGGTGCAAGGCACGATTGGTAGCTAAGGGGTTTCATCAAATCCCTGGGATTGATTTCCATGAAACCTTCAGCCCGGTTGTCAAGGCTCCAACAATTCGTATTAT
TCTAACCATGGTTGTTGCTCGTGGCTGGCCTGTTCGTCAGTTAGATATCAATAATGCGTTTCTTAATGGCCGTTTGGAAGAGGATGTGTTCATGAAGCAACCGGATGGCT
ATGTTGATCCGTCTTGTCCCACGTTTGTTTGTAAACTTAATAAGGCTCTCTATGGTCTGAAGCAAGCCCCTCGTGCGTGGTATGATAGGTTACGGTCTACACTGTTGGAG
TGGGGTTTCCTAAATTCCAAAGCTGATAGCTCAGTATTTTTCTTCTCCAACTCTACCATGATTGTCATAGTTCTTATATATGTTCATGACATCTTAATTACTGGTGATTG
CTCTGAGTTTATCAATCAATTTGTGTCTCGATTGAATGACATTTTTGCTTTAAAAGACATGGGGTTGCTTAGTTATTATCTGGGTGTTGAGGTTTTTCGAACAGCTGATG
CCTTTGATGGTGATCCTATGTCTAACCCGACATTGTATCGTAGTGTTCTTGGGGCATTGCAATACCTTACCCATACTCGTCCTGACATTTCTTTTATTGTCAATAGATTA
AGTCAATTTTTGCAGCAGCCGACTGAGGTGCATTGGAGTGGTGTCAAAAGGATTCTGCGGTACCTTAAAGGAACAATGGAGTATGGCTTATTTCTACCTCGCTCCTCTTC
CTTGTCGCTATTTGCGTTCACTGATGCGGATTGGGCATGTAGCCTTGATGATAGGCGTTCTATTGCTGCTCACTGTGTTTTTCTGGGTGGCTCTCTTGTCTCATGGTCTT
CGAAAAAGCAAGCTGTTGTTGCGCGGTCGAGTGCTGAATCTGAGTATAGGTCTCTTGCTCACACTACAGCTGAGGTATTATGGATTCAATCTCTTCTTACTGAGTTGTCT
CTGTCGTCTGCTTCTACACCAATTATCTGGTGTGACAACACTAGTGCCATCTCGTTGGCTCGAAATCCTGTCCTCCACTCTCGCACCAAACACGTGGAACTTGATATTCA
CTTTGTCCGTGATAAGGTCCTTGAGAAGAAAATTGATGTTCGCTATGTTCCCAGTTCAGAACAGCTCGCCGATGGTCTTACTAAGCCTCTGTCAGAATCTTCCTTTTCTT
CTATGCGATCCAAACTTGGTGTGATGTTATCACCCTCTCGTTTGAGGGGGGATGTTAGGACTAAAGTCTACGACGCCGTTTTAGAGGATTTTACTAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGATGGAAATGTGTCTTCCTCCTCTGCTGCAGCTATTTCTACTCCAAGGTCTATTTCTACTCCAAGTTCGTCAGCTGCAAATATCGTGAGTTCTTCTTTTGGTCA
TCCGCTTAGCACAGTTCTCACAGTGAAGCTTGATGAGAACAATTACTTGCTATGGAGAGGTATGGTGCTTGCTATTCTTCGAGGTCAGAATGTTGATGGGTATGTGTTAG
GTACGAAAGCTCAGCCGTCAGAGTTCAAAGAGGTTGTCACCGAAACTGGTAAGCAACTTGAACCTAATCCTTCTTACATAGATTGGACAACAGTCGATCAAGCCTTGTTA
GGATGGCTATTTGGGTCCATGTCTCCTGCAATTGCTGCAGATGTGGTGAGTTTTACAACATCTAGAGAAGTGTGGAAGGCTCTAGAGAAGGTCTATGGTGCGACAAGTAA
GGCTAGGGTTAATCAGCTAAGAGGAGTCTTACAAAACACCAAGAAAGGCACAATGAAGATGGTAGAGTTCTTAGCAATAATGAAACAAGCAGCTGAGAATCTGAAATTGG
CGGGTAATCCTATCTCTCTTAGTGATTTAACATCTTATGTGCTTGTTGGTCTTGATTCAGATTATATGCCTATTGTGTGTACGATTGAGGACAAGGACATCAGCACTTGG
CAAGAGCTATCCTCTATTCTCATTACATTTGAAGGTACTCTGGCACGTTATTCTGTTCCCTCAAATGTAAATGAATTGCCGGATCTTGCAGCTCATCTTACTTTCAGTCG
CCAAAATCAGAGTAATTCTGGCAAGCAGTTTAGTTTTGGGAGAGGTAATCAAAACAATCAAGGAGGTAATCAGTTTCAGCAGTCCAATTTTTCGAATAACAGAGGGAATA
GTCGTGGCAGAGGACGCGGTCGAAACAATTTCCAACGGAGTAACAATTCTAAACCGACTTGTCAGTTGTGTGGAAAATATGGTCACTCTGCACCATACTGCTACCAACGT
TTTGAGGAAACTTTCAACAACCCGCATGCTGCAAGCAGTTCTAACAACCAGGGCTACCAAGGAATCGCTGATGGTGGTAATTTTGCTACAAAATCTAAGTATACTGGTAA
CAAATGTTTAACTGTTGGAAATGGCAATAAGCTTGATATTGAACATACTGGTAGTAGCTCTGTTGCTTGTGGCAACAAGACTTTAAAGCTTAATAATATGCTTCATGTGC
CTCACATTAAGAGAAATTTAGTAAGCATAGCTCGCCTAACTGCTTATAATGATGTTTATGTTCAGTTTAACTCAGGTTTTTGTTTTGTGAAGGACAAAACTTCAAGGAGG
GTGATGCTGCGCGGAACACTTAAAGATGGTCTTTATCAGCTTGAGCTCCCCTCAATTCAAAAGTCTCCTGTTAGTCCAGCTTCCTTTCATGTTGCTGTTTCTCATTCCAA
TAACTCATGTTGTCCGAGTCTATCTAGTCCTAAGTCTGTTGTAGAGTCTAAGCTGTTGTGGCATCGTCGTCTTGGACATGCGTCTGAGAAAATTGTTGATGCTGTGATCC
GTTCTTGTAATTTGAAGTTTAATCTTAATGACAAAGTACCTTTTTGTGATGCTTGTCAGCTTGGAAAGTCTCACCGGTTGCCTTTTTCTCATTCTAATTTTCGCACACAC
CACCCTCTAGAACTTGTGTATTGTGATCTTTGGGGTCCCTCTCCTGTTGCCTCCACCTCCGGTTTTCGTTTTTACATTAGTTTTGTTGATAATTTCACCAGGCTAACTCA
CATATTTCCCCTCAAACATAAAGGTGAAGCTCTTTCGGCCTTTCACCAATATAAGGCCCTTGTTGAAAACCGTTTTGAAAAATCCATTAAAGTTTTACAAACTGATTGGG
GTGGGGAGTTTCGTTCGTTTACTTCACCCTTAAAGCTTTTTGGGATACAATTTCGTCACTCATGTCCCCACACTAGTCAACAAAACGGTATTGTAGAGCGTAAGCATAGA
CATATTGTTGAAATGGGATTGTCTATGCTTGCTCAATCCTCTTTATCCTTGCAATATTGGTGGGAGGCTTTCTCTTCAGCCGTTTTTCTCATCAATAGATTACCTTCACC
TGTTCTTCACAACCTCTCTCCGTGGGAAAAAGCCTTTAATCAAGTTCCTGATTTTTCCTTTCTCCGAACCTTTGGGTGTGCCTGCTTCCCTTGTCTTCGTCCTTATCAGT
CTCACAAATTCCAATTCCATAGCTCTAAATGTGTGTTTTTGGGCTATAGTTTCAATCACAAGGGCTATAAATGTTTAAGTTCAACCGGTCGCATCTATATCTCCCGTCAT
GTTGTTTTTAATGAAACAGAATTTCCTTTCCAACATGGTTTCGTGTCATCCTCTTCCTCTGCTGTCCAGCCCTCGAATATTCTTGTTCATTGTTTACCTTTTCCTCATGC
TTCAAATATTTCCTCTTCTTTACCTTCTCCTGCCCCGGCTGCTGCCACTCAAGCTGATATTGATCGCTCTTCTGTGCCCTGTGCTCCTTCACCAGCCGTGGATGTTTTGC
CTAATCATTTCAATGACTTGGAGCTATCCTCTGCTGCCTTGGATACTGCTGTGGTTCCCTCTCAGGAGTTGTCTTCCTCCTCTCCGCAGTCAGTTGATGCTTCTGTCTCT
GTTTCGTCAGCTTCTTTACCATCCTCTCCTCATGTGGATCAATCAGCTCCTGCCATTGGAACACATCCAATGCAAACTCGTGCAAATAGTGGAATATTTCGGCCCAAGGA
GTGGGGTGCTTTTCTTTCCTCGGTTGACATGATCCTCCTTCTTTTGAGCCCACATCAGTTAAGGAGGCACTTCAGTCCTCTCATTGGAAGGGTGCGATGGATTCAGAGAA
TACAAGCCTTACGGAAAAATGATACATGGTCTCTTGTGCCACCTCCCTCTCATGTCAATCTTGTTGGATGTAAGTGGGTATTTAAAGTTAAAAGACAGTCTGACGGTTGC
TTTGACAGGTGCAAGGCACGATTGGTAGCTAAGGGGTTTCATCAAATCCCTGGGATTGATTTCCATGAAACCTTCAGCCCGGTTGTCAAGGCTCCAACAATTCGTATTAT
TCTAACCATGGTTGTTGCTCGTGGCTGGCCTGTTCGTCAGTTAGATATCAATAATGCGTTTCTTAATGGCCGTTTGGAAGAGGATGTGTTCATGAAGCAACCGGATGGCT
ATGTTGATCCGTCTTGTCCCACGTTTGTTTGTAAACTTAATAAGGCTCTCTATGGTCTGAAGCAAGCCCCTCGTGCGTGGTATGATAGGTTACGGTCTACACTGTTGGAG
TGGGGTTTCCTAAATTCCAAAGCTGATAGCTCAGTATTTTTCTTCTCCAACTCTACCATGATTGTCATAGTTCTTATATATGTTCATGACATCTTAATTACTGGTGATTG
CTCTGAGTTTATCAATCAATTTGTGTCTCGATTGAATGACATTTTTGCTTTAAAAGACATGGGGTTGCTTAGTTATTATCTGGGTGTTGAGGTTTTTCGAACAGCTGATG
CCTTTGATGGTGATCCTATGTCTAACCCGACATTGTATCGTAGTGTTCTTGGGGCATTGCAATACCTTACCCATACTCGTCCTGACATTTCTTTTATTGTCAATAGATTA
AGTCAATTTTTGCAGCAGCCGACTGAGGTGCATTGGAGTGGTGTCAAAAGGATTCTGCGGTACCTTAAAGGAACAATGGAGTATGGCTTATTTCTACCTCGCTCCTCTTC
CTTGTCGCTATTTGCGTTCACTGATGCGGATTGGGCATGTAGCCTTGATGATAGGCGTTCTATTGCTGCTCACTGTGTTTTTCTGGGTGGCTCTCTTGTCTCATGGTCTT
CGAAAAAGCAAGCTGTTGTTGCGCGGTCGAGTGCTGAATCTGAGTATAGGTCTCTTGCTCACACTACAGCTGAGGTATTATGGATTCAATCTCTTCTTACTGAGTTGTCT
CTGTCGTCTGCTTCTACACCAATTATCTGGTGTGACAACACTAGTGCCATCTCGTTGGCTCGAAATCCTGTCCTCCACTCTCGCACCAAACACGTGGAACTTGATATTCA
CTTTGTCCGTGATAAGGTCCTTGAGAAGAAAATTGATGTTCGCTATGTTCCCAGTTCAGAACAGCTCGCCGATGGTCTTACTAAGCCTCTGTCAGAATCTTCCTTTTCTT
CTATGCGATCCAAACTTGGTGTGATGTTATCACCCTCTCGTTTGAGGGGGGATGTTAGGACTAAAGTCTACGACGCCGTTTTAGAGGATTTTACTAGCTGA
Protein sequenceShow/hide protein sequence
MGDGNVSSSSAAAISTPRSISTPSSSAANIVSSSFGHPLSTVLTVKLDENNYLLWRGMVLAILRGQNVDGYVLGTKAQPSEFKEVVTETGKQLEPNPSYIDWTTVDQALL
GWLFGSMSPAIAADVVSFTTSREVWKALEKVYGATSKARVNQLRGVLQNTKKGTMKMVEFLAIMKQAAENLKLAGNPISLSDLTSYVLVGLDSDYMPIVCTIEDKDISTW
QELSSILITFEGTLARYSVPSNVNELPDLAAHLTFSRQNQSNSGKQFSFGRGNQNNQGGNQFQQSNFSNNRGNSRGRGRGRNNFQRSNNSKPTCQLCGKYGHSAPYCYQR
FEETFNNPHAASSSNNQGYQGIADGGNFATKSKYTGNKCLTVGNGNKLDIEHTGSSSVACGNKTLKLNNMLHVPHIKRNLVSIARLTAYNDVYVQFNSGFCFVKDKTSRR
VMLRGTLKDGLYQLELPSIQKSPVSPASFHVAVSHSNNSCCPSLSSPKSVVESKLLWHRRLGHASEKIVDAVIRSCNLKFNLNDKVPFCDACQLGKSHRLPFSHSNFRTH
HPLELVYCDLWGPSPVASTSGFRFYISFVDNFTRLTHIFPLKHKGEALSAFHQYKALVENRFEKSIKVLQTDWGGEFRSFTSPLKLFGIQFRHSCPHTSQQNGIVERKHR
HIVEMGLSMLAQSSLSLQYWWEAFSSAVFLINRLPSPVLHNLSPWEKAFNQVPDFSFLRTFGCACFPCLRPYQSHKFQFHSSKCVFLGYSFNHKGYKCLSSTGRIYISRH
VVFNETEFPFQHGFVSSSSSAVQPSNILVHCLPFPHASNISSSLPSPAPAAATQADIDRSSVPCAPSPAVDVLPNHFNDLELSSAALDTAVVPSQELSSSSPQSVDASVS
VSSASLPSSPHVDQSAPAIGTHPMQTRANSGIFRPKEWGAFLSSVDMILLLLSPHQLRRHFSPLIGRVRWIQRIQALRKNDTWSLVPPPSHVNLVGCKWVFKVKRQSDGC
FDRCKARLVAKGFHQIPGIDFHETFSPVVKAPTIRIILTMVVARGWPVRQLDINNAFLNGRLEEDVFMKQPDGYVDPSCPTFVCKLNKALYGLKQAPRAWYDRLRSTLLE
WGFLNSKADSSVFFFSNSTMIVIVLIYVHDILITGDCSEFINQFVSRLNDIFALKDMGLLSYYLGVEVFRTADAFDGDPMSNPTLYRSVLGALQYLTHTRPDISFIVNRL
SQFLQQPTEVHWSGVKRILRYLKGTMEYGLFLPRSSSLSLFAFTDADWACSLDDRRSIAAHCVFLGGSLVSWSSKKQAVVARSSAESEYRSLAHTTAEVLWIQSLLTELS
LSSASTPIIWCDNTSAISLARNPVLHSRTKHVELDIHFVRDKVLEKKIDVRYVPSSEQLADGLTKPLSESSFSSMRSKLGVMLSPSRLRGDVRTKVYDAVLEDFTS