; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040782 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040782
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr13:8201344..8210206
RNA-Seq ExpressionLag0040782
SyntenyLag0040782
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN81099.1 hypothetical protein VITISV_017741 [Vitis vinifera]0.0e+0044.57Show/hide
Query:  SPSESLSSSQPMKAINPGSKTNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHI----PPKTTRSGDTSSATLPNPEYETWIRQ----DNLITSWFL
        +P+ +  SS     IN     + VKLD +NFL WK Q+ + +R +GL  F+  D  +     P+  R            + E  +R     +N I    L
Subjt:  SPSESLSSSQPMKAINPGSKTNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHI----PPKTTRSGDTSSATLPNPEYETWIRQ----DNLITSWFL

Query:  GSMSNEILSEVLDCETAREIWKILNDRFSSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITE
        G  S+  LS+             L   F+S+  A+  Q K++L+  KKG  ++ +Y  K+K  VDSLA+    +S+ DHV  IL GL ++Y+  V+ +  
Subjt:  GSMSNEILSEVLDCETAREIWKILNDRFSSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITE

Query:  KDDTPPLQKVYSLLLTQESRIERNSSINSDGFLPSVNYTALPSSKPQQQSSTD----NTQ-----------------KRGGQV--RGSNGNNNG------
        ++D   ++++ +LL+  ESR+E+N+  NS    PS +  +  + +   +   D    N+Q                 +RGG    RG N N NG      
Subjt:  KDDTPPLQKVYSLLLTQESRIERNSSINSDGFLPSVNYTALPSSKPQQQSSTD----NTQ-----------------KRGGQV--RGSNGNNNG------

Query:  --GRSN------RRNWN--NNGGKPQCQLCGRFGHTVLRCYMRFERWYQGPNNNNSQTPSSPNFPQNQFPSSSGPGNCSSSQQPFTAYSLQHDLNKENQW
          GR N      R  WN  N   KP CQLCG+ GH V +CY RF+  +Q P N +S+  SSP    +  P  +G    S             ++  ++ W
Subjt:  --GRSN------RRNWN--NNGGKPQCQLCGRFGHTVLRCYMRFERWYQGPNNNNSQTPSSPNFPQNQFPSSSGPGNCSSSQQPFTAYSLQHDLNKENQW

Query:  YPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLST
        YPDSGASNHVT +  NL   +E+   N+VHVGN TGL + HIG S   S      +  L +LLHVP+ITKNL+SVS+FA DN+VFFEFH    FVKD  T
Subjt:  YPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLST

Query:  GRILLQGKVTDGLYQF-----SLEKAESLTPKGG-CSSSFQPQVLHTSLSSVSNVVSDIPSTCNTHTLELWHKRLGHTSLPTVKRILQTCKIPYSDNKMM
          +L+ GKV DGLY F     +L   +SL+      +SSF  +V  TSLSS               T +LWHKRLGH S  T+K +L  C + + +    
Subjt:  GRILLQGKVTDGLYQF-----SLEKAESLTPKGG-CSSSFQPQVLHTSLSSVSNVVSDIPSTCNTHTLELWHKRLGHTSLPTVKRILQTCKIPYSDNKMM

Query:  SLWHACAIGKSHKLPFPTSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGE
        +   +C +GK H+ PF  S T YT PL+L+  D+WGP    S + +RYYI FVD +SRF+WI+ L +K+EA + F+ FKT VE   +  I   QTD GGE
Subjt:  SLWHACAIGKSHKLPFPTSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGE

Query:  FRLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCY
        FR F SYL EN I HR+SCP+T QQNG+ ERKHR I++ G TLL  ASL L+FWD++F T  YL+NRLP+ +  +  P+EVLF   P+YSF K FGC C+
Subjt:  FRLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCY

Query:  PLLRPYNKHKIEPRSTPCLFLGYSTTHKGYKCLSFSGRLFISRHVQFDETKFPAIVPSLFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSA
        P LRPYN HK++ RS  C FLGYS  HKGYKC+S +GR++IS  V F+ET FP                 S T     V S  +S VS  P+ SH +PSA
Subjt:  PLLRPYNKHKIEPRSTPCLFLGYSTTHKGYKCLSFSGRLFISRHVQFDETKFPAIVPSLFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSA

Query:  NPSPVTALSIPSQPQPCSSA-PLS-MDNTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSHPSTSDNNHSSQPLSTSNTVLVPTSAAVANSHPMV
        +P  ++   +P+   P SSA P+S MDN    ++ H + P+          + D +L+ + Q VS+P  +   H    ++ ++        A  N+HPM+
Subjt:  NPSPVTALSIPSQPQPCSSA-PLS-MDNTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSHPSTSDNNHSSQPLSTSNTVLVPTSAAVANSHPMV

Query:  TRAKSGIFKPRVLLTEYTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYT
        TRAKSGI KP++ +      EP +V  AL+   W  AM  E+ AL +N+TW+LV  P  ++ IGCKWV+K K N DG+V +YKARLVAKGFHQ+A  D+T
Subjt:  TRAKSGIFKPRVLLTEYTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYT

Query:  ETFSPVVKPATIRVIFTMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSL
        ETFSPVVKP+T+RV+FT+AL+  W ++Q+D+NNAFL+G L EEVFM QP G  D  NP++VC+L KALYGLKQAPRAWF++L   L S G VS+K+D SL
Subjt:  ETFSPVVKPATIRVIFTMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSL

Query:  LVCHHNGTCCYILVYVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAY
         +        Y+LVYVDDI++ G+    I  LI  L+ +FSLKDLG + YFLG++VS+  + G+ LSQTKYI D+L KTKM    P  TP+ +G  L   
Subjt:  LVCHHNGTCCYILVYVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAY

Query:  HGEPFSDTRLYRSTVGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYL
         G+P  D   YRSTVGALQY T+TRPE+S+SVNKVCQFM  PT+ HW+ VKRILRYL GT+ HGL  +K S+  L GF DADWASD DDR+STS  CV+L
Subjt:  HGEPFSDTRLYRSTVGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYL

Query:  GGNLITWSSKKQHIISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPT
        G NLI+W SKKQHI+SRSS E EYRSLA + A++ WL+SL  +L+L +++PP++WCDNL  V  SANPVLH+RTKH+E+D+YFVR+ V++K + V H+P+
Subjt:  GGNLITWSSKKQHIISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPT

Query:  TAQIADIFTKPLSSSSFLPLRSKLNVCSSNTIGLRGVLR
          Q+AD+ TK +SS+ F+  R KL + + +T+ LRG +R
Subjt:  TAQIADIFTKPLSSSSFLPLRSKLNVCSSNTIGLRGVLR

KHN22040.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja]0.0e+0043.95Show/hide
Query:  VKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCETAREIWKILNDRFSSR
        +KLDE+NFL W  QV  V+  H L  F+  +  IP +     D +   + + EY+ W+ +D  + +W L ++S+ +L  VL C  A E+W  ++  F+S 
Subjt:  VKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCETAREIWKILNDRFSSR

Query:  NLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTPPLQKVYSLLLTQESRIER-NSSINSD
          +R  Q++S L+  KK + S+ +Y +++K++V+SL A    +S  + V  IL+GL  E++  V ++  + DTP ++ V +LLL QE++ E+    + S 
Subjt:  NLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTPPLQKVYSLLLTQESRIER-NSSINSD

Query:  GFLPSVNYTALPSSKPQQQSSTDNTQKRGGQVRGSNGNNNG---GRSNRRNWNNNGGKPQCQLCGRFGHTVLRCYMRFERWYQGPNNNNSQTPSSPNFPQ
            +V +T   +S    +  +         V  + G   G   GR   +    N GK +CQ+C +  H  + C+ R++      N+         N PQ
Subjt:  GFLPSVNYTALPSSKPQQQSSTDNTQKRGGQVRGSNGNNNG---GRSNRRNWNNNGGKPQCQLCGRFGHTVLRCYMRFERWYQGPNNNNSQTPSSPNFPQ

Query:  NQFPSSSGPGNCSSSQQPFTAYSLQHDLNKENQWYPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVP
        N F     P    +  QP+   ++  D      WYPDSGAS+H+T + +NL+  S YT  ++V +GN  G+ +  +G S   S + PN    LK+LLHVP
Subjt:  NQFPSSSGPGNCSSSQQPFTAYSLQHDLNKENQWYPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVP

Query:  TITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKV-TDGLYQFSLEKAESLTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTC----NTHT
         I+KNL+SVS+FA DN V FEFHP H FVK   + ++LL+G V  DGLYQF  +  + LT  G  S+S    +  +S  SV N   +  +T     N + 
Subjt:  TITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKV-TDGLYQFSLEKAESLTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTC----NTHT

Query:  LELWHKRLGHTSLPTVKRILQTCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNS
         ++WH RLGH     VK IL  C IP+S+         C +GKSH+L  P S+TVYT P +++  D+WGPA   S   + YYI+FVD Y++FTWIYFL +
Subjt:  LELWHKRLGHTSLPTVKRILQTCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNS

Query:  KNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNR
        K++A +AF  FK  ++     SI   Q+D GGEFR FT+ L E  I+HRL+CP+TS QNG VERKHR I++MG TLLS A L L+FWD +F+ A YL NR
Subjt:  KNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNR

Query:  LP-SIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTHKGYKCLSFSGRLFISRHVQFDETKFPAIVPSLFSSNNSN
        LP S +    SP   L   QP+YS  K FGCLC+P LRPYN HK++ +S+PC++LG S  HKG+KCL  +GR++IS+ V F E++FP +  SLF+S++SN
Subjt:  LP-SIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTHKGYKCLSFSGRLFISRHVQFDETKFPAIVPSLFSSNNSN

Query:  LEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSMD--NTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSH
            S T    I+  +  ++ +  P  S    +  P PV+ L  P    P SS PL++   N+     +    PS +P+P +   +P  ++  ++Q++SH
Subjt:  LEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSMD--NTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSH

Query:  PSTSDNNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTEYTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCK
                                    N+HPM+TRAK+G  KP+V +     +EP +VK AL    W+ AM+ E++AL+ N TWTLV  P  +K IGCK
Subjt:  PSTSDNNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTEYTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCK

Query:  WVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFTMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPHMVCKLKK
        W+F++K N DG+V ++KARLVAKGF Q A  D+TETFSPV+KP TIR+I T+A+T+ W +QQ+D+NNAFL+G L EEV+MTQP G  ++++  +VCKL K
Subjt:  WVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFTMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPHMVCKLKK

Query:  ALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILVYVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFL
        +LYGLKQAPRAW+DRL+  L  +G V SK D SLL+   NG C Y+L+YVDDI+ITG++P +I  LI+ L+ KF+LK LG + YFLG+EV + PSG + L
Subjt:  ALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILVYVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFL

Query:  SQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRSTVGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLF
        +QTKYI D+L K  M ++ PI +PM+S   LS Y  +  +D  LYRS VGALQYATLTRP+I++SVNKVCQFM  P + HW+AVKRILRYL GT+ HGL 
Subjt:  SQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRSTVGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLF

Query:  FR-KPSS--FCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHIISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVH
            P+S  F L  ++DADWA+D DDR+STS  C+Y G NLI+W SKKQ +++RSSTEAEYRS+A+ +ADL+W+QSL ++L++    P +L CDNL AV 
Subjt:  FR-KPSS--FCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHIISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVH

Query:  FSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSSSSFLPLRSKLNV
         S NP+LHS+TKH+E+DI+FVR+ VL K+L + H+P   Q+AD  TKPLS S    +R+KL V
Subjt:  FSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSSSSFLPLRSKLNV

KHN36156.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja]0.0e+0044.02Show/hide
Query:  VKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCETAREIWKILNDRFSSR
        +KLDE+NFL W  QV  V+  H L  F+  +  IP +     D +   + + EY+ W+ +D  + +W L ++S+ +L  VL C  A E+W  ++  F+S 
Subjt:  VKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCETAREIWKILNDRFSSR

Query:  NLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTPPLQKVYSLLLTQESRIER-NSSINSD
          +R  Q++S L+  KK + S+ +Y +++K++V+SL A    +S  + V  IL+GL  E++  V ++  + DTP ++ V +LLL QE++ E+    + S 
Subjt:  NLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTPPLQKVYSLLLTQESRIER-NSSINSD

Query:  GFLPSVNYTALPSSKPQQQSSTDNTQKRGGQVRGSNGNNNG---GRSNRRNWNNNGGKPQCQLCGRFGHTVLRCYMRFERWYQGPNNNNSQTPSSPNFPQ
            +V +T   +S    +  +         V  + G   G   GR   +    N GK +CQ+C +  H  + C+ R++      N+         N PQ
Subjt:  GFLPSVNYTALPSSKPQQQSSTDNTQKRGGQVRGSNGNNNG---GRSNRRNWNNNGGKPQCQLCGRFGHTVLRCYMRFERWYQGPNNNNSQTPSSPNFPQ

Query:  NQFPSSSGPGNCSSSQQPFTAYSLQHDLNKENQWYPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVP
        N F     P    +  QP+   ++  D      WYPDSGAS+H+T + +NL+  S YT  ++V +GN  G+ +  +G S   S + PN    LK+LLHVP
Subjt:  NQFPSSSGPGNCSSSQQPFTAYSLQHDLNKENQWYPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVP

Query:  TITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKV-TDGLYQFSLEKAESLTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTC----NTHT
         I+KNL+SVS+FA DN V FEFHP H FVK   + ++LL+G V  DGLYQF  +  + LT  G  S+S    +  +S  SV N   +  +T     N + 
Subjt:  TITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKV-TDGLYQFSLEKAESLTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTC----NTHT

Query:  LELWHKRLGHTSLPTVKRILQTCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNS
         ++WH RLGH     VK IL  C IP+S+         C +GKSH+L  P S+TVYT P +++  D+WGPA   S   + YYI+FVD Y++FTWIYFL +
Subjt:  LELWHKRLGHTSLPTVKRILQTCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNS

Query:  KNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNR
        K++A +AF  FK  ++     SI   Q+D GGEFR FT+ L E  I+HRL+CP+TS QNG VERKHR I++MG TLLS A+L L+FWD +F+ A YL NR
Subjt:  KNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNR

Query:  LP-SIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTHKGYKCLSFSGRLFISRHVQFDETKFPAIVPSLFSSNNSN
        LP S +    SP   L   QP+YS  K FGCLC+P LRPYN HK++ RS+PC++LG S  HKG+KCL  +GR++IS+ V F E++FP +  SLF+S++SN
Subjt:  LP-SIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTHKGYKCLSFSGRLFISRHVQFDETKFPAIVPSLFSSNNSN

Query:  LEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSMD--NTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSH
            S T    I+  +  ++ +  P  S    +  P PV+ L  P    P SS PL++   N+     +    PS +P+P +   +P  ++  ++Q++SH
Subjt:  LEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSMD--NTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSH

Query:  PSTSDNNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTEYTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCK
                                    N+HPM+TRAK+G  KP+V +     +EP +VK AL    W+ AM+ E++AL+ N TWTLV  P  +K IGCK
Subjt:  PSTSDNNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTEYTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCK

Query:  WVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFTMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPHMVCKLKK
        W+F++K N DG+V ++KARLVAKGF Q A  D+TETFSPV+KP TIR+I T+A+T+ W +QQ+D+NNAFL+G L EEV+MTQP G  ++++  +VCKL K
Subjt:  WVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFTMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPHMVCKLKK

Query:  ALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILVYVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFL
        +LYGLKQAPRAW+DRL+  L  +G V SK D SLL+   NG C Y+L+YVDDI+ITG++P +I  LI+ L+ KF+LK LG + YFLG+EV + PSG + L
Subjt:  ALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILVYVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFL

Query:  SQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRSTVGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLF
        +QTKYI D+L K  M ++ PI +PM+S   LS Y  +  +D  LYRS VGALQYATLTRP+I++SVNKVCQFM  P + HW+AVKRILRYL GT+ HGL 
Subjt:  SQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRSTVGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLF

Query:  FR-KPSS--FCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHIISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVH
            P+S  F L  ++DADWA+D DDR+STS  C+Y G NLI+W SKKQ +++RSSTEAEYRS+A+ +ADL+W+QSL ++L++    P +L CDNL AV 
Subjt:  FR-KPSS--FCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHIISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVH

Query:  FSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSSSSFLPLRSKLNV
         S NP+LHS+TKH+E+DI+FVR+ VL K+L + H+P   Q+AD  TKPLS S    +R+KL V
Subjt:  FSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSSSSFLPLRSKLNV

RVW44519.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0046.63Show/hide
Query:  NETSPSESLSSSQPMKAINPGSKTNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSN
        N T+  ESL        I+P S+   ++L+++NFL WK Q+   +R +GL  F+ G   +PPK     D     +PNP++  + RQD+L+ SW L S+ +
Subjt:  NETSPSESLSSSQPMKAINPGSKTNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSN

Query:  EILSEVLDCETAREIWKILNDRFSSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTP
          L +V+ C +A E                               L+++DY  K+KN  D LA A  KIS  DH++ I++GLG EY+  ++VI+ K  +P
Subjt:  EILSEVLDCETAREIWKILNDRFSSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTP

Query:  PLQKVYSLLLTQESRIERNSSINSDGFLPSVNYTALPSSKPQQQSSTDN----------TQKRGGQVRGSNGNNNGGRSNRRNWNNNGGKPQCQLCGRFG
         LQ V S L+  E RI    S N      SVNYT+  S++    S   N           Q  G QV   +  +N GR   R     G KPQCQLC +FG
Subjt:  PLQKVYSLLLTQESRIERNSSINSDGFLPSVNYTALPSSKPQQQSSTDN----------TQKRGGQVRGSNGNNNGGRSNRRNWNNNGGKPQCQLCGRFG

Query:  HTVLRCYMRFERWYQGPNNNNSQTPS--SPNFPQNQFPSSSGPGNCSSSQ------QPFTAYSLQHDLNKENQ---WYPDSGASNHVTSDISNLTVGSEY
        HTV RC+ R++  + G    N  TP             S S  GN + ++      Q ++         ++ Q   W+PDSGA+NHVT D+ NL  G+EY
Subjt:  HTVLRCYMRFERWYQGPNNNNSQTPS--SPNFPQNQFPSSSGPGNCSSSQ------QPFTAYSLQHDLNKENQ---WYPDSGASNHVTSDISNLTVGSEY

Query:  TDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKVTDGLYQFSLEKAES
          ++K+H+GN TGL + HIG S   S S PN V  LKN+L VP I KNL+SVSQFA DN V+FEFHP   FVKD S   +LLQG +  GLYQF+L K   
Subjt:  TDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKVTDGLYQFSLEKAES

Query:  LTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTCNT--HTLELWHKRLGHTSLPTVKRILQTCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPL
            G   S+ + ++   + S V N  SD P   N+  H  +LWHKRLGH +   V ++L   KIP+S     S+  AC +GKSH LPFP S TVYT PL
Subjt:  LTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTCNT--HTLELWHKRLGHTSLPTVKRILQTCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPL

Query:  QLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNG
        QLVV+D+WGPA   S   F YY+SFVD YSR+TW+YFL +K++  +AFL+FK   E    C +  FQTD GGEFR   +Y ++N I HRLSCP+TS+QNG
Subjt:  QLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNG

Query:  IVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTH
        I+ERKHR I+++G TLL+QASL L++W DAFSTA +L NRLP+ V +   P E LF  +PNYS  K FGCLC+P LRPYNKHK++ RS+PC FLGYS+ H
Subjt:  IVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTH

Query:  KGYKCLSFSGRLFISRHVQFDETKFP---AIVPSLFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSM
        KGYKCL+  GR+FISR V FDET+FP    +   +   ++S +  P +    P+VK+  +  +S  P+ S PT SA                 SS  L  
Subjt:  KGYKCLSFSGRLFISRHVQFDETKFP---AIVPSLFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSM

Query:  DNTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSHPSTSD----------NNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTE
        +N  +D+     D S        NT+   ++ I N+S S PS+S+          + +S +P  + NT  V         H MVTR+K+GIFKP+V   +
Subjt:  DNTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSHPSTSD----------NNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTE

Query:  YTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIF
            EP T +EA+    W  AM EE +AL+KN TW+LV+ P ++  +GC+WVFK+KRN DGSV+RYKARLVAKG+ Q    D+ ETFSPVVKP TIRV+ 
Subjt:  YTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIF

Query:  TMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPH--MVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILV
         +A++  W ++Q+D+NNAFL+G L EEV+M QP G    TN    +VCKL KALYGLKQAPRAWFD+L   L   G  S+K+D SL V   N +  ++LV
Subjt:  TMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPH--MVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILV

Query:  YVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRST
        YVDDIV+TG+S   I +LI+ L   FSLKDLG LSYFLG+EV     GG+ LSQ KYI D+L KTKM  A  + TPM+SG  LSA  G+P  +   YRS 
Subjt:  YVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRST

Query:  VGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHI
        VGALQY T+TRPEI++SVNKVCQFM  P   HW+AVKRILRYL GT D G+  +   +  L GF DADW SD DDR+STS  CV+LG +L++WSSKKQH 
Subjt:  VGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHI

Query:  ISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSS
         SRSSTEAEYRSLA ++++++WLQSL  +L+  ++  P++WCDN+  V  SANPVLHSRTKH+E+D+YFVR+ V++++L V H+PT  Q+AD+FTKPLS 
Subjt:  ISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSS

Query:  SSFLPLRSKLNVCS
          F  LR KL V S
Subjt:  SSFLPLRSKLNVCS

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0046.7Show/hide
Query:  NETSPSESLSSSQPMKAINPGSKTNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSN
        N T+  ESL        I+P S+   ++L+++NFL WK Q+   +R +GL  F+ G   +PPK     D     +PNP++  + RQD+L+ SW L S+ +
Subjt:  NETSPSESLSSSQPMKAINPGSKTNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSN

Query:  EILSEVLDCETAREIWKILNDRFSSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTP
          L +V+ C +A E+W  ++  F+S++ A+VM  KS+++ +KK  L+++DY  K+KN  D LA A  KIS  DH++ I++GLG EY+  ++VI+ K  +P
Subjt:  EILSEVLDCETAREIWKILNDRFSSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTP

Query:  PLQKVYSLLLTQESRIERNSSINSDGFLPSVNYTALPSSKPQQQSSTDN----------TQKRGGQVRGSNGNNNGGRSNRRNWNNNGGKPQCQLCGRFG
         LQ V S L+  E RI    S N      SVNYT+  S++    S   N           Q  G QV   +  +N GR   R     G KPQCQLC +FG
Subjt:  PLQKVYSLLLTQESRIERNSSINSDGFLPSVNYTALPSSKPQQQSSTDN----------TQKRGGQVRGSNGNNNGGRSNRRNWNNNGGKPQCQLCGRFG

Query:  HTVLRCYMRFERWYQGPNNNNSQTPS--SPNFPQNQFPSSSGPGNCSSSQ------QPFTAYSLQHDLNKENQ---WYPDSGASNHVTSDISNLTVGSEY
        HTV RC+ R++  + G    N  TP             S S  GN + ++      Q ++         ++ Q   W+PDSGA+NHVT D+ NL  G+EY
Subjt:  HTVLRCYMRFERWYQGPNNNNSQTPS--SPNFPQNQFPSSSGPGNCSSSQ------QPFTAYSLQHDLNKENQ---WYPDSGASNHVTSDISNLTVGSEY

Query:  TDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKVTDGLYQFSLEKAES
          ++K+H+GN TGL + HIG S   S S PN V  LKN+L VP I KNL+SVSQFA DN V+FEFHP   FVKD S   +LLQG +  GLYQF+L K   
Subjt:  TDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKVTDGLYQFSLEKAES

Query:  LTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTCNT--HTLELWHKRLGHTSLPTVKRILQTCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPL
            G   S+ + ++   + S V N  SD P   N+  H  +LWHKRLGH +   V ++L   KIP+S     S+  AC +GKSH LPFP S TVYT PL
Subjt:  LTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTCNT--HTLELWHKRLGHTSLPTVKRILQTCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPL

Query:  QLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNG
        QLVV+D+WGPA   S   F YY+SFVD YSR+TW+YFL +K++  +AFL+FK   E    C +  FQTD GGEFR   +Y ++N I HRLSCP+TS+QNG
Subjt:  QLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNG

Query:  IVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTH
        I+ERKHR I+++G TLL+QASL L++W DAFSTA +L NRLP+ V +   P E LF  +PNYS  K FGCLC+P LRPYNKHK++ RS+PC FLGYS+ H
Subjt:  IVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTH

Query:  KGYKCLSFSGRLFISRHVQFDETKFP---AIVPSLFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSM
        KGYKCL+  GR+FISR V FDET+FP    +   +   ++S +  P +    P+VK+  +  +S  P+ S PT SA                 SS  L  
Subjt:  KGYKCLSFSGRLFISRHVQFDETKFP---AIVPSLFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSM

Query:  DNTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSHPSTSD----------NNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTE
        +N  +D+     D S        NT+   ++ I N+S S PS+S+          + +S +P  + NT  V         H MVTR+K+GIFKP+V   +
Subjt:  DNTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSHPSTSD----------NNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTE

Query:  YTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIF
            EP T +EA+    W  AM EE +AL+KN TW+LV+ P ++  +GC+WVFK+KRN DGSV+RYKARLVAKG+ Q    D+ ETFSPVVKP TIRV+ 
Subjt:  YTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIF

Query:  TMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPH--MVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILV
         +A++  W ++Q+D+NNAFL+G L EEV+M QP G    TN    +VCKL KALYGLKQAPRAWFD+L   L   G  S+K+D SL V   N +  ++LV
Subjt:  TMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPH--MVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILV

Query:  YVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRST
        YVDDIV+TG+S   I +LI+ L   FSLKDLG LSYFLG+E                  D+L KTKM  A  + TPM+SG  LSA  G+P  +   YRS 
Subjt:  YVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRST

Query:  VGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHI
        VGALQY T+TRPEI++SVNKVCQFM  P   HW+AVKRILRYL GT D G+  +   +  L GF DADW SD DDR+STS  CV+LG +L++WSSKKQH 
Subjt:  VGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHI

Query:  ISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSS
         SRSSTEAEYRSLA ++++++WLQSL  +L+  ++  P++WCDN+  V  SANPVLHSRTKH+E+D+YFVR+ V++++L V H+PT  Q+AD+FTKPLS 
Subjt:  ISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSS

Query:  SSFLPLRSKLNVCS
          F  LR KL V S
Subjt:  SSFLPLRSKLNVCS

TrEMBL top hitse value%identityAlignment
A0A2K3MRS5 Histone deacetylase (Fragment)0.0e+0044.93Show/hide
Query:  KLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCETAREIWKILNDRFSSRN
        KL E+NF  W+ QV   +  H L  F+  D HIPP+     D  +ATL NP Y +W ++D ++ SW   ++S+EIL+  L C    E+W  +   F  + 
Subjt:  KLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCETAREIWKILNDRFSSRN

Query:  LARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTPPLQKVYSLLLTQESRIERN--------
         A+  Q++  L +    N ++K+Y ++++ L+D+L +    +  + H+  IL+GL ++++  +SVI  K D   + +V +LLL  E+R+E+         
Subjt:  LARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTPPLQKVYSLLLTQESRIERN--------

Query:  SSIN-------------SDGFLPSVNYTALPSSKPQQQSSTDNTQKRGGQ-VRGSNGNNNGGRSNRRNWNNNGGKPQCQLCGRFGHTVLRCYMRFERWYQ
        +SIN              +   PS+N T   S  P    +  N++ RG +  +G  G N GGR++  N NNN    QCQ+C +  HT L C+ R +  YQ
Subjt:  SSIN-------------SDGFLPSVNYTALPSSKPQQQSSTDNTQKRGGQ-VRGSNGNNNGGRSNRRNWNNNGGKPQCQLCGRFGHTVLRCYMRFERWYQ

Query:  GPNNNNSQT-PSSP---------------NFPQN----------QFP----SSSGPGNCSSSQQPFTAYSLQHDLNKENQWYPDSGASNHVTSDISNLTV
          N  NSQ  P +P               NFP             FP    +SS P   +    P    +     N  N WYPDSGAS HVT+D  N+  
Subjt:  GPNNNNSQT-PSSP---------------NFPQN----------QFP----SSSGPGNCSSSQQPFTAYSLQHDLNKENQWYPDSGASNHVTSDISNLTV

Query:  GSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKV-TDGLYQFSL
         S ++  +++ +GN   L VL  GSS   S +H  +   L NLLHVPTITKNLISVSQFA DN VFFEFH +H  VK   T  ILLQG V  DGLY FS 
Subjt:  GSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKV-TDGLYQFSL

Query:  EKAESLTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTCNTHTLELWHKRLGHTSLPTVKRILQTCKIPY--SDNKMMSLWHACAIGKSHKLPFPTSSTV
            S+ P    ++S   + +  S++S SNV S   S+ + ++  +WH RLGH +  T+K  L+ C I +  +DN +  L  AC +GK+H+L  P S TV
Subjt:  EKAESLTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTCNTHTLELWHKRLGHTSLPTVKRILQTCKIPY--SDNKMMSLWHACAIGKSHKLPFPTSSTV

Query:  YTSPLQLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYT
        Y+ PL+LV +D+WGPA  TS   + YYI+F+D +S++TWIY L SK+EA   F  FKT  E  L CS+   QTD GGEFR FT  L E  + HRL CP+T
Subjt:  YTSPLQLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYT

Query:  SQQNGIVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLG
          QNG+VERKHR ++D+G TLLSQA+L + +WD AF TA +L NRLP+       P  VLF   P+Y   K FGC C+PLLRPYN HK++ RS  C+F+G
Subjt:  SQQNGIVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLG

Query:  YSTTHKGYKCLSFSGRLFISRHVQFDETKFPAIVPSLFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPL
        YS THKGYKCLS SG++FIS+ V F+E +FP     LF S   N   PS    +PI         S  P  +H     N S +  L+   Q Q  S+  L
Subjt:  YSTTHKGYKCLSFSGRLFISRHVQFDETKFPAIVPSLFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPL

Query:  SMDNTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSHPSTSDNNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLT-EYTEVEPP
        S     + +        P  SP+SNN+N  Q  +         + SD   S+ P  ++NT            H M TR+KSGI  P+   T   T  EP 
Subjt:  SMDNTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSHPSTSDNNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLT-EYTEVEPP

Query:  TVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFTMALTYG
        TVK+AL    W  AMK+E  AL +N TWTLV+ P ++  IGCKWVF+ K N DGS+ +YKARLVAKGFHQ+   D+ ETFSPV+KP TIR+I ++A++Y 
Subjt:  TVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFTMALTYG

Query:  WPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILVYVDDIVITG
        WPL+Q+DINNAFL+G L+EEV+M QP G  +  +  +VCKL KALYGLKQAPR WFD+L+S L   G  +SK D SL +   N    Y+LVYVDDI+ITG
Subjt:  WPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILVYVDDIVITG

Query:  NSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRSTVGALQYATL
        +SP ++  L+N L   FSLK LG L YFLG+EV     G + L+QTKYI D+L KT M ++NPIATPM+S   LS    +  +D  LYRS VG+LQYAT+
Subjt:  NSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRSTVGALQYATL

Query:  TRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPS---SFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHIISRSST
        +RPEIS++VNKVCQFM AP + HW AVKRILRYL GT   GL     +      L  F DADWASDPDDR+STS   ++ G NLI+W S+KQ +++RSST
Subjt:  TRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPS---SFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHIISRSST

Query:  EAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSSSSFLPL
        EAEYRSLA ++AD++WLQ+L K+L +  + P + +CDN  AV  + NPVLHSRTKH+EID++FVR+ V+ K+LTV H+P T Q+ADI TKP+S+  FL +
Subjt:  EAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSSSSFLPL

Query:  RSKLNVCSS
        RSKLNV  S
Subjt:  RSKLNVCSS

A0A2K3NRY4 Histone deacetylase0.0e+0043.15Show/hide
Query:  VKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCETAREIWKILNDRFSSR
        +KLDE+NFL W  QV  V+  H L  FI  +  IP K     + ++ T  + EY  W+ QD  + +W L ++S+ +L  VL C+ A ++W  ++  F+S 
Subjt:  VKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCETAREIWKILNDRFSSR

Query:  NLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTPPLQKVYSLLLTQESRIERNSSINSDG
          AR  Q++S L+  KK   S+ +Y +++K ++DSL A    +S  +H+  IL+GL  E++  V +I  + DTP ++ V +LL+ QE++ E+      + 
Subjt:  NLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTPPLQKVYSLLLTQESRIERNSSINSDG

Query:  FLPSVNYTALPSSKPQQQSSTDNTQKRGG---QVRGSNGNNNG---GRSNRRNWNNNGGKPQCQLCGRFGHTVLRCYMRFERWYQGPNNNNSQTPSSPNF
          P+V+     S       S ++     G      G  G   G   GR   R+  N      CQ+CG+ GH    C+ R++     PN     TP    F
Subjt:  FLPSVNYTALPSSKPQQQSSTDNTQKRGG---QVRGSNGNNNG---GRSNRRNWNNNGGKPQCQLCGRFGHTVLRCYMRFERWYQGPNNNNSQTPSSPNF

Query:  PQNQFPSSSGPGNCSSSQQPFTA----------YSLQH----DLNKENQWYPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSK
        P   F SSS P     S  P+            Y++       +  E  WYPDSGA++H+T +  NL   S YT   +V +GN  G+ +  +G S +QS+
Subjt:  PQNQFPSSSGPGNCSSSQQPFTA----------YSLQH----DLNKENQWYPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSK

Query:  SHPNSVFHLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKV-TDGLYQFSLEKAESLTPKGGCSSSFQPQVLHTSLSSVSNV
          PN    L NLLHVP I+KNLISVS+FA DN+V FEFHP   FVK  ++ ++LL+G V TDGLYQF  +  + L        S    V   + S+ S+V
Subjt:  SHPNSVFHLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKV-TDGLYQFSLEKAESLTPKGGCSSSFQPQVLHTSLSSVSNV

Query:  VSDIPSTCNT--HTLELWHKRLGHTSLPTVKRILQTCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISFV
          +   T N+  +  + WH RLGH+    VK +L  CKIP+++   +   +AC++GKSHKL  P S T++++P +++  D+WGPA  TS   + YYI+FV
Subjt:  VSDIPSTCNT--HTLELWHKRLGHTSLPTVKRILQTCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISFV

Query:  DVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLSLEF
        D  ++FTWIYFL SK +A  AF  F T ++     +I   Q+D GGEFR FT+ L    I+HRL+CP+TS QNGIVERKHRQI++MG T++SQAS+ L++
Subjt:  DVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLSLEF

Query:  WDDAFSTATYLTNRLP-SIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTHKGYKCLSFSGRLFISRHVQFDETKF
        WD +F+ + +L NRLP S +  + SP   L+   P+Y+  K FGC C+P LRPYNKHK + RS+ C++LG S  HKG+KCLS  GR++IS+ V+FDE  F
Subjt:  WDDAFSTATYLTNRLP-SIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTHKGYKCLSFSGRLFISRHVQFDETKF

Query:  P---AIVPSLFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSMDNTTTDMTLHNNDPSPLPSPSSNNT
        P        + SS++ +  + S   ++PI   ++                 N +P  +L++ +     ++  +S ++T +  +  +++ SP PSP  +N 
Subjt:  P---AIVPSLFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSMDNTTTDMTLHNNDPSPLPSPSSNNT

Query:  NPDQSLSISNQSVSHPSTSDNNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTEYTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTW
         P            +P   +N  +  P  T   + V ++    N+HPMVTR K+G  KP+  L   T++EP TV+ AL  + W+AAMK+E+ AL+ N TW
Subjt:  NPDQSLSISNQSVSHPSTSDNNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTEYTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTW

Query:  TLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFTMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLG
         LV  P  KK IGC+W+F++K N DGSV ++KARLVAKGFHQ A  D+TETFSPV+KP TIR+I T+A+T  W +QQ+DINNAFL+GFL EEV+M QP G
Subjt:  TLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFTMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLG

Query:  MTDSTNPHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILVYVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYF
          + ++  +VCKL ++LYGLKQAPRAW++RL++ L + G   S+ D SLLV   NG+  Y+L+YVDDI+ITG++P +I  LIN LHQ+FSLK LG L YF
Subjt:  MTDSTNPHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILVYVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYF

Query:  LGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRSTVGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVK
        LG+EV +  SG I L+Q KYI D+L K  M EAN I +PM+S   LS +  +  +D  +YRS VGALQYATLTRP+I++SVNKV QFM  P + HW+AVK
Subjt:  LGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRSTVGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVK

Query:  RILRYLIGTIDHGLFFR---KPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHIISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNI
        RILRYL G++ HGL  +   +   + L  ++DADWA+D DDR+S S  C+Y G NLI WSSKKQ +++RSSTEAEYRS+A+ +A+L+W+QSL  +L++  
Subjt:  RILRYLIGTIDHGLFFR---KPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHIISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNI

Query:  SRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSSSSFLPLRSKLNVCSS
          P +L CDNL AV  + NP+LHSRTKH+E+DI+FVR+ VL K+L V H+P   Q+AD FTKPLS +++  +R+KL V SS
Subjt:  SRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSSSSFLPLRSKLNVCSS

A0A438EA49 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0046.63Show/hide
Query:  NETSPSESLSSSQPMKAINPGSKTNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSN
        N T+  ESL        I+P S+   ++L+++NFL WK Q+   +R +GL  F+ G   +PPK     D     +PNP++  + RQD+L+ SW L S+ +
Subjt:  NETSPSESLSSSQPMKAINPGSKTNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSN

Query:  EILSEVLDCETAREIWKILNDRFSSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTP
          L +V+ C +A E                               L+++DY  K+KN  D LA A  KIS  DH++ I++GLG EY+  ++VI+ K  +P
Subjt:  EILSEVLDCETAREIWKILNDRFSSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTP

Query:  PLQKVYSLLLTQESRIERNSSINSDGFLPSVNYTALPSSKPQQQSSTDN----------TQKRGGQVRGSNGNNNGGRSNRRNWNNNGGKPQCQLCGRFG
         LQ V S L+  E RI    S N      SVNYT+  S++    S   N           Q  G QV   +  +N GR   R     G KPQCQLC +FG
Subjt:  PLQKVYSLLLTQESRIERNSSINSDGFLPSVNYTALPSSKPQQQSSTDN----------TQKRGGQVRGSNGNNNGGRSNRRNWNNNGGKPQCQLCGRFG

Query:  HTVLRCYMRFERWYQGPNNNNSQTPS--SPNFPQNQFPSSSGPGNCSSSQ------QPFTAYSLQHDLNKENQ---WYPDSGASNHVTSDISNLTVGSEY
        HTV RC+ R++  + G    N  TP             S S  GN + ++      Q ++         ++ Q   W+PDSGA+NHVT D+ NL  G+EY
Subjt:  HTVLRCYMRFERWYQGPNNNNSQTPS--SPNFPQNQFPSSSGPGNCSSSQ------QPFTAYSLQHDLNKENQ---WYPDSGASNHVTSDISNLTVGSEY

Query:  TDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKVTDGLYQFSLEKAES
          ++K+H+GN TGL + HIG S   S S PN V  LKN+L VP I KNL+SVSQFA DN V+FEFHP   FVKD S   +LLQG +  GLYQF+L K   
Subjt:  TDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKVTDGLYQFSLEKAES

Query:  LTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTCNT--HTLELWHKRLGHTSLPTVKRILQTCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPL
            G   S+ + ++   + S V N  SD P   N+  H  +LWHKRLGH +   V ++L   KIP+S     S+  AC +GKSH LPFP S TVYT PL
Subjt:  LTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTCNT--HTLELWHKRLGHTSLPTVKRILQTCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPL

Query:  QLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNG
        QLVV+D+WGPA   S   F YY+SFVD YSR+TW+YFL +K++  +AFL+FK   E    C +  FQTD GGEFR   +Y ++N I HRLSCP+TS+QNG
Subjt:  QLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNG

Query:  IVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTH
        I+ERKHR I+++G TLL+QASL L++W DAFSTA +L NRLP+ V +   P E LF  +PNYS  K FGCLC+P LRPYNKHK++ RS+PC FLGYS+ H
Subjt:  IVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTH

Query:  KGYKCLSFSGRLFISRHVQFDETKFP---AIVPSLFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSM
        KGYKCL+  GR+FISR V FDET+FP    +   +   ++S +  P +    P+VK+  +  +S  P+ S PT SA                 SS  L  
Subjt:  KGYKCLSFSGRLFISRHVQFDETKFP---AIVPSLFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSM

Query:  DNTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSHPSTSD----------NNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTE
        +N  +D+     D S        NT+   ++ I N+S S PS+S+          + +S +P  + NT  V         H MVTR+K+GIFKP+V   +
Subjt:  DNTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSHPSTSD----------NNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTE

Query:  YTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIF
            EP T +EA+    W  AM EE +AL+KN TW+LV+ P ++  +GC+WVFK+KRN DGSV+RYKARLVAKG+ Q    D+ ETFSPVVKP TIRV+ 
Subjt:  YTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIF

Query:  TMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPH--MVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILV
         +A++  W ++Q+D+NNAFL+G L EEV+M QP G    TN    +VCKL KALYGLKQAPRAWFD+L   L   G  S+K+D SL V   N +  ++LV
Subjt:  TMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPH--MVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILV

Query:  YVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRST
        YVDDIV+TG+S   I +LI+ L   FSLKDLG LSYFLG+EV     GG+ LSQ KYI D+L KTKM  A  + TPM+SG  LSA  G+P  +   YRS 
Subjt:  YVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRST

Query:  VGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHI
        VGALQY T+TRPEI++SVNKVCQFM  P   HW+AVKRILRYL GT D G+  +   +  L GF DADW SD DDR+STS  CV+LG +L++WSSKKQH 
Subjt:  VGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHI

Query:  ISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSS
         SRSSTEAEYRSLA ++++++WLQSL  +L+  ++  P++WCDN+  V  SANPVLHSRTKH+E+D+YFVR+ V++++L V H+PT  Q+AD+FTKPLS 
Subjt:  ISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSS

Query:  SSFLPLRSKLNVCS
          F  LR KL V S
Subjt:  SSFLPLRSKLNVCS

A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0046.7Show/hide
Query:  NETSPSESLSSSQPMKAINPGSKTNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSN
        N T+  ESL        I+P S+   ++L+++NFL WK Q+   +R +GL  F+ G   +PPK     D     +PNP++  + RQD+L+ SW L S+ +
Subjt:  NETSPSESLSSSQPMKAINPGSKTNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSN

Query:  EILSEVLDCETAREIWKILNDRFSSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTP
          L +V+ C +A E+W  ++  F+S++ A+VM  KS+++ +KK  L+++DY  K+KN  D LA A  KIS  DH++ I++GLG EY+  ++VI+ K  +P
Subjt:  EILSEVLDCETAREIWKILNDRFSSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTP

Query:  PLQKVYSLLLTQESRIERNSSINSDGFLPSVNYTALPSSKPQQQSSTDN----------TQKRGGQVRGSNGNNNGGRSNRRNWNNNGGKPQCQLCGRFG
         LQ V S L+  E RI    S N      SVNYT+  S++    S   N           Q  G QV   +  +N GR   R     G KPQCQLC +FG
Subjt:  PLQKVYSLLLTQESRIERNSSINSDGFLPSVNYTALPSSKPQQQSSTDN----------TQKRGGQVRGSNGNNNGGRSNRRNWNNNGGKPQCQLCGRFG

Query:  HTVLRCYMRFERWYQGPNNNNSQTPS--SPNFPQNQFPSSSGPGNCSSSQ------QPFTAYSLQHDLNKENQ---WYPDSGASNHVTSDISNLTVGSEY
        HTV RC+ R++  + G    N  TP             S S  GN + ++      Q ++         ++ Q   W+PDSGA+NHVT D+ NL  G+EY
Subjt:  HTVLRCYMRFERWYQGPNNNNSQTPS--SPNFPQNQFPSSSGPGNCSSSQ------QPFTAYSLQHDLNKENQ---WYPDSGASNHVTSDISNLTVGSEY

Query:  TDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKVTDGLYQFSLEKAES
          ++K+H+GN TGL + HIG S   S S PN V  LKN+L VP I KNL+SVSQFA DN V+FEFHP   FVKD S   +LLQG +  GLYQF+L K   
Subjt:  TDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKVTDGLYQFSLEKAES

Query:  LTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTCNT--HTLELWHKRLGHTSLPTVKRILQTCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPL
            G   S+ + ++   + S V N  SD P   N+  H  +LWHKRLGH +   V ++L   KIP+S     S+  AC +GKSH LPFP S TVYT PL
Subjt:  LTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTCNT--HTLELWHKRLGHTSLPTVKRILQTCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPL

Query:  QLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNG
        QLVV+D+WGPA   S   F YY+SFVD YSR+TW+YFL +K++  +AFL+FK   E    C +  FQTD GGEFR   +Y ++N I HRLSCP+TS+QNG
Subjt:  QLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNG

Query:  IVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTH
        I+ERKHR I+++G TLL+QASL L++W DAFSTA +L NRLP+ V +   P E LF  +PNYS  K FGCLC+P LRPYNKHK++ RS+PC FLGYS+ H
Subjt:  IVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTH

Query:  KGYKCLSFSGRLFISRHVQFDETKFP---AIVPSLFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSM
        KGYKCL+  GR+FISR V FDET+FP    +   +   ++S +  P +    P+VK+  +  +S  P+ S PT SA                 SS  L  
Subjt:  KGYKCLSFSGRLFISRHVQFDETKFP---AIVPSLFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSM

Query:  DNTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSHPSTSD----------NNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTE
        +N  +D+     D S        NT+   ++ I N+S S PS+S+          + +S +P  + NT  V         H MVTR+K+GIFKP+V   +
Subjt:  DNTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSHPSTSD----------NNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTE

Query:  YTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIF
            EP T +EA+    W  AM EE +AL+KN TW+LV+ P ++  +GC+WVFK+KRN DGSV+RYKARLVAKG+ Q    D+ ETFSPVVKP TIRV+ 
Subjt:  YTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIF

Query:  TMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPH--MVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILV
         +A++  W ++Q+D+NNAFL+G L EEV+M QP G    TN    +VCKL KALYGLKQAPRAWFD+L   L   G  S+K+D SL V   N +  ++LV
Subjt:  TMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPH--MVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILV

Query:  YVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRST
        YVDDIV+TG+S   I +LI+ L   FSLKDLG LSYFLG+E                  D+L KTKM  A  + TPM+SG  LSA  G+P  +   YRS 
Subjt:  YVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRST

Query:  VGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHI
        VGALQY T+TRPEI++SVNKVCQFM  P   HW+AVKRILRYL GT D G+  +   +  L GF DADW SD DDR+STS  CV+LG +L++WSSKKQH 
Subjt:  VGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHI

Query:  ISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSS
         SRSSTEAEYRSLA ++++++WLQSL  +L+  ++  P++WCDN+  V  SANPVLHSRTKH+E+D+YFVR+ V++++L V H+PT  Q+AD+FTKPLS 
Subjt:  ISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSS

Query:  SSFLPLRSKLNVCS
          F  LR KL V S
Subjt:  SSFLPLRSKLNVCS

A5BFT3 Integrase catalytic domain-containing protein0.0e+0044.57Show/hide
Query:  SPSESLSSSQPMKAINPGSKTNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHI----PPKTTRSGDTSSATLPNPEYETWIRQ----DNLITSWFL
        +P+ +  SS     IN     + VKLD +NFL WK Q+ + +R +GL  F+  D  +     P+  R            + E  +R     +N I    L
Subjt:  SPSESLSSSQPMKAINPGSKTNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHI----PPKTTRSGDTSSATLPNPEYETWIRQ----DNLITSWFL

Query:  GSMSNEILSEVLDCETAREIWKILNDRFSSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITE
        G  S+  LS+             L   F+S+  A+  Q K++L+  KKG  ++ +Y  K+K  VDSLA+    +S+ DHV  IL GL ++Y+  V+ +  
Subjt:  GSMSNEILSEVLDCETAREIWKILNDRFSSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITE

Query:  KDDTPPLQKVYSLLLTQESRIERNSSINSDGFLPSVNYTALPSSKPQQQSSTD----NTQ-----------------KRGGQV--RGSNGNNNG------
        ++D   ++++ +LL+  ESR+E+N+  NS    PS +  +  + +   +   D    N+Q                 +RGG    RG N N NG      
Subjt:  KDDTPPLQKVYSLLLTQESRIERNSSINSDGFLPSVNYTALPSSKPQQQSSTD----NTQ-----------------KRGGQV--RGSNGNNNG------

Query:  --GRSN------RRNWN--NNGGKPQCQLCGRFGHTVLRCYMRFERWYQGPNNNNSQTPSSPNFPQNQFPSSSGPGNCSSSQQPFTAYSLQHDLNKENQW
          GR N      R  WN  N   KP CQLCG+ GH V +CY RF+  +Q P N +S+  SSP    +  P  +G    S             ++  ++ W
Subjt:  --GRSN------RRNWN--NNGGKPQCQLCGRFGHTVLRCYMRFERWYQGPNNNNSQTPSSPNFPQNQFPSSSGPGNCSSSQQPFTAYSLQHDLNKENQW

Query:  YPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLST
        YPDSGASNHVT +  NL   +E+   N+VHVGN TGL + HIG S   S      +  L +LLHVP+ITKNL+SVS+FA DN+VFFEFH    FVKD  T
Subjt:  YPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLST

Query:  GRILLQGKVTDGLYQF-----SLEKAESLTPKGG-CSSSFQPQVLHTSLSSVSNVVSDIPSTCNTHTLELWHKRLGHTSLPTVKRILQTCKIPYSDNKMM
          +L+ GKV DGLY F     +L   +SL+      +SSF  +V  TSLSS               T +LWHKRLGH S  T+K +L  C + + +    
Subjt:  GRILLQGKVTDGLYQF-----SLEKAESLTPKGG-CSSSFQPQVLHTSLSSVSNVVSDIPSTCNTHTLELWHKRLGHTSLPTVKRILQTCKIPYSDNKMM

Query:  SLWHACAIGKSHKLPFPTSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGE
        +   +C +GK H+ PF  S T YT PL+L+  D+WGP    S + +RYYI FVD +SRF+WI+ L +K+EA + F+ FKT VE   +  I   QTD GGE
Subjt:  SLWHACAIGKSHKLPFPTSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGE

Query:  FRLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCY
        FR F SYL EN I HR+SCP+T QQNG+ ERKHR I++ G TLL  ASL L+FWD++F T  YL+NRLP+ +  +  P+EVLF   P+YSF K FGC C+
Subjt:  FRLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCY

Query:  PLLRPYNKHKIEPRSTPCLFLGYSTTHKGYKCLSFSGRLFISRHVQFDETKFPAIVPSLFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSA
        P LRPYN HK++ RS  C FLGYS  HKGYKC+S +GR++IS  V F+ET FP                 S T     V S  +S VS  P+ SH +PSA
Subjt:  PLLRPYNKHKIEPRSTPCLFLGYSTTHKGYKCLSFSGRLFISRHVQFDETKFPAIVPSLFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSA

Query:  NPSPVTALSIPSQPQPCSSA-PLS-MDNTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSHPSTSDNNHSSQPLSTSNTVLVPTSAAVANSHPMV
        +P  ++   +P+   P SSA P+S MDN    ++ H + P+          + D +L+ + Q VS+P  +   H    ++ ++        A  N+HPM+
Subjt:  NPSPVTALSIPSQPQPCSSA-PLS-MDNTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSHPSTSDNNHSSQPLSTSNTVLVPTSAAVANSHPMV

Query:  TRAKSGIFKPRVLLTEYTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYT
        TRAKSGI KP++ +      EP +V  AL+   W  AM  E+ AL +N+TW+LV  P  ++ IGCKWV+K K N DG+V +YKARLVAKGFHQ+A  D+T
Subjt:  TRAKSGIFKPRVLLTEYTEVEPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYT

Query:  ETFSPVVKPATIRVIFTMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSL
        ETFSPVVKP+T+RV+FT+AL+  W ++Q+D+NNAFL+G L EEVFM QP G  D  NP++VC+L KALYGLKQAPRAWF++L   L S G VS+K+D SL
Subjt:  ETFSPVVKPATIRVIFTMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSL

Query:  LVCHHNGTCCYILVYVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAY
         +        Y+LVYVDDI++ G+    I  LI  L+ +FSLKDLG + YFLG++VS+  + G+ LSQTKYI D+L KTKM    P  TP+ +G  L   
Subjt:  LVCHHNGTCCYILVYVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAY

Query:  HGEPFSDTRLYRSTVGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYL
         G+P  D   YRSTVGALQY T+TRPE+S+SVNKVCQFM  PT+ HW+ VKRILRYL GT+ HGL  +K S+  L GF DADWASD DDR+STS  CV+L
Subjt:  HGEPFSDTRLYRSTVGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYL

Query:  GGNLITWSSKKQHIISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPT
        G NLI+W SKKQHI+SRSS E EYRSLA + A++ WL+SL  +L+L +++PP++WCDNL  V  SANPVLH+RTKH+E+D+YFVR+ V++K + V H+P+
Subjt:  GGNLITWSSKKQHIISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPT

Query:  TAQIADIFTKPLSSSSFLPLRSKLNVCSSNTIGLRGVLR
          Q+AD+ TK +SS+ F+  R KL + + +T+ LRG +R
Subjt:  TAQIADIFTKPLSSSSFLPLRSKLNVCSSNTIGLRGVLR

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.0e-12926.67Show/hide
Query:  SKTNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCETAREIWKILND
        +K NI   D E +  WK ++RA+L +  +   ++G                  +PN   ++W + +    S  +  +S+  L+      TAR+I + L+ 
Subjt:  SKTNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCETAREIWKILND

Query:  RFSSRNLARVMQIKSRLETIK-KGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVI-TEKDDTPPLQKVYSLLLTQESRIERN
         +  ++LA  + ++ RL ++K    +SL  +F     L+  L AA  KI   D + H+L  L S YD  ++ I T  ++   L  V + LL QE +I+ +
Subjt:  RFSSRNLARVMQIKSRLETIK-KGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVI-TEKDDTPPLQKVYSLLLTQESRIERN

Query:  SSINSDGFLPSVNYTALPSSKPQQQSSTDNTQKRGGQVRGSNGNNNGGRSNRRNWNNNGGKPQCQLCGRFGHTVLRCYMRFERWYQGPNNNNS---QTPS
         +  S   + ++ +            + +NT K       +   N   +  +    N+  K +C  CGR GH    C+  ++R     N  N    QT +
Subjt:  SSINSDGFLPSVNYTALPSSKPQQQSSTDNTQKRGGQVRGSNGNNNGGRSNRRNWNNNGGKPQCQLCGRFGHTVLRCYMRFERWYQGPNNNNS---QTPS

Query:  SPN--FPQNQFPSSSGPGNCSSSQQPFTAYSLQHDLNKENQWYPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFH
        S    F   +  ++S   NC                     +  DSGAS+H+ +D S  T   E     K+ V              F+ +      +  
Subjt:  SPN--FPQNQFPSSSGPGNCSSSQQPFTAYSLQHDLNKENQWYPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFH

Query:  LKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKVTDGLYQFSLEKAESLTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTCN
        L+N                   D+++  E      F K+ + G ++   ++ +       +K+     K G        +L+     V N  +   +  +
Subjt:  LKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKVTDGLYQFSLEKAESLTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTCN

Query:  THTLELWHKRLGHTS---LPTVKRILQTCKIPYSDNKMMS-------LWHACAIGKSHKLPFP--TSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISF
         +   LWH+R GH S   L  +KR     K  +SD  +++       +   C  GK  +LPF      T    PL +V +D+ GP    + ++  Y++ F
Subjt:  THTLELWHKRLGHTS---LPTVKRILQTCKIPYSDNKMMS-------LWHACAIGKSHKLPFP--TSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISF

Query:  VDVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEF--RLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLS
        VD ++ +   Y +  K++ F  F  F    E   N  ++    D G E+       +  +  I + L+ P+T Q NG+ ER  R I +   T++S A L 
Subjt:  VDVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEF--RLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLS

Query:  LEFWDDAFSTATYLTNRLPS--IVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKH-KIEPRSTPCLFLGYSTTHKGYKCL-SFSGRLFISRHVQ
          FW +A  TATYL NR+PS  +V  + +P E+    +P     + FG   Y  ++  NK  K + +S   +F+GY     G+K   + + +  ++R V 
Subjt:  LEFWDDAFSTATYLTNRLPS--IVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKH-KIEPRSTPCLFLGYSTTHKGYKCL-SFSGRLFISRHVQ

Query:  FDET--------KFPAI-VPSLFSSNNSNLEAPS---VTTSLP----------IVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSM
         DET        KF  + +     S N N    S   + T  P           +K +  S    FP DS          +     P++ + C +     
Subjt:  FDET--------KFPAI-VPSLFSSNNSNLEAPS---VTTSLP----------IVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSM

Query:  DNTTTDMTLHNNDPSPLP----SPSSNNTNPDQSLSISNQSVSHPSTSDNNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTEYTEV-E
        D+  ++    N           + S  + NP++S              DN     P       ++   +    + P ++  +      +V+L  +T   +
Subjt:  DNTTTDMTLHNNDPSPLP----SPSSNNTNPDQSLSISNQSVSHPSTSDNNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTEYTEV-E

Query:  PPTVKEALK----SAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFT
         P   + ++     + W  A+  E  A   N+TWT+   P +K I+  +WVF +K N  G+  RYKARLVA+GF Q+  +DY ETF+PV + ++ R I +
Subjt:  PPTVKEALK----SAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFT

Query:  MALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHN--GTCCYILVY
        + + Y   + Q+D+  AFL+G L EE++M  P G+  S N   VCKL KA+YGLKQA R WF+     L     V+S  D  + +          Y+L+Y
Subjt:  MALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHN--GTCCYILVY

Query:  VDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRSTV
        VDD+VI       ++     L +KF + DL  + +F+G+ +       I+LSQ+ Y+  IL K  M   N ++TP+ S       + +   +T   RS +
Subjt:  VDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRSTV

Query:  GALQYATL-TRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSF--CLEGFADADWASDPDDRKSTSSFCVYL-GGNLITWSSKK
        G L Y  L TRP+++ +VN + ++        WQ +KR+LRYL GTID  L F+K  +F   + G+ D+DWA    DRKST+ +   +   NLI W++K+
Subjt:  GALQYATL-TRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSF--CLEGFADADWASDPDDRKSTSSFCVYL-GGNLITWSSKK

Query:  QHIISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKP
        Q+ ++ SSTEAEY +L     + +WL+ L   + + +  P  ++ DN G +  + NP  H R KH++I  +F R+ V    + + ++PT  Q+ADIFTKP
Subjt:  QHIISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKP

Query:  LSSSSFLPLRSKLNV
        L ++ F+ LR KL +
Subjt:  LSSSSFLPLRSKLNV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-15129.07Show/hide
Query:  GSKTNIVKLDEEN-FLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCETAREIWKIL
        G K  + K + +N F  W+ ++R +L   GL   ++ D+  P       DT  A       E W   D    S     +S+++++ ++D +TAR IW  L
Subjt:  GSKTNIVKLDEEN-FLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCETAREIWKIL

Query:  NDRFSSRNLARVMQIKSRLETIKKG-NLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTPPLQKVYSLLLTQESRIER
           + S+ L   + +K +L  +      +   +      L+  LA    KI  +D  + +L  L S YD   + I     T  L+ V S LL  E     
Subjt:  NDRFSSRNLARVMQIKSRLETIKKG-NLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTPPLQKVYSLLLTQESRIER

Query:  NSSINSDGFLPSVNYTALPSSKPQQQSSTDNTQKRGGQVRGSNGNNNGGRSNRRNWNNNGGKPQ---CQLCGRFGHTVLRCYMRFERWYQGPNNNNSQTP
                             KP+ Q     T+ RG   + S  +NN GRS  R  + N  K +   C  C + GH        F+R    P     +T 
Subjt:  NSSINSDGFLPSVNYTALPSSKPQQQSSTDNTQKRGGQVRGSNGNNNGGRSNRRNWNNNGGKPQ---CQLCGRFGHTVLRCYMRFERWYQGPNNNNSQTP

Query:  SSPNFPQNQFPSSSGPGNCSSSQQPFTAYSLQHDLNKENQWYPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHL
           N   +   ++    N +            H    E++W  D+ AS+H T  + +L       D   V +GN +   +  IG   +  K++      L
Subjt:  SSPNFPQNQFPSSSGPGNCSSSQQPFTAYSLQHDLNKENQWYPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHL

Query:  KNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKD---LSTGRILLQGKVTDG-LYQFSLEKAESLTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPS
        K++ HVP +  NLIS      D           YF      L+ G +++   V  G LY+ + E                  +    L++  + +S    
Subjt:  KNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKD---LSTGRILLQGKVTDG-LYQFSLEKAESLTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPS

Query:  TCNTHTLELWHKRLGHTSLPTVKRILQTCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTW
              ++LWHKR+GH S   ++ + +   I Y+    +     C  GK H++ F TSS    + L LV +D+ GP    S    +Y+++F+D  SR  W
Subjt:  TCNTHTLELWHKRLGHTSLPTVKRILQTCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTW

Query:  IYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEF--RLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLSLEFWDDAFS
        +Y L +K++ FQ F  F   VE+     + R ++D GGE+  R F  Y   + I H  + P T Q NG+ ER +R I++   ++L  A L   FW +A  
Subjt:  IYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEF--RLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLSLEFWDDAFS

Query:  TATYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTHKGYKCLS-FSGRLFISRHVQFDETKFPAIVPS
        TA YL NR PS+      P  V    + +YS  K FGC  +  +    + K++ +S PC+F+GY     GY+       ++  SR V F E+        
Subjt:  TATYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTHKGYKCLS-FSGRLFISRHVQFDETKFPAIVPS

Query:  LFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSMDNTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSIS
                 E  +       VK+  I                 P+ VT   IPS     S+ P S ++TT +++     P  +           + L   
Subjt:  LFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSMDNTTTDMTLHNNDPSPLPSPSSNNTNPDQSLSIS

Query:  NQSVSHPSTSDNNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTEYTEVEPPTVKEAL---KSAHWVAAMKEEHQALIKNDTWTLVNPP
         + V HP+  +  H  QPL  S    V +    +  + +++                 + EP ++KE L   +    + AM+EE ++L KN T+ LV  P
Subjt:  NQSVSHPSTSDNNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTEYTEVEPPTVKEAL---KSAHWVAAMKEEHQALIKNDTWTLVNPP

Query:  VDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFTMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTN
          K+ + CKWVFK+K++ D  + RYKARLV KGF Q+  +D+ E FSPVVK  +IR I ++A +    ++Q+D+  AFLHG L EE++M QP G   +  
Subjt:  VDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFTMALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTN

Query:  PHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHN-GTCCYILVYVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVE-
         HMVCKL K+LYGLKQAPR W+ +  SF+ S   + + +D  +     +      +L+YVDD++I G    +I +L   L + F +KDLG     LG++ 
Subjt:  PHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHN-GTCCYILVYVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVE-

Query:  VSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTR------LYRSTVGALQYATL-TRPEISYSVNKVCQFMHAPTQGHWQ
        V    S  ++LSQ KYI  +L +  M  A P++TP+     LS        + +       Y S VG+L YA + TRP+I+++V  V +F+  P + HW+
Subjt:  VSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTR------LYRSTVGALQYATL-TRPEISYSVNKVCQFMHAPTQGHWQ

Query:  AVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHIISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNI
        AVK ILRYL GT    L F   S   L+G+ DAD A D D+RKS++ +     G  I+W SK Q  ++ S+TEAEY +      ++IWL+   ++L L+ 
Subjt:  AVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHIISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNI

Query:  SRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSSSSF
         +  +++CD+  A+  S N + H+RTKH+++  +++R++V  + L V  + T    AD+ TK +  + F
Subjt:  SRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSSSSF

P92519 Uncharacterized mitochondrial protein AtMg008103.9e-5750.22Show/hide
Query:  YILVYVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPM---ISGSVLSAYHGEPFSD
        Y+L+YVDDI++TG+S  +++ LI  L   FS+KDLG + YFLG+++   PS G+FLSQTKY   IL+   M +  P++TP+   ++ SV +A + +P SD
Subjt:  YILVYVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPM---ISGSVLSAYHGEPFSD

Query:  TRLYRSTVGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITW
           +RS VGALQY TLTRP+ISY+VN VCQ MH PT   +  +KR+LRY+ GTI HGL+  K S   ++ F D+DWA     R+ST+ FC +LG N+I+W
Subjt:  TRLYRSTVGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITW

Query:  SSKKQHIISRSSTEAEYRSLAHVSADLIW
        S+K+Q  +SRSSTE EYR+LA  +A+L W
Subjt:  SSKKQHIISRSSTEAEYRSLAHVSADLIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.1e-28139.25Show/hide
Query:  TNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCETAREIWKILNDRF
        +N+ KL   N+L W  QV A+   + L  F++G   +PP T     T +A   NP+Y  W RQD LI S  LG++S  +   V    TA +IW+ L   +
Subjt:  TNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCETAREIWKILNDRF

Query:  SSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTPPLQKVYSLLLTQESRIERNSSIN
        ++ +   V Q++++L+   KG  ++ DY   +    D LA   K +  D+ V  +L+ L  EY P +  I  KD  P L +++  LL  ES+I    +++
Subjt:  SSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTPPLQKVYSLLLTQESRIERNSSIN

Query:  SDGFLPSVNYTALPSSKPQQQSSTDNTQKRGGQVRGSNGNNNG----GRSNRRNWNNNGGKP---QCQLCGRFGHTVLRCYMRFERWYQGPNNNNSQTPS
        S   +P +   A+ S +    ++ +N   R  +    N NNN       S   + NNN  KP   +CQ+CG  GH+  RC  + + +    N+    +P 
Subjt:  SDGFLPSVNYTALPSSKPQQQSSTDNTQKRGGQVRGSNGNNNG----GRSNRRNWNNNGGKP---QCQLCGRFGHTVLRCYMRFERWYQGPNNNNSQTPS

Query:  SPNFPQNQFPSSSGPGNCSSSQQPFTAYSLQHDLNKENQWYPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLK
        +P  P+      S          P+++          N W  DSGA++H+TSD +NL++   YT  + V V + + + + H GS+ + +KS P    +L 
Subjt:  SPNFPQNQFPSSSGPGNCSSSQQPFTAYSLQHDLNKENQWYPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLK

Query:  NLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKVTDGLYQFSLEKAESLTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTCNTH
        N+L+VP I KNLISV +  + N V  EF P+ + VKDL+TG  LLQGK  D LY++ +             +S QP  L  S           PS+  TH
Subjt:  NLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKVTDGLYQFSLEKAESLTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTCNTH

Query:  TLELWHKRLGHTSLPTVKRILQ--TCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYF
        +   WH RLGH +   +  ++   +  +    +K +S    C I KS+K+PF  S+   T PL+ + +D+W      S +N+RYY+ FVD ++R+TW+Y 
Subjt:  TLELWHKRLGHTSLPTVKRILQ--TCKIPYSDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYF

Query:  LNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYL
        L  K++  + F+ FK  +E      I  F +D GGEF     Y  ++ I H  S P+T + NG+ ERKHR I++ G TLLS AS+   +W  AF+ A YL
Subjt:  LNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYL

Query:  TNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTHKGYKCLSF-SGRLFISRHVQFDETKFP--AIVPSLFS
         NRLP+ + Q  SP + LFG  PNY   + FGC CYP LRPYN+HK++ +S  C+FLGYS T   Y CL   + RL+ISRHV+FDE  FP    + +L  
Subjt:  TNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTHKGYKCLSF-SGRLFISRHVQFDETKFP--AIVPSLFS

Query:  SNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHP----TPSANPSPVTALSIPSQPQPCSSAPLSMDNTTTDMTLHNNDPSPLPSP--------SSNNT
              E+  V +  P     T + V P P+ S P    TP ++PS     S  S     SS   S  ++        N P P   P        SS NT
Subjt:  SNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHP----TPSANPSPVTALSIPSQPQPCSSAPLSMDNTTTDMTLHNNDPSPLPSP--------SSNNT

Query:  ---NP-DQSLSISNQSVSHPSTSDNNHSSQPLSTSNTVLVPT-------------------SAAVANSHPMVTRAKSGIFKPR----VLLTEYTEVEPPT
           NP ++S S   QS+S P+ S ++  S   S S++   PT                   + A  N+H M TRAK+GI KP     + ++   E EP T
Subjt:  ---NP-DQSLSISNQSVSHPSTSDNNHSSQPLSTSNTVLVPT-------------------SAAVANSHPMVTRAKSGIFKPR----VLLTEYTEVEPPT

Query:  VKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDK-KIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFTMALTYG
          +ALK   W  AM  E  A I N TW LV PP     I+GC+W+F  K NSDGS+ RYKARLVAKG++Q   LDY ETFSPV+K  +IR++  +A+   
Subjt:  VKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDK-KIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFTMALTYG

Query:  WPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILVYVDDIVITG
        WP++Q+D+NNAFL G L+++V+M+QP G  D   P+ VCKL+KALYGLKQAPRAW+  L ++L +IG V+S +D+SL V     +  Y+LVYVDDI+ITG
Subjt:  WPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILVYVDDIVITG

Query:  NSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRSTVGALQYATL
        N P ++   ++ L Q+FS+KD   L YFLG+E    P+ G+ LSQ +YI D+L +T M  A P+ TPM     LS Y G   +D   YR  VG+LQY   
Subjt:  NSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRSTVGALQYATL

Query:  TRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHIISRSSTEAE
        TRP+ISY+VN++ QFMH PT+ H QA+KRILRYL GT +HG+F +K ++  L  ++DADWA D DD  ST+ + VYLG + I+WSSKKQ  + RSSTEAE
Subjt:  TRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHIISRSSTEAE

Query:  YRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSSSSFLPLRSK
        YRS+A+ S+++ W+ SL  +L + ++RPP+++CDN+GA +  ANPV HSR KH+ ID +F+R+ V    L V H+ T  Q+AD  TKPLS ++F    SK
Subjt:  YRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSSSSFLPLRSK

Query:  LNV
        + V
Subjt:  LNV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.3e-27737.86Show/hide
Query:  TNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCETAREIWKILNDRF
        +N+ KL   N+L W  QV A+   + L  F++G   +PP T     T +    NP+Y  W RQD LI S  LG++S  +   V    TA +IW+ L   +
Subjt:  TNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCETAREIWKILNDRF

Query:  SSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTPPLQKVYSLLLTQESRIERNSSIN
        ++ +   V Q++                F+      D LA   K +  D+ V  +L+ L  +Y P +  I  KD  P L +++  L+ +ES++    ++N
Subjt:  SSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTPPLQKVYSLLLTQESRIERNSSIN

Query:  SDGFLPSVNYTALPSSKPQQQSSTDNTQKRGGQVRGSNGNNNGGRSNRRNWNNNGG-------KP---QCQLCGRFGHTVLRCYMRFERWYQGPNNNNSQ
        S   +P        +    + ++T+  Q   G  R  N NNN  RSN    +++G        KP   +CQ+C   GH+  RC  +  ++    N   S 
Subjt:  SDGFLPSVNYTALPSSKPQQQSSTDNTQKRGGQVRGSNGNNNGGRSNRRNWNNNGG-------KP---QCQLCGRFGHTVLRCYMRFERWYQGPNNNNSQ

Query:  TPSSPNFPQNQFPSSSGPGNCSSSQQPFTAYSLQHDLNKENQWYPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVF
        +P +P  P+     +S          P+ A          N W  DSGA++H+TSD +NL+    YT  + V + + + + + H GS+ + + S      
Subjt:  TPSSPNFPQNQFPSSSGPGNCSSSQQPFTAYSLQHDLNKENQWYPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVF

Query:  HLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKVTDGLYQFSLEKAESLTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTC
         L  +L+VP I KNLISV +  + N+V  EF P+ + VKDL+TG  LLQGK  D LY++ +                          + S  VS   S C
Subjt:  HLKNLLHVPTITKNLISVSQFAHDNQVFFEFHPSHYFVKDLSTGRILLQGKVTDGLYQFSLEKAESLTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTC

Query:  NTHTLELWHKRLGHTSLPTVKRILQTCKIPY--SDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTW
        +  T   WH RLGH SL  +  ++    +P     +K++S    C I KSHK+PF  S+   + PL+ + +D+W      S +N+RYY+ FVD ++R+TW
Subjt:  NTHTLELWHKRLGHTSLPTVKRILQTCKIPY--SDNKMMSLWHACAIGKSHKLPFPTSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTW

Query:  IYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTA
        +Y L  K++    F++FK+ VE      I    +D GGEF +   YL ++ I H  S P+T + NG+ ERKHR I++MG TLLS AS+   +W  AFS A
Subjt:  IYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTSYLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTA

Query:  TYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTHKGYKCLSF-SGRLFISRHVQFDETKFPAIVPSLF
         YL NRLP+ + Q  SP + LFG  PNY   K FGC CYP LRPYN+HK+E +S  C F+GYS T   Y CL   +GRL+ SRHVQFDE  FP    +  
Subjt:  TYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRSTPCLFLGYSTTHKGYKCLSF-SGRLFISRHVQFDETKFPAIVPSLF

Query:  SSNNSNLEAPSVTTSLPIVKSATISHVSPFP------ADSHPTPSANPSPV------------TALSIPSQPQPC---------SSAPLSMDNTTTDMTL
         S +    + S           T   V P P       D+ P P ++PSP+            +++S PS  +P          ++ P    N+ ++  +
Subjt:  SSNNSNLEAPSVTTSLPIVKSATISHVSPFP------ADSHPTPSANPSPV------------TALSIPSQPQPC---------SSAPLSMDNTTTDMTL

Query:  HNNDPSPLPSPSSNNTN-PDQSLSISNQSVSHPSTSDNNHSSQPLSTSNTVLVP-----------TSAAVANSHPMVTRAKSGIFKPRVLLTEYTEV---
         NN     PSP+S N N P     IS+  +  PSTS +  +S   S+++T  +P            + A  N+H M TRAK GI KP    +  T +   
Subjt:  HNNDPSPLPSPSSNNTN-PDQSLSISNQSVSHPSTSDNNHSSQPLSTSNTVLVP-----------TSAAVANSHPMVTRAKSGIFKPRVLLTEYTEV---

Query:  -EPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDK-KIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFTM
         EP T  +A+K   W  AM  E  A I N TW LV PP     I+GC+W+F  K NSDGS+ RYKARLVAKG++Q   LDY ETFSPV+K  +IR++  +
Subjt:  -EPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDK-KIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFTM

Query:  ALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILVYVDD
        A+   WP++Q+D+NNAFL G L++EV+M+QP G  D   P  VC+L+KA+YGLKQAPRAW+  L ++L ++G V+S +D+SL V     +  Y+LVYVDD
Subjt:  ALTYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMTDSTNPHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILVYVDD

Query:  IVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRSTVGAL
        I+ITGN  +++   ++ L Q+FS+K+   L YFLG+E    P  G+ LSQ +Y  D+L +T M  A P+ATPM +   L+ + G    D   YR  VG+L
Subjt:  IVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRSTVGAL

Query:  QYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHIISRS
        QY   TRP++SY+VN++ Q+MH PT  HW A+KR+LRYL GT DHG+F +K ++  L  ++DADWA D DD  ST+ + VYLG + I+WSSKKQ  + RS
Subjt:  QYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHIISRS

Query:  STEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSSSSFL
        STEAEYRS+A+ S++L W+ SL  +L + +S PP+++CDN+GA +  ANPV HSR KH+ +D +F+R+ V    L V H+ T  Q+AD  TKPLS  +F 
Subjt:  STEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSSSSFL

Query:  PLRSKLNV
            K+ V
Subjt:  PLRSKLNV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.4e-11543.83Show/hide
Query:  EPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFTMAL
        EP T  EA +   W  AM +E  A+    TW +   P +KK IGCKWV+KIK NSDG++ RYKARLVAKG+ Q+  +D+ ETFSPV K  ++++I  ++ 
Subjt:  EPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFTMAL

Query:  TYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMT----DSTNPHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILVYV
         Y + L Q+DI+NAFL+G L EE++M  P G      DS  P+ VC LKK++YGLKQA R WF + S  L   G V S +D +  +         +LVYV
Subjt:  TYGWPLQQVDINNAFLHGFLSEEVFMTQPLGMT----DSTNPHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILVYV

Query:  DDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRSTVG
        DDI+I  N+   +D+L + L   F L+DLG L YFLG+E++   + GI + Q KY  D+L +T +    P + PM      SA+ G  F D + YR  +G
Subjt:  DDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRSTVG

Query:  ALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHIIS
         L Y  +TR +IS++VNK+ QF  AP   H QAV +IL Y+ GT+  GLF+   +   L+ F+DA + S  D R+ST+ +C++LG +LI+W SKKQ ++S
Subjt:  ALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHIIS

Query:  RSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRD
        +SS EAEYR+L+  + +++WL   F++L+L +S+P +L+CDN  A+H + N V H RTKH+E D + VR+
Subjt:  RSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVEIDIYFVRD

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)6.9e-1731.78Show/hide
Query:  DTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCE-TAREIWKILNDRFSSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADK
        D SS   P  E + W  +D L+  W  G++++ +L  ++    TAR++W  L + F     AR +Q ++ L T    +LS+ +Y  K+K+L D L   D 
Subjt:  DTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEVLDCE-TAREIWKILNDRFSSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADK

Query:  KISSDDHVMHILKGLGSEYDPTVSVITEKDDTPPLQKVYSLLLTQESRIERNS----SINSDGFLPSVNYTALPSSKPQQQSSTDNTQKRGGQVRGSNGN
         IS    VMH+L GL  +YD  ++VI  K   P   +  S+LL +ESR+   S    S  +   L +V +T +P  + +      N     G+ R    N
Subjt:  KISSDDHVMHILKGLGSEYDPTVSVITEKDDTPPLQKVYSLLLTQESRIERNS----SINSDGFLPSVNYTALPSSKPQQQSSTDNTQKRGGQVRGSNGN

Query:  NNGGRSNRRNWNNN
          GG S+ R  NNN
Subjt:  NNGGRSNRRNWNNN

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.9e-1544.87Show/hide
Query:  YATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFC
        Y T+TRP+++++VN++ QF  A      QAV ++L Y+ GT+  GLF+   S   L+ FAD+DWAS PD R+S + FC
Subjt:  YATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFC

ATMG00810.1 DNA/RNA polymerases superfamily protein2.8e-5850.22Show/hide
Query:  YILVYVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPM---ISGSVLSAYHGEPFSD
        Y+L+YVDDI++TG+S  +++ LI  L   FS+KDLG + YFLG+++   PS G+FLSQTKY   IL+   M +  P++TP+   ++ SV +A + +P SD
Subjt:  YILVYVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSYPPSGGIFLSQTKYIYDILHKTKMHEANPIATPM---ISGSVLSAYHGEPFSD

Query:  TRLYRSTVGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITW
           +RS VGALQY TLTRP+ISY+VN VCQ MH PT   +  +KR+LRY+ GTI HGL+  K S   ++ F D+DWA     R+ST+ FC +LG N+I+W
Subjt:  TRLYRSTVGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFRKPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITW

Query:  SSKKQHIISRSSTEAEYRSLAHVSADLIW
        S+K+Q  +SRSSTE EYR+LA  +A+L W
Subjt:  SSKKQHIISRSSTEAEYRSLAHVSADLIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.7e-2955.2Show/hide
Query:  MVTRAKSGIFK--PRVLLTEYTEV--EPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQE
        M+TR+K+GI K  P+  LT  T +  EP +V  ALK   W  AM+EE  AL +N TW LV PPV++ I+GCKWVFK K +SDG++ R KARLVAKGFHQE
Subjt:  MVTRAKSGIFK--PRVLLTEYTEV--EPPTVKEALKSAHWVAAMKEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQE

Query:  ADLDYTETFSPVVKPATIRVIFTMA
          + + ET+SPVV+ ATIR I  +A
Subjt:  ADLDYTETFSPVVKPATIRVIFTMA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCTCCAATGAAACATCTCCTTCTGAGTCGCTAAGTTCCTCTCAGCCGATGAAAGCGATTAATCCTGGAAGCAAGACCAACATCGTCAAATTAGACGAAGAGAA
TTTCTTATACTGGAAGATTCAAGTTCGAGCGGTTCTTCGCGACCATGGTTTAGGGCATTTTATTGAAGGTGATGCACATATTCCTCCGAAGACCACTCGATCAGGCGACA
CTTCCTCTGCTACTCTTCCCAATCCAGAATATGAAACATGGATTCGTCAAGACAATCTTATCACCTCGTGGTTTCTGGGCTCTATGTCGAACGAAATTCTTTCAGAGGTT
CTTGATTGTGAGACAGCTCGTGAGATCTGGAAAATTCTAAACGATCGATTCTCCTCCCGAAATCTTGCCCGTGTTATGCAGATCAAATCTAGGCTTGAAACGATCAAGAA
AGGTAATCTTAGTCTCAAAGATTACTTTGTCAAAGTCAAGAATCTTGTCGATTCTTTAGCTGCTGCTGATAAAAAGATTTCTTCAGATGACCATGTGATGCATATCTTGA
AGGGCTTGGGCTCTGAGTACGATCCTACTGTATCGGTTATCACAGAGAAGGATGATACTCCACCACTCCAGAAGGTTTACTCTTTACTCTTAACGCAAGAAAGTAGAATC
GAAAGAAATTCTTCGATAAACTCTGATGGTTTTCTACCATCGGTAAACTACACGGCTCTTCCCTCTTCAAAGCCGCAGCAACAGTCTTCTACCGATAATACTCAGAAGCG
AGGTGGACAAGTAAGAGGATCTAATGGCAATAACAATGGCGGACGATCCAATCGTAGAAACTGGAATAACAATGGAGGTAAACCGCAGTGCCAGTTATGTGGCCGATTTG
GTCATACTGTCCTCAGATGTTATATGCGTTTTGAAAGGTGGTATCAAGGACCGAATAACAATAATTCTCAGACACCGAGTAGTCCCAACTTTCCTCAAAATCAATTCCCA
TCTTCATCTGGGCCAGGGAATTGCTCTTCTTCTCAACAACCCTTCACTGCTTACTCCCTTCAGCATGACTTAAACAAAGAAAATCAGTGGTATCCGGACTCCGGTGCTTC
CAATCATGTAACGAGTGACATTTCAAATTTGACCGTGGGCTCTGAATATACTGATGACAACAAGGTTCATGTAGGCAACGATACAGGTTTGGATGTTCTTCATATTGGCT
CTTCTTTTGTTCAATCTAAATCTCATCCTAACTCTGTTTTCCACCTTAAAAATCTATTACATGTACCTACCATAACCAAAAATCTCATAAGCGTCAGTCAATTTGCTCAT
GATAATCAAGTTTTCTTTGAGTTTCACCCCTCTCATTACTTTGTGAAGGATCTTTCTACTGGCCGAATTCTACTCCAAGGCAAGGTAACTGATGGACTATACCAATTTTC
TTTGGAGAAGGCTGAATCCCTGACTCCTAAAGGTGGATGTTCTTCTTCCTTCCAACCGCAAGTACTTCATACATCTCTTTCTTCTGTGTCTAATGTTGTTTCTGATATCC
CTAGTACGTGTAATACTCATACTTTAGAACTATGGCACAAACGATTAGGCCATACTTCACTACCTACTGTCAAAAGGATCTTACAGACTTGTAAAATTCCATATTCTGAT
AATAAAATGATGTCTTTGTGGCATGCTTGTGCCATTGGAAAAAGTCACAAACTACCATTCCCTACTTCATCTACTGTGTATACCTCACCCCTTCAATTAGTTGTAGCTGA
CATATGGGGACCAGCCTTCAAAACATCTAGAAACAACTTTAGATACTACATTAGTTTTGTAGATGTATACTCAAGATTCACTTGGATATACTTCCTTAATTCAAAAAATG
AAGCGTTTCAAGCTTTTCTTCTATTCAAGACTCATGTAGAAAAACTCTTAAATTGTTCCATTCTTCGGTTCCAAACTGATGGAGGAGGTGAATTTCGTTTGTTCACCTCC
TATCTTAAAGAAAATGACATTGAACATCGTTTGTCGTGTCCCTACACCTCCCAACAAAATGGGATTGTCGAACGCAAGCATAGACAGATCATAGACATGGGGCCTACCCT
CCTGTCTCAAGCTTCTTTATCTTTAGAGTTTTGGGATGACGCCTTCTCCACAGCCACATACCTAACAAATCGGCTTCCATCCATAGTCCATCAGAACATCTCTCCCCTGG
AAGTTTTATTTGGCTATCAGCCTAATTACTCATTTTTCAAAACTTTTGGATGTCTTTGTTATCCTTTATTACGTCCATACAATAAACACAAAATTGAGCCTCGTTCTACA
CCATGCCTTTTTCTCGGGTATAGTACCACTCATAAAGGGTACAAATGCTTATCCTTTTCAGGTCGTTTATTCATCTCTAGACATGTCCAATTTGATGAAACTAAGTTTCC
AGCTATAGTTCCCTCCCTTTTCTCTTCAAATAACTCTAATCTCGAGGCTCCGTCTGTGACCACTTCTTTACCAATAGTCAAATCTGCCACTATATCTCATGTTTCCCCTT
TTCCAGCTGACAGTCATCCTACTCCCTCAGCTAACCCTTCACCCGTGACTGCTTTGTCTATTCCCAGTCAACCACAACCCTGTTCTAGTGCACCCTTGTCTATGGACAAT
ACAACTACTGATATGACACTTCATAACAATGATCCTTCACCCCTTCCCTCTCCTTCCTCAAACAACACCAATCCAGACCAAAGTTTATCCATTTCAAACCAGTCTGTTTC
ACATCCCTCTACCTCAGACAATAATCATTCTTCTCAGCCCTTGAGCACGTCGAACACTGTACTTGTGCCTACTTCTGCCGCAGTTGCTAATAGTCATCCCATGGTCACTC
GAGCGAAGTCAGGTATCTTTAAACCAAGGGTTCTCTTAACTGAGTATACTGAGGTAGAACCACCCACAGTGAAGGAGGCTTTGAAAAGTGCTCATTGGGTTGCTGCAATG
AAGGAGGAACACCAGGCCTTGATAAAAAATGATACCTGGACTCTTGTCAATCCTCCTGTTGACAAGAAAATCATTGGATGCAAGTGGGTGTTTAAAATAAAAAGGAACTC
GGATGGTTCAGTGGCTCGATATAAAGCTCGCCTAGTTGCAAAAGGGTTCCATCAAGAGGCTGACTTAGACTATACAGAGACGTTCAGCCCTGTTGTCAAACCAGCTACCA
TCCGAGTTATTTTTACCATGGCTCTCACTTATGGCTGGCCATTGCAACAAGTTGATATAAATAACGCGTTTCTTCATGGATTTCTCAGTGAAGAGGTGTTTATGACACAA
CCTCTGGGGATGACAGATAGTACAAATCCACACATGGTCTGCAAGCTGAAGAAAGCTCTATATGGCTTAAAACAAGCCCCTAGAGCGTGGTTTGACAGGTTGAGTTCTTT
TCTTCACTCTATTGGATTGGTTAGCTCAAAAACTGATTCTTCTTTGCTGGTGTGTCATCACAATGGTACTTGCTGCTATATCTTAGTGTATGTAGATGATATTGTGATTA
CTGGCAACTCTCCTCTTGTCATAGATCAGCTGATTAACATGCTGCATCAGAAATTCTCATTAAAGGATCTTGGCCTGTTAAGCTACTTTTTGGGTGTGGAGGTCTCTTAC
CCGCCCTCTGGTGGAATTTTCCTATCCCAAACCAAATACATTTATGATATTCTACACAAGACTAAAATGCATGAGGCAAATCCCATTGCTACTCCTATGATTAGTGGCTC
GGTATTATCGGCCTACCATGGTGAACCTTTCTCTGATACTCGACTATATAGGAGTACTGTTGGTGCACTCCAATATGCCACTCTCACCAGACCTGAAATCTCTTACAGTG
TCAACAAAGTATGTCAGTTCATGCACGCTCCGACTCAAGGTCATTGGCAGGCTGTTAAAAGAATCCTTCGTTACCTTATTGGTACTATTGATCATGGCCTATTCTTTCGT
AAACCGAGCTCCTTCTGTCTTGAAGGCTTTGCTGATGCAGATTGGGCCTCAGATCCTGATGATCGCAAATCTACATCGAGTTTTTGTGTCTATCTTGGTGGCAATCTAAT
TACTTGGTCTTCCAAAAAGCAGCACATTATATCTCGGTCCAGCACCGAAGCAGAGTATAGAAGCCTGGCTCACGTATCTGCAGATCTCATCTGGCTGCAATCACTCTTCA
AGGATCTTCGGCTAAACATCTCAAGACCTCCAATTCTGTGGTGTGACAATCTTGGTGCAGTGCATTTTAGTGCCAACCCTGTCCTTCACTCCAGAACTAAGCATGTAGAA
ATTGACATCTACTTCGTTCGTGATCTTGTTCTGCAAAAGCGCCTCACTGTATGTCATCTCCCTACGACTGCCCAGATAGCTGATATCTTCACCAAACCGCTTTCTTCTTC
TAGTTTTTTACCTCTCAGAAGCAAGCTCAATGTCTGTTCTTCCAACACCATTGGCTTGAGGGGGGTGTTAAGAGAGCACACTGATATGAGTCCCCTGCTCGCGAAAAACA
CACAAGAGAGGGAGAGTGGGTTTCCCAGATCTGTGTGGTCCAGTGCGACGATTGAGCGTGGGTATCCTAGATCTGGGTTTCCTAGATGTTGTGTAGTCAGACGAGCAGTA
ATCCGAGAGAAAGAGAGAGAAAATCGAGAGAAAGAGAGGAAGGATTCGAGAAGGAAAGCTATTCGAGTATTTGGTGCGGCGACTGAGTGTGGGTATGCCTTCGACGGAGC
GTGGGTATGCATGCAGCAGGGGACTCCGGTGTTGAAATTGGCTTCTTCGTTCGCCATTGAAACGCAGAGACAAATGACAGAGAAGAACAGAACGCTAGTTCTTGCTTACC
TCTTGATGGGCCTCCTCATAGCTTCTCAATGTGTTAATTCGCATGTCTTCGTGATTCTTGAATCGGCTGAATCCAGAGAAACGCAATTGGGCAGAAGATTCATGCTCGTC
AATACAGGCGAAGAACAGGCGAAAGAAAGCAAACGAAAGAATCCTTCAGGGCCGAACCCAATTGGGAATCGACACCCACCGTCGATCCAAGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCCTCCAATGAAACATCTCCTTCTGAGTCGCTAAGTTCCTCTCAGCCGATGAAAGCGATTAATCCTGGAAGCAAGACCAACATCGTCAAATTAGACGAAGAGAA
TTTCTTATACTGGAAGATTCAAGTTCGAGCGGTTCTTCGCGACCATGGTTTAGGGCATTTTATTGAAGGTGATGCACATATTCCTCCGAAGACCACTCGATCAGGCGACA
CTTCCTCTGCTACTCTTCCCAATCCAGAATATGAAACATGGATTCGTCAAGACAATCTTATCACCTCGTGGTTTCTGGGCTCTATGTCGAACGAAATTCTTTCAGAGGTT
CTTGATTGTGAGACAGCTCGTGAGATCTGGAAAATTCTAAACGATCGATTCTCCTCCCGAAATCTTGCCCGTGTTATGCAGATCAAATCTAGGCTTGAAACGATCAAGAA
AGGTAATCTTAGTCTCAAAGATTACTTTGTCAAAGTCAAGAATCTTGTCGATTCTTTAGCTGCTGCTGATAAAAAGATTTCTTCAGATGACCATGTGATGCATATCTTGA
AGGGCTTGGGCTCTGAGTACGATCCTACTGTATCGGTTATCACAGAGAAGGATGATACTCCACCACTCCAGAAGGTTTACTCTTTACTCTTAACGCAAGAAAGTAGAATC
GAAAGAAATTCTTCGATAAACTCTGATGGTTTTCTACCATCGGTAAACTACACGGCTCTTCCCTCTTCAAAGCCGCAGCAACAGTCTTCTACCGATAATACTCAGAAGCG
AGGTGGACAAGTAAGAGGATCTAATGGCAATAACAATGGCGGACGATCCAATCGTAGAAACTGGAATAACAATGGAGGTAAACCGCAGTGCCAGTTATGTGGCCGATTTG
GTCATACTGTCCTCAGATGTTATATGCGTTTTGAAAGGTGGTATCAAGGACCGAATAACAATAATTCTCAGACACCGAGTAGTCCCAACTTTCCTCAAAATCAATTCCCA
TCTTCATCTGGGCCAGGGAATTGCTCTTCTTCTCAACAACCCTTCACTGCTTACTCCCTTCAGCATGACTTAAACAAAGAAAATCAGTGGTATCCGGACTCCGGTGCTTC
CAATCATGTAACGAGTGACATTTCAAATTTGACCGTGGGCTCTGAATATACTGATGACAACAAGGTTCATGTAGGCAACGATACAGGTTTGGATGTTCTTCATATTGGCT
CTTCTTTTGTTCAATCTAAATCTCATCCTAACTCTGTTTTCCACCTTAAAAATCTATTACATGTACCTACCATAACCAAAAATCTCATAAGCGTCAGTCAATTTGCTCAT
GATAATCAAGTTTTCTTTGAGTTTCACCCCTCTCATTACTTTGTGAAGGATCTTTCTACTGGCCGAATTCTACTCCAAGGCAAGGTAACTGATGGACTATACCAATTTTC
TTTGGAGAAGGCTGAATCCCTGACTCCTAAAGGTGGATGTTCTTCTTCCTTCCAACCGCAAGTACTTCATACATCTCTTTCTTCTGTGTCTAATGTTGTTTCTGATATCC
CTAGTACGTGTAATACTCATACTTTAGAACTATGGCACAAACGATTAGGCCATACTTCACTACCTACTGTCAAAAGGATCTTACAGACTTGTAAAATTCCATATTCTGAT
AATAAAATGATGTCTTTGTGGCATGCTTGTGCCATTGGAAAAAGTCACAAACTACCATTCCCTACTTCATCTACTGTGTATACCTCACCCCTTCAATTAGTTGTAGCTGA
CATATGGGGACCAGCCTTCAAAACATCTAGAAACAACTTTAGATACTACATTAGTTTTGTAGATGTATACTCAAGATTCACTTGGATATACTTCCTTAATTCAAAAAATG
AAGCGTTTCAAGCTTTTCTTCTATTCAAGACTCATGTAGAAAAACTCTTAAATTGTTCCATTCTTCGGTTCCAAACTGATGGAGGAGGTGAATTTCGTTTGTTCACCTCC
TATCTTAAAGAAAATGACATTGAACATCGTTTGTCGTGTCCCTACACCTCCCAACAAAATGGGATTGTCGAACGCAAGCATAGACAGATCATAGACATGGGGCCTACCCT
CCTGTCTCAAGCTTCTTTATCTTTAGAGTTTTGGGATGACGCCTTCTCCACAGCCACATACCTAACAAATCGGCTTCCATCCATAGTCCATCAGAACATCTCTCCCCTGG
AAGTTTTATTTGGCTATCAGCCTAATTACTCATTTTTCAAAACTTTTGGATGTCTTTGTTATCCTTTATTACGTCCATACAATAAACACAAAATTGAGCCTCGTTCTACA
CCATGCCTTTTTCTCGGGTATAGTACCACTCATAAAGGGTACAAATGCTTATCCTTTTCAGGTCGTTTATTCATCTCTAGACATGTCCAATTTGATGAAACTAAGTTTCC
AGCTATAGTTCCCTCCCTTTTCTCTTCAAATAACTCTAATCTCGAGGCTCCGTCTGTGACCACTTCTTTACCAATAGTCAAATCTGCCACTATATCTCATGTTTCCCCTT
TTCCAGCTGACAGTCATCCTACTCCCTCAGCTAACCCTTCACCCGTGACTGCTTTGTCTATTCCCAGTCAACCACAACCCTGTTCTAGTGCACCCTTGTCTATGGACAAT
ACAACTACTGATATGACACTTCATAACAATGATCCTTCACCCCTTCCCTCTCCTTCCTCAAACAACACCAATCCAGACCAAAGTTTATCCATTTCAAACCAGTCTGTTTC
ACATCCCTCTACCTCAGACAATAATCATTCTTCTCAGCCCTTGAGCACGTCGAACACTGTACTTGTGCCTACTTCTGCCGCAGTTGCTAATAGTCATCCCATGGTCACTC
GAGCGAAGTCAGGTATCTTTAAACCAAGGGTTCTCTTAACTGAGTATACTGAGGTAGAACCACCCACAGTGAAGGAGGCTTTGAAAAGTGCTCATTGGGTTGCTGCAATG
AAGGAGGAACACCAGGCCTTGATAAAAAATGATACCTGGACTCTTGTCAATCCTCCTGTTGACAAGAAAATCATTGGATGCAAGTGGGTGTTTAAAATAAAAAGGAACTC
GGATGGTTCAGTGGCTCGATATAAAGCTCGCCTAGTTGCAAAAGGGTTCCATCAAGAGGCTGACTTAGACTATACAGAGACGTTCAGCCCTGTTGTCAAACCAGCTACCA
TCCGAGTTATTTTTACCATGGCTCTCACTTATGGCTGGCCATTGCAACAAGTTGATATAAATAACGCGTTTCTTCATGGATTTCTCAGTGAAGAGGTGTTTATGACACAA
CCTCTGGGGATGACAGATAGTACAAATCCACACATGGTCTGCAAGCTGAAGAAAGCTCTATATGGCTTAAAACAAGCCCCTAGAGCGTGGTTTGACAGGTTGAGTTCTTT
TCTTCACTCTATTGGATTGGTTAGCTCAAAAACTGATTCTTCTTTGCTGGTGTGTCATCACAATGGTACTTGCTGCTATATCTTAGTGTATGTAGATGATATTGTGATTA
CTGGCAACTCTCCTCTTGTCATAGATCAGCTGATTAACATGCTGCATCAGAAATTCTCATTAAAGGATCTTGGCCTGTTAAGCTACTTTTTGGGTGTGGAGGTCTCTTAC
CCGCCCTCTGGTGGAATTTTCCTATCCCAAACCAAATACATTTATGATATTCTACACAAGACTAAAATGCATGAGGCAAATCCCATTGCTACTCCTATGATTAGTGGCTC
GGTATTATCGGCCTACCATGGTGAACCTTTCTCTGATACTCGACTATATAGGAGTACTGTTGGTGCACTCCAATATGCCACTCTCACCAGACCTGAAATCTCTTACAGTG
TCAACAAAGTATGTCAGTTCATGCACGCTCCGACTCAAGGTCATTGGCAGGCTGTTAAAAGAATCCTTCGTTACCTTATTGGTACTATTGATCATGGCCTATTCTTTCGT
AAACCGAGCTCCTTCTGTCTTGAAGGCTTTGCTGATGCAGATTGGGCCTCAGATCCTGATGATCGCAAATCTACATCGAGTTTTTGTGTCTATCTTGGTGGCAATCTAAT
TACTTGGTCTTCCAAAAAGCAGCACATTATATCTCGGTCCAGCACCGAAGCAGAGTATAGAAGCCTGGCTCACGTATCTGCAGATCTCATCTGGCTGCAATCACTCTTCA
AGGATCTTCGGCTAAACATCTCAAGACCTCCAATTCTGTGGTGTGACAATCTTGGTGCAGTGCATTTTAGTGCCAACCCTGTCCTTCACTCCAGAACTAAGCATGTAGAA
ATTGACATCTACTTCGTTCGTGATCTTGTTCTGCAAAAGCGCCTCACTGTATGTCATCTCCCTACGACTGCCCAGATAGCTGATATCTTCACCAAACCGCTTTCTTCTTC
TAGTTTTTTACCTCTCAGAAGCAAGCTCAATGTCTGTTCTTCCAACACCATTGGCTTGAGGGGGGTGTTAAGAGAGCACACTGATATGAGTCCCCTGCTCGCGAAAAACA
CACAAGAGAGGGAGAGTGGGTTTCCCAGATCTGTGTGGTCCAGTGCGACGATTGAGCGTGGGTATCCTAGATCTGGGTTTCCTAGATGTTGTGTAGTCAGACGAGCAGTA
ATCCGAGAGAAAGAGAGAGAAAATCGAGAGAAAGAGAGGAAGGATTCGAGAAGGAAAGCTATTCGAGTATTTGGTGCGGCGACTGAGTGTGGGTATGCCTTCGACGGAGC
GTGGGTATGCATGCAGCAGGGGACTCCGGTGTTGAAATTGGCTTCTTCGTTCGCCATTGAAACGCAGAGACAAATGACAGAGAAGAACAGAACGCTAGTTCTTGCTTACC
TCTTGATGGGCCTCCTCATAGCTTCTCAATGTGTTAATTCGCATGTCTTCGTGATTCTTGAATCGGCTGAATCCAGAGAAACGCAATTGGGCAGAAGATTCATGCTCGTC
AATACAGGCGAAGAACAGGCGAAAGAAAGCAAACGAAAGAATCCTTCAGGGCCGAACCCAATTGGGAATCGACACCCACCGTCGATCCAAGGCTGA
Protein sequenceShow/hide protein sequence
MDSSNETSPSESLSSSQPMKAINPGSKTNIVKLDEENFLYWKIQVRAVLRDHGLGHFIEGDAHIPPKTTRSGDTSSATLPNPEYETWIRQDNLITSWFLGSMSNEILSEV
LDCETAREIWKILNDRFSSRNLARVMQIKSRLETIKKGNLSLKDYFVKVKNLVDSLAAADKKISSDDHVMHILKGLGSEYDPTVSVITEKDDTPPLQKVYSLLLTQESRI
ERNSSINSDGFLPSVNYTALPSSKPQQQSSTDNTQKRGGQVRGSNGNNNGGRSNRRNWNNNGGKPQCQLCGRFGHTVLRCYMRFERWYQGPNNNNSQTPSSPNFPQNQFP
SSSGPGNCSSSQQPFTAYSLQHDLNKENQWYPDSGASNHVTSDISNLTVGSEYTDDNKVHVGNDTGLDVLHIGSSFVQSKSHPNSVFHLKNLLHVPTITKNLISVSQFAH
DNQVFFEFHPSHYFVKDLSTGRILLQGKVTDGLYQFSLEKAESLTPKGGCSSSFQPQVLHTSLSSVSNVVSDIPSTCNTHTLELWHKRLGHTSLPTVKRILQTCKIPYSD
NKMMSLWHACAIGKSHKLPFPTSSTVYTSPLQLVVADIWGPAFKTSRNNFRYYISFVDVYSRFTWIYFLNSKNEAFQAFLLFKTHVEKLLNCSILRFQTDGGGEFRLFTS
YLKENDIEHRLSCPYTSQQNGIVERKHRQIIDMGPTLLSQASLSLEFWDDAFSTATYLTNRLPSIVHQNISPLEVLFGYQPNYSFFKTFGCLCYPLLRPYNKHKIEPRST
PCLFLGYSTTHKGYKCLSFSGRLFISRHVQFDETKFPAIVPSLFSSNNSNLEAPSVTTSLPIVKSATISHVSPFPADSHPTPSANPSPVTALSIPSQPQPCSSAPLSMDN
TTTDMTLHNNDPSPLPSPSSNNTNPDQSLSISNQSVSHPSTSDNNHSSQPLSTSNTVLVPTSAAVANSHPMVTRAKSGIFKPRVLLTEYTEVEPPTVKEALKSAHWVAAM
KEEHQALIKNDTWTLVNPPVDKKIIGCKWVFKIKRNSDGSVARYKARLVAKGFHQEADLDYTETFSPVVKPATIRVIFTMALTYGWPLQQVDINNAFLHGFLSEEVFMTQ
PLGMTDSTNPHMVCKLKKALYGLKQAPRAWFDRLSSFLHSIGLVSSKTDSSLLVCHHNGTCCYILVYVDDIVITGNSPLVIDQLINMLHQKFSLKDLGLLSYFLGVEVSY
PPSGGIFLSQTKYIYDILHKTKMHEANPIATPMISGSVLSAYHGEPFSDTRLYRSTVGALQYATLTRPEISYSVNKVCQFMHAPTQGHWQAVKRILRYLIGTIDHGLFFR
KPSSFCLEGFADADWASDPDDRKSTSSFCVYLGGNLITWSSKKQHIISRSSTEAEYRSLAHVSADLIWLQSLFKDLRLNISRPPILWCDNLGAVHFSANPVLHSRTKHVE
IDIYFVRDLVLQKRLTVCHLPTTAQIADIFTKPLSSSSFLPLRSKLNVCSSNTIGLRGVLREHTDMSPLLAKNTQERESGFPRSVWSSATIERGYPRSGFPRCCVVRRAV
IREKERENREKERKDSRRKAIRVFGAATECGYAFDGAWVCMQQGTPVLKLASSFAIETQRQMTEKNRTLVLAYLLMGLLIASQCVNSHVFVILESAESRETQLGRRFMLV
NTGEEQAKESKRKNPSGPNPIGNRHPPSIQG