CuGenDBv2

Lag0009021 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0009021
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr9:34117551..34128727
RNA-Seq Expression: Lag0009021
Synteny: Lag0009021
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0003824 - catalytic activity (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
  IPR043502 - DNA/RNA polymerase superfamily
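
The genome location field uses a `chrom:start..end` layout. A minimal sketch of how such a string could be split and the span length derived is shown here; the function names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into chromosome, start and end."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr9:34117551..34128727")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 11177 bp on chr9.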


Homology
GenBank top hits | e value | % identity
CAN61322.1 hypothetical protein VITISV_012106 [Vitis vinifera] | 0.0e+00 | 41.44
Query:  MANASSMSSTSVTNVG---NTTFTSPPLNQLLNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDG
        MA+  + SS+S  ++G   ++T  S P  Q+LN    +KLDR NY+LW++    ++ +   E  + GT  CP +         +++ G            
Subjt:  MANASSMSSTSVTNVG---NTTFTSPPLNQLLNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDG

Query:  SGASTSEARLSMNPQYEAWVTVDQLLLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGL
                   MNP + AW   D+ +L W+Y+S+TP +  Q++G   +   W+A++ +F   SRA    LR   Q T+KG+  M DY+  +K  ADNL  
Subjt:  SGASTSEARLSMNPQYEAWVTVDQLLLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGL

Query:  AGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGR-ASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNN
         G PVS ++ V  +L GL  +YNA+V  I  R   ++   + + LL FE RLE Q+S++  +       AN ASS       +       G G  P  NN
Subjt:  AGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGR-ASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNN

Query:  YNQRGSGNRGRG--RGRGYNNYNNRQICQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGA
        Y  RG G  GR    GR  ++ + +  CQ+CGK GH+A +CY+RF+  F        G    +H+ N G NQ +   M    +  P   AD +WY DSGA
Subjt:  YNQRGSGNRGRG--RGRGYNNYNNRQICQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGA

Query:  SNHVTSNYDNLSNPTDYEGNECVTIGNGDKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGAL
        S+H+T N  NL++ + Y G + VTIGNG  L I+ IGS +L    H  +L+ V  VP I+ NL+S++K   +NN  IEFH N   VKD  T  V+ +G L
Subjt:  SNHVTSNYDNLSNPTDYEGNECVTIGNGDKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGAL

Query:  KDGLYQLQGV-------NLRNLSFSASSSSMRQENKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQFCESC
        ++GLY+           ++ N S   S  S   ENK E                         +WH RLGH S  +++ ++  C ++    +    C  C
Subjt:  KDGLYQLQGV-------NLRNLSFSASSSSMRQENKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQFCESC

Query:  QFGKSHALKFPLSDSRASKRFDLIHTDIWGPAPVLSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHR
        Q  KSH L   LS+  ASK  +L++TDIWGPA + S  G RY++LF+DDYSRY W Y L+ K   L  F  F   ++ QF + IK +QSDNGGE+     
Subjt:  QFGKSHALKFPLSDSRASKRFDLIHTDIWGPAPVLSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHR

Query:  LCNQLGIQSRYSCPHTSAQNGRAERKHRHVVETGLTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPY
            +GI  R+SCP+ S QNGR ERKHRHVVETGL LL+ AS+P+ YW  AF     LIN +P+ VL+  SP   +F +  D+ + + FGC CYP +RPY
Subjt:  LCNQLGIQSRYSCPHTSAQNGRAERKHRHVVETGLTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPY

Query:  QNHKFYFHTDQCVNLGLSASHKGYRCMNKA-GRVFVSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLTCVQP
          HK  + + QC+ LG S +HKG+ C++ A GRV+++ HV FDE TFP A        S S SN T A               G     +  P   C+ P
Subjt:  QNHKFYFHTDQCVNLGLSASHKGYRCMNKA-GRVFVSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLTCVQP

Query:  SPSPAPLQQPTGQNNEPCSQTSPSPPPSQQPAVQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQST---HPMVTRGKAGIFKPKA
            + +   +  ++   +  SP P  S  P                        +TS +SPA + SP+++    P PQ T     M TR   GI K K 
Subjt:  SPSPAPLQQPTGQNNEPCSQTSPSPPPSQQPAVQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQST---HPMVTRGKAGIFKPKA

Query:  WLSRQQVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVK
         L    +   ++EP+ ++ A   P W  AM+ E +AL +N TW LV   P+ NV+G KW++++K   DGSI+RYKARLVAKG++Q  G+D+FETFSPVVK
Subjt:  WLSRQQVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVK

Query:  ASTIRIVLSLAVTRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENV
        A+TIRI+L++A++  WE+RQLD +NAFLNG L E VYM QPPGY DP  PN VC+LKKA+YGLKQAPRAW   L + LL WGF  SR+D+S+F+   +  
Subjt:  ASTIRIVLSLAVTRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENV

Query:  CLLLLVYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPF
         L++LVYVDD++VTG++S  I+ LI +LD+ FAL+DLG+L++FLGI+V+Y    + L+Q KYI DLL + +L   KPA +P  +GK +S  DG P+ D  
Subjt:  CLLLLVYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPF

Query:  IYRSTIGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSS
         YRS +GALQY+T TRPDIA+ +N+  QF+Q PT  HW +VKR+LRYL GT   GLLF P SNL++  F+DADW +++DDR+S + Y V+LG NLVSWSS
Subjt:  IYRSTIGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSS

Query:  KKQSVVARSSTESEYRALSLASAEIIWLQQLLKELGCH-SSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLT
         KQ VV+RSS ESEYR L  A+AEI+W+Q LL+EL     + P+LW DNISA  +A NPVFHARTKHIE+D+HF+RDQ++ G +++++VP+ +Q  D LT
Subjt:  KKQSVVARSSTESEYRALSLASAEIIWLQQLLKELGCH-SSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLT

Query:  KPLTHTQFLYLRSKLGLVDTPSRLRGDIKEPSHSVSSASPSKKHNPEEEQAKGS
        K LT ++FL L+S+L +   P  LRGD K  +              EE +  GS
Subjt:  KPLTHTQFLYLRSKLGLVDTPSRLRGDIKEPSHSVSSASPSKKHNPEEEQAKGS
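
In each alignment block the unlabeled middle line marks identical residues with the residue letter, similar residues with `+`, and everything else with a space. As a rough sketch, the identity of a single block could be tallied from the query line and that middle line as follows. The function name `block_identity` is hypothetical, and the % identity values listed on this page are presumably computed over the whole alignment, so per-block values will differ.

```python
def block_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical positions, block length) for one alignment block.

    `query` and `midline` are the sequence parts of the lines, with the
    'Query:  ' label and the matching leading indent removed.
    """
    # Trailing spaces of the middle line are often lost in plain text,
    # so pad it back to the query length before comparing.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return identities, len(query)


# Toy block (not taken from this page): 3 identities over 5 columns.
identical, length = block_identity("MANAS", "MA+ S")
print(f"{identical}/{length} = {100 * identical / length:.1f}%")
```

Summing the identical positions and the block lengths over all blocks of one hit, then dividing, gives an overall figure comparable to the % identity column.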

GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum] | 0.0e+00 | 46.07
Query:  TIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDGSGASTSEARLSMNPQYEAWVTVDQLLLGWLYNSMTP
        ++KLDR NY LWK+L +P++R  KL+G++LGT+ CP EFI                           ++S++  + N  +  W   DQ LLGW+ NSMT 
Subjt:  TIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDGSGASTSEARLSMNPQYEAWVTVDQLLLGWLYNSMTP

Query:  EVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGRASVT
        E+ATQ++  E +K LW   Q L G  +R++  +L+  F   RKG  KM DYL  MK   D L LAG+PVS  +L+ Q L GLD EYN +V  +  + +++
Subjt:  EVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGRASVT

Query:  WAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRG--RGRGRGYNNYNNRQICQVCGKVGHS
        W +LQA+LL FE R+E  N++ N T    NA AN+A                N + +R   +N N RGS +RG   GRGRG +  N    CQVCG   H 
Subjt:  WAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRG--RGRGRGYNNYNNRQICQVCGKVGHS

Query:  ALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGASNHVTSNYDNLSNPTDYEGNECVTIGNGDKLPITCIG
        A+ C++RF+K +S            NH+    +    NAF+A+Q      ++ D +WY DSGASNHVT   +   + T++ G   + +GNG+KL I   G
Subjt:  ALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGASNHVTSNYDNLSNPTDYEGNECVTIGNGDKLPITCIG

Query:  SSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDGLYQLQGVNLRNLSFSASSSSMRQENKIEKSYNEG
        SS+L      L L  +L VP+I KNL+S+SKLA DNN+ +EF  N C VKDK TG+V+LKG LKDGLYQL G   RN                       
Subjt:  SSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDGLYQLQGVNLRNLSFSASSSSMRQENKIEKSYNEG

Query:  AVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQFCESCQFGKSHALKFPLSDSRASKRFDLIHTDIWGPAPVLSGDGYRYYV
                 P A ++V K+ WHRRLGHP+ KVL+ +++ CK+ V  ++   FCE+CQ+GK H L F  S S A +  +L+HTD+WGPAP+++  G++YYV
Subjt:  AVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQFCESCQFGKSHALKFPLSDSRASKRFDLIHTDIWGPAPVLSGDGYRYYV

Query:  LFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCNQLGIQSRYSCPHTSAQNGRAERKHRHVVETGLTLLAQASMP
         F+DD+SR+ W+YPLK KS+T+ AF  F  + + QF   IK IQ D GGEY  V +L  + GIQ R SCP+TS QNGRAERKHRH+ E GLTLLAQA MP
Subjt:  LFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCNQLGIQSRYSCPHTSAQNGRAERKHRHVVETGLTLLAQASMP

Query:  LAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVNLGLSASHKGYRCMNKAGRVFVSRHVKFDEE
        L YWW+AF  A  LIN LP+ V + +SP  LM  K+ D+  LKTFGC+CYPCL+PY  HK  +HT +CV LG S SHKGY+C+N  GR+F+SRHV F+E+
Subjt:  LAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVNLGLSASHKGYRCMNKAGRVFVSRHVKFDEE

Query:  TFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIP--QSGIFSPPVNQPPLTCVQPSPSPAPLQQPTGQNNEPCSQTSPSPPPSQQPAVQNTSPSILPF
         FPF  GF    S +    TT+        P  + P   +G      + P L    P+ +     Q    + E   QT      +  P+  NT+      
Subjt:  TFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIP--QSGIFSPPVNQPPLTCVQPSPSPAPLQQPTGQNNEPCSQTSPSPPPSQQPAVQNTSPSILPF

Query:  PNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHPMVTRGKAGIFKPK-AWLSRQQVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSL
          Q+ SV     NT+                     ++H + TR K+GI KPK  ++   +      EP   ++AL+ P WK AM  EF AL+ N+TW L
Subjt:  PNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHPMVTRGKAGIFKPK-AWLSRQQVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSL

Query:  VPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAVTRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYV
        VP+    N+V +KW+F+ K   DGS++R KARLVAKGF Q  G+D+ ETFSPV+KAST+RI+LS+AV   WE+RQLD NNAFLNG L E V+M QP G+V
Subjt:  VPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAVTRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYV

Query:  DPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLLVYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLG
        D  +PNH+CKL KAIYGLKQAPRAW  +LK  LL+WGF N++SD+SLF+ + ++    LL+YVDD+IVTG+N K +   I +L++ F+LKDLG L+YFLG
Subjt:  DPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLLVYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLG

Query:  IQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRSTIGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVL
        I+V    SG+ L Q+KYI DLL K  + +  P P+P + G++ ++ +G+ L+DP ++R  IG LQYLT T PDIA+ +N+LSQ++ +P+  HWQ +KR+L
Subjt:  IQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRSTIGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVL

Query:  RYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSVVARSSTESEYRALSLASAEIIWLQQLLKELGCH-SSKPIL
        RYL GT +  L  +P ++L ++ FSDADWA++IDDRKS++  CVFLG  L+SWSS+KQ VV+RSSTESEYRAL+  +AEI W++ LL EL      KPIL
Subjt:  RYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSVVARSSTESEYRALSLASAEIIWLQQLLKELGCH-SSKPIL

Query:  WCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTHTQFLYLRSKLGLVDTP
        WCDN+SA ALA+NPV HAR+KHIE+DVH++RDQ+L   + V YVP+ DQ+ADCLTKPL+HT+F  LR KLG++ +P
Subjt:  WCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTHTQFLYLRSKLGLVDTP

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum] | 0.0e+00 | 44.64
Query:  SPPLNQLLNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDGSGASTSEARLSMNPQYEAWVTVDQ
        SP  N  L  I ++KLDR NY LWK+L + ++R  KL+G++LGT  CP +F+                           ++++    +NP +  W+  DQ
Subjt:  SPPLNQLLNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDGSGASTSEARLSMNPQYEAWVTVDQ

Query:  LLLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNA
         LLGWL NSM  ++ATQ++  E +K LW   Q L G  +++   +L+  F  TRKG  KM +YL  MK  +D L LAGSP+SN +L+ Q L GLD EYN 
Subjt:  LLLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNA

Query:  IVAMIQGRASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRGRGRGRGYNNYNNRQI
        +V  +  + +++W ++QA+LL FE RL+  N+    T  +    AN    +G       N+  S GN  R   N    RG    GRG+GR  N       
Subjt:  IVAMIQGRASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRGRGRGRGYNNYNNRQI

Query:  CQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGASNHVTSNYDNLSNPTDYEGNECVTIGN
        CQVC   GH A+ C  RF++ ++        +  G+H          +AF+     A+P    D  WY DSGA+NHVT   D      ++ G   + +GN
Subjt:  CQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGASNHVTSNYDNLSNPTDYEGNECVTIGN

Query:  GDKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDGLYQLQGVNLRNLSFSASSSSMRQE
        G+KL I   GS++L +    L L  VL VP I KNL+S+SKL  DNN+ +EF  N C VKDK TG+ +LKG LKDGLYQL                    
Subjt:  GDKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDGLYQLQGVNLRNLSFSASSSSMRQE

Query:  NKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQFCESCQFGKSHALKFPLSDSRASKRFDLIHTDIWGPAPV
                       SN  PC  M+V K+ WHR+LGHP+ KVL+ ++KDC + +  ++   FCE+CQFGK H L F  S S   +   LIH+D+WGPAP+
Subjt:  NKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQFCESCQFGKSHALKFPLSDSRASKRFDLIHTDIWGPAPV

Query:  LSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCNQLGIQSRYSCPHTSAQNGRAERKHRHVVETG
        LS  G++YYV F+DD+SR+ W++PLK KSDT+ AF  F  + + QF   IK IQ D GGEY  V ++  + GIQ R SCP+TS QNGRAERKHRHV E G
Subjt:  LSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCNQLGIQSRYSCPHTSAQNGRAERKHRHVVETG

Query:  LTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVNLGLSASHKGYRCMNKAGRVF
        LTLLAQA MPL YWW+AF  A  LIN LP++V   +SP  LMF ++ D+ ALK FGC+CYPCL+PY  HK  FHT +CV +G S SHKGY+C+N  GR+F
Subjt:  LTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVNLGLSASHKGYRCMNKAGRVF

Query:  VSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLT--CVQPSPSPAPLQQ----PTGQNNEPCSQTSPSPPPSQ
        VSRHV F+E  FPF  GF    + +     TL  +            S I  P  +    T   ++P  +    Q      +  NNE   Q   S     
Subjt:  VSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLT--CVQPSPSPAPLQQ----PTGQNNEPCSQTSPSPPPSQ

Query:  QPAVQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHPMVTRGKAGIFKPK-AWLSRQQVDWSLTEPTRVQDALATPQWKAAMD
             NT+ S       + SV S D N S  +   +   +   NSN     TH M TR K GI KPK  ++   + D    EP  V++AL  P WK AMD
Subjt:  QPAVQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHPMVTRGKAGIFKPK-AWLSRQQVDWSLTEPTRVQDALATPQWKAAMD

Query:  TEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAVTRGWELRQLDFNNAFLNGT
         E+ AL+ N TW+LVP+    N++ +KWIF+ K  +DGSI+R KARLVAKGF Q  G+DF ETFSPVVK+ST+RI+L++AV   WE+RQLD NNAFLNG 
Subjt:  TEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAVTRGWELRQLDFNNAFLNGT

Query:  LNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLLVYVDDVIVTGNNSKMINRLIVELDNR
        L E V+M QP GY+D  +PNH+CKL KAIYGLKQAPRAW  +L++ L++WGF N+++D SLF  +  +    LL+YVDD+IVTG+N K +     +L+  
Subjt:  LNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLLVYVDDVIVTGNNSKMINRLIVELDNR

Query:  FALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRSTIGALQYLTTTRPDIAYIINQLSQFLQ
        ++LKDLG L+YFLG++V    SG+ L Q KYI D+L K ++ +    P+P V G++  I +G+ + +P +YR  IGALQYLT TRPDIA+ +N+LSQ++ 
Subjt:  FALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRSTIGALQYLTTTRPDIAYIINQLSQFLQ

Query:  TPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSVVARSSTESEYRAL-------------
        TPT  HWQ +KR+LRYL GTK+  L  +P +NL ++ F DADWA++ DDRKS    CVFLG  LVSW+S+KQ VV+RSSTESEYR+L             
Subjt:  TPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSVVARSSTESEYRAL-------------

Query:  SLASAEIIWLQQ------LLKELGCH-SSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTHTQFLYLR
        +L S+E   L        LL+EL      KP+LWCDN+SA ALA+NPV HAR+KHIE+D+H++RDQ+L   + + YVP+ DQ+ADCLTKPL HT+F  +R
Subjt:  SLASAEIIWLQQ------LLKELGCH-SSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTHTQFLYLR

Query:  SKLGLVDTPS
         KLG+  +PS
Subjt:  SKLGLVDTPS

RVW18104.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | 0.0e+00 | 42.46
Query:  MANASSMSSTSVTNVGNTTFTSPPLNQLLNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDGSGA
        M++ +S SS+S  +  +++  S P  Q+LN    +KLDR NY+LW++    ++ +   E  + GT  CP + +R    P E+                  
Subjt:  MANASSMSSTSVTNVGNTTFTSPPLNQLLNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDGSGA

Query:  STSEARLSMNPQYEAWVTVDQLLLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGS
                 NP + AW   D+ +L W+Y+S+TP +  Q++G  ++   W+A++++F   SRA    LR  FQ T+KG+  M DY+  +K  AD+L   G 
Subjt:  STSEARLSMNPQYEAWVTVDQLLLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGS

Query:  PVSNRNLVSQVLLGLDEEYNAIVAMIQGRA-SVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQ
         VS ++ +  +L GL  +YNA+V  I  R   ++   + + LL FE+RLE Q S++     S    AN ASS   S+ +   +  + G G      N N 
Subjt:  PVSNRNLVSQVLLGLDEEYNAIVAMIQGRA-SVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQ

Query:  RGSGNRGR--GRGRGYNNYNNRQICQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGASNH
        RG G  GR    GR  ++ + R  CQ+CGK GH+  VCY+RF+  F   QN   G  N  +         SN+  A    A+   LAD NWY DSGAS+H
Subjt:  RGSGNRGR--GRGRGYNNYNNRQICQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGASNH

Query:  VTSNYDNLSNPTDYEGNECVTIGNGDKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDG
        +T N  NL+N T Y G + VTIGNG  L I+  G +RL    H  QL+ V  VP I+ NL+S++K   DNN  IEFH N   VKD  T RV+ +G L++G
Subjt:  VTSNYDNLSNPTDYEGNECVTIGNGDKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDG

Query:  LYQLQGVNLRNLSFSASSSSMRQENKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQFCESCQFGKSHALKF
        LY+   ++ +  ++   ++                     +   C+ +   +++WH RLGH +  ++  I+ +C +S         C SCQ  KSH L  
Subjt:  LYQLQGVNLRNLSFSASSSSMRQENKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQFCESCQFGKSHALKF

Query:  PLSDSRASKRFDLIHTDIWGPAPVLSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCNQLGIQSR
         LS   ASK  +L++TDIWGPA V S  G +Y++LF+DDYSRY WLY L+ K D    F  F   V+ QF + IK +QSDNGGE+        + GI  R
Subjt:  PLSDSRASKRFDLIHTDIWGPAPVLSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCNQLGIQSR

Query:  YSCPHTSAQNGRAERKHRHVVETGLTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTD
        +SCP+ S+QNGR ERKHRHVVETGL LLA A +PL +W  AF  A  LIN +P+ VL+  SP   +F +  D+  L+ FGC CYP +RPY NHK  + + 
Subjt:  YSCPHTSAQNGRAERKHRHVVETGLTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTD

Query:  QCVNLGLSASHKGYRCM-NKAGRVFVSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLTCVQPSPSPAPLQQP
        +CV LG S  HKGY C+ N  GRV+VS HV FDE  FPFA    +   S   S+ ++ P I+       +   G  +  +  P LT     P+P P   P
Subjt:  QCVNLGLSASHKGYRCM-NKAGRVFVSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLTCVQPSPSPAPLQQP

Query:  TGQN-NEPCSQTSPSPPPSQQPAVQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHPMVTRGKAGIFKPKAWLSRQQVDWSLT
        T ++  EP  +   + P  QQ  V                                          P P+ T    TR  +GI K K   +     + ++
Subjt:  TGQN-NEPCSQTSPSPPPSQQPAVQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHPMVTRGKAGIFKPKAWLSRQQVDWSLT

Query:  EPTRVQDALATPQWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAV
        EPT ++ A+  P W  AM TE +AL KNQTW LV      N++G KW++++K   DGS+ RYKARLVA+GF+Q  G+D+FETFSPVVKA+TIRIVL++A+
Subjt:  EPTRVQDALATPQWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAV

Query:  TRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLLVYVDDVI
        +  WELRQLD  NAFLNG L E VYM QPPG++ PN PN VCKLKKA+YGLKQ+PRAW T L + LLSWGF++SR+D+S+F+    +  L++LVYVDD+I
Subjt:  TRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLLVYVDDVI

Query:  VTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRSTIGALQYL
        VTG++  +I +LI +L + FAL+DLG+L+YFLGI+VTY    + L+Q KYI DLL +  +L  K   +P  +G  +S  DG  ++D  +YRS +GALQY 
Subjt:  VTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRSTIGALQYL

Query:  TTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSVVARSSTE
        T TRPDIA+ +N+  QF+  PT  HW +VKR+LRYL GT   GL  QP ++ ++ A++DADW +  DDR+S + Y V+LGNNLVSW++ KQ VV+RSS E
Subjt:  TTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSVVARSSTE

Query:  SEYRALSLASAEIIWLQQLLKELGCHS--SKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTHTQFLYL
        SEYR L++A+AEIIW Q LL EL C S  S P L+ DNISA  +A NPVFHARTKHIE+D+HF+RDQ+L   L+++Y+PS DQ AD LTK LT ++FL L
Subjt:  SEYRALSLASAEIIWLQQLLKELGCHS--SKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTHTQFLYL

Query:  RSKLGLVDTPSRLRGDIKE
        RS L LV  P  LRG I +
Subjt:  RSKLGLVDTPSRLRGDIKE

RVW85836.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | 0.0e+00 | 42.85
Query:  MANASSMSSTSVTNVG---NTTFTSPPLNQLLNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDG
        MA+A + SS+S  ++G   NTT  S P  Q+LN    +KLDR NY+LWK+    ++ +   E  + G+  CP +         E++SG            
Subjt:  MANASSMSSTSVTNVG---NTTFTSPPLNQLLNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDG

Query:  SGASTSEARLSMNPQYEAWVTVDQLLLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGL
                   +NP + AW   D+ +L WLY+S+TP +  Q++G  ++   W+A+++ F   SRA    LR   Q T+KG+  M DY+  +K  A++L  
Subjt:  SGASTSEARLSMNPQYEAWVTVDQLLLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGL

Query:  AGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGR-ASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNN
         G PVS ++ V  +L GL  +YNA+V  I  R   ++   + + LL FE RLE Q+S++  +  S N  ++  S  G    ++ N     G  + P  +N
Subjt:  AGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGR-ASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNN

Query:  YNQRGSGNRGR--GRGRGYNNYNNRQICQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGN-GNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSG
        Y  RG G  GR    GR  +N + +  CQ+CGK GH+  +CY+RF+  +   Q+      N GN N        SN             LAD  WY DSG
Subjt:  YNQRGSGNRGR--GRGRGYNNYNNRQICQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGN-GNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSG

Query:  ASNHVTSNYDNLSNPTDYEGNECVTIGNGDKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGA
        AS+H+T +  NL++ + Y G + VTIGNG  L I+  GS RL   +H   L+ V  VP I+ NL+S++K   DNN  IEF  N   VKD  T +V+ +G 
Subjt:  ASNHVTSNYDNLSNPTDYEGNECVTIGNGDKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGA

Query:  LKDGLYQLQGVNLRNLSFSASSSSMRQENKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPL---QFCESCQFG
        L++GLY+   +N + ++F  ++ S                    N   C N      +WH RLGH S  ++  I++ C +S + N+       C SCQ  
Subjt:  LKDGLYQLQGVNLRNLSFSASSSSMRQENKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPL---QFCESCQFG

Query:  KSHALKFPLSDSRASKRFDLIHTDIWGPAPVLSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCN
        KSH L   LS S ASK  +L+HTD+WGPAPV S  G RY++LFLDDYSRY W YPL+ K   L  F  F   V+ QF + IK +QSDNGGE+        
Subjt:  KSHALKFPLSDSRASKRFDLIHTDIWGPAPVLSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCN

Query:  QLGIQSRYSCPHTSAQNGRAERKHRHVVETGLTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNH
        Q GI  R+SCP+ SAQNGR ERKHRHVVETGL LLA AS+P+ +W  AF  A  LIN +P+ VL+  SP   +F K  D+ +L+ FGC CYP +RPY +H
Subjt:  QLGIQSRYSCPHTSAQNGRAERKHRHVVETGLTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNH

Query:  KFYFHTDQCVNLGLSASHKGYRCMN-KAGRVFVSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLTCVQPSPS
        K  + + Q + LG S  +KG+ C++   GRV+++ HV FDE  FP A     +      S  TL P I+  FP P                         
Subjt:  KFYFHTDQCVNLGLSASHKGYRCMN-KAGRVFVSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLTCVQPSPS

Query:  PAPLQQPTGQNNEPCSQTSPSPPPSQQPAVQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHP-MVTRGKAGIFKPKAWLSRQ
                      CS  SP+   S  P++   S           SVSSP       +P S   PE I    P   S  P M TR   GI + KA     
Subjt:  PAPLQQPTGQNNEPCSQTSPSPPPSQQPAVQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHP-MVTRGKAGIFKPKAWLSRQ

Query:  QVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIR
         V   ++EP  ++ AL  P W  AMD E +AL +NQTW LV   P  N++G KW++++K   DGSI+RYKARLVAKG++Q  G+D+FETFSPVVKA+TIR
Subjt:  QVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIR

Query:  IVLSLAVTRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLL
        I+L++A++  WE+RQLD +NAFLNG L E VYM QPPGY+D   P  VC+LKKA+YGLKQAPRAW   L + L+ WGF NSR+D+S+F++  E+  L++L
Subjt:  IVLSLAVTRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLL

Query:  VYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRST
        VYVDD+I+TG +S  I+ LI +L++ FAL+DLG+L+YFLGI+V+Y    + L+Q KY+ DLL +  +   KPA +P  +GK +S  DG P+++   YRS 
Subjt:  VYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRST

Query:  IGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSV
        +GALQYLT TRPDIA+ +N+  QF+Q PT  HW +VKR+LRYL GT   GLL  P +NL++  FSDADW +  DDR+S + Y V+LG NLVSWSS KQ V
Subjt:  IGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSV

Query:  VARSSTESEYRALSLASAEIIWLQQLLKELGCH-SSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTH
        V+RSS ESEYRAL+LA+AEIIW+Q LL+EL     + P+LW DNISA  +A NPVFHARTKHIE+D+HF+RDQ++ G +++ +VP+ DQ AD LTK LT 
Subjt:  VARSSTESEYRALSLASAEIIWLQQLLKELGCH-SSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTH

Query:  TQFLYLRSKLGLVDTPSRLRGDIK
        ++FL L+S+L +   P  LRGD K
Subjt:  TQFLYLRSKLGLVDTPSRLRGDIK

TrEMBL top hits | e value | % identity
A0A2Z6MBG6 Integrase catalytic domain-containing protein | 0.0e+00 | 46.07
Query:  TIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDGSGASTSEARLSMNPQYEAWVTVDQLLLGWLYNSMTP
        ++KLDR NY LWK+L +P++R  KL+G++LGT+ CP EFI                           ++S++  + N  +  W   DQ LLGW+ NSMT 
Subjt:  TIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDGSGASTSEARLSMNPQYEAWVTVDQLLLGWLYNSMTP

Query:  EVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGRASVT
        E+ATQ++  E +K LW   Q L G  +R++  +L+  F   RKG  KM DYL  MK   D L LAG+PVS  +L+ Q L GLD EYN +V  +  + +++
Subjt:  EVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGRASVT

Query:  WAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRG--RGRGRGYNNYNNRQICQVCGKVGHS
        W +LQA+LL FE R+E  N++ N T    NA AN+A                N + +R   +N N RGS +RG   GRGRG +  N    CQVCG   H 
Subjt:  WAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRG--RGRGRGYNNYNNRQICQVCGKVGHS

Query:  ALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGASNHVTSNYDNLSNPTDYEGNECVTIGNGDKLPITCIG
        A+ C++RF+K +S            NH+    +    NAF+A+Q      ++ D +WY DSGASNHVT   +   + T++ G   + +GNG+KL I   G
Subjt:  ALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGASNHVTSNYDNLSNPTDYEGNECVTIGNGDKLPITCIG

Query:  SSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDGLYQLQGVNLRNLSFSASSSSMRQENKIEKSYNEG
        SS+L      L L  +L VP+I KNL+S+SKLA DNN+ +EF  N C VKDK TG+V+LKG LKDGLYQL G   RN                       
Subjt:  SSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDGLYQLQGVNLRNLSFSASSSSMRQENKIEKSYNEG

Query:  AVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQFCESCQFGKSHALKFPLSDSRASKRFDLIHTDIWGPAPVLSGDGYRYYV
                 P A ++V K+ WHRRLGHP+ KVL+ +++ CK+ V  ++   FCE+CQ+GK H L F  S S A +  +L+HTD+WGPAP+++  G++YYV
Subjt:  AVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQFCESCQFGKSHALKFPLSDSRASKRFDLIHTDIWGPAPVLSGDGYRYYV

Query:  LFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCNQLGIQSRYSCPHTSAQNGRAERKHRHVVETGLTLLAQASMP
         F+DD+SR+ W+YPLK KS+T+ AF  F  + + QF   IK IQ D GGEY  V +L  + GIQ R SCP+TS QNGRAERKHRH+ E GLTLLAQA MP
Subjt:  LFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCNQLGIQSRYSCPHTSAQNGRAERKHRHVVETGLTLLAQASMP

Query:  LAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVNLGLSASHKGYRCMNKAGRVFVSRHVKFDEE
        L YWW+AF  A  LIN LP+ V + +SP  LM  K+ D+  LKTFGC+CYPCL+PY  HK  +HT +CV LG S SHKGY+C+N  GR+F+SRHV F+E+
Subjt:  LAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVNLGLSASHKGYRCMNKAGRVFVSRHVKFDEE

Query:  TFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIP--QSGIFSPPVNQPPLTCVQPSPSPAPLQQPTGQNNEPCSQTSPSPPPSQQPAVQNTSPSILPF
         FPF  GF    S +    TT+        P  + P   +G      + P L    P+ +     Q    + E   QT      +  P+  NT+      
Subjt:  TFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIP--QSGIFSPPVNQPPLTCVQPSPSPAPLQQPTGQNNEPCSQTSPSPPPSQQPAVQNTSPSILPF

Query:  PNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHPMVTRGKAGIFKPK-AWLSRQQVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSL
          Q+ SV     NT+                     ++H + TR K+GI KPK  ++   +      EP   ++AL+ P WK AM  EF AL+ N+TW L
Subjt:  PNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHPMVTRGKAGIFKPK-AWLSRQQVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSL

Query:  VPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAVTRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYV
        VP+    N+V +KW+F+ K   DGS++R KARLVAKGF Q  G+D+ ETFSPV+KAST+RI+LS+AV   WE+RQLD NNAFLNG L E V+M QP G+V
Subjt:  VPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAVTRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYV

Query:  DPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLLVYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLG
        D  +PNH+CKL KAIYGLKQAPRAW  +LK  LL+WGF N++SD+SLF+ + ++    LL+YVDD+IVTG+N K +   I +L++ F+LKDLG L+YFLG
Subjt:  DPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLLVYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLG

Query:  IQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRSTIGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVL
        I+V    SG+ L Q+KYI DLL K  + +  P P+P + G++ ++ +G+ L+DP ++R  IG LQYLT T PDIA+ +N+LSQ++ +P+  HWQ +KR+L
Subjt:  IQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRSTIGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVL

Query:  RYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSVVARSSTESEYRALSLASAEIIWLQQLLKELGCH-SSKPIL
        RYL GT +  L  +P ++L ++ FSDADWA++IDDRKS++  CVFLG  L+SWSS+KQ VV+RSSTESEYRAL+  +AEI W++ LL EL      KPIL
Subjt:  RYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSVVARSSTESEYRALSLASAEIIWLQQLLKELGCH-SSKPIL

Query:  WCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTHTQFLYLRSKLGLVDTP
        WCDN+SA ALA+NPV HAR+KHIE+DVH++RDQ+L   + V YVP+ DQ+ADCLTKPL+HT+F  LR KLG++ +P
Subjt:  WCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTHTQFLYLRSKLGLVDTP

A0A2Z6P4D5 Integrase catalytic domain-containing protein | 0.0e+00 | 44.64
Query:  SPPLNQLLNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDGSGASTSEARLSMNPQYEAWVTVDQ
        SP  N  L  I ++KLDR NY LWK+L + ++R  KL+G++LGT  CP +F+                           ++++    +NP +  W+  DQ
Subjt:  SPPLNQLLNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDGSGASTSEARLSMNPQYEAWVTVDQ

Query:  LLLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNA
         LLGWL NSM  ++ATQ++  E +K LW   Q L G  +++   +L+  F  TRKG  KM +YL  MK  +D L LAGSP+SN +L+ Q L GLD EYN 
Subjt:  LLLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNA

Query:  IVAMIQGRASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRGRGRGRGYNNYNNRQI
        +V  +  + +++W ++QA+LL FE RL+  N+    T  +    AN    +G       N+  S GN  R   N    RG    GRG+GR  N       
Subjt:  IVAMIQGRASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRGRGRGRGYNNYNNRQI

Query:  CQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGASNHVTSNYDNLSNPTDYEGNECVTIGN
        CQVC   GH A+ C  RF++ ++        +  G+H          +AF+     A+P    D  WY DSGA+NHVT   D      ++ G   + +GN
Subjt:  CQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGASNHVTSNYDNLSNPTDYEGNECVTIGN

Query:  GDKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDGLYQLQGVNLRNLSFSASSSSMRQE
        G+KL I   GS++L +    L L  VL VP I KNL+S+SKL  DNN+ +EF  N C VKDK TG+ +LKG LKDGLYQL                    
Subjt:  GDKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDGLYQLQGVNLRNLSFSASSSSMRQE

Query:  NKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQFCESCQFGKSHALKFPLSDSRASKRFDLIHTDIWGPAPV
                       SN  PC  M+V K+ WHR+LGHP+ KVL+ ++KDC + +  ++   FCE+CQFGK H L F  S S   +   LIH+D+WGPAP+
Subjt:  NKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQFCESCQFGKSHALKFPLSDSRASKRFDLIHTDIWGPAPV

Query:  LSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCNQLGIQSRYSCPHTSAQNGRAERKHRHVVETG
        LS  G++YYV F+DD+SR+ W++PLK KSDT+ AF  F  + + QF   IK IQ D GGEY  V ++  + GIQ R SCP+TS QNGRAERKHRHV E G
Subjt:  LSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCNQLGIQSRYSCPHTSAQNGRAERKHRHVVETG

Query:  LTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVNLGLSASHKGYRCMNKAGRVF
        LTLLAQA MPL YWW+AF  A  LIN LP++V   +SP  LMF ++ D+ ALK FGC+CYPCL+PY  HK  FHT +CV +G S SHKGY+C+N  GR+F
Subjt:  LTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVNLGLSASHKGYRCMNKAGRVF

Query:  VSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLT--CVQPSPSPAPLQQ----PTGQNNEPCSQTSPSPPPSQ
        VSRHV F+E  FPF  GF    + +     TL  +            S I  P  +    T   ++P  +    Q      +  NNE   Q   S     
Subjt:  VSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLT--CVQPSPSPAPLQQ----PTGQNNEPCSQTSPSPPPSQ

Query:  QPAVQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHPMVTRGKAGIFKPK-AWLSRQQVDWSLTEPTRVQDALATPQWKAAMD
             NT+ S       + SV S D N S  +   +   +   NSN     TH M TR K GI KPK  ++   + D    EP  V++AL  P WK AMD
Subjt:  QPAVQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHPMVTRGKAGIFKPK-AWLSRQQVDWSLTEPTRVQDALATPQWKAAMD

Query:  TEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAVTRGWELRQLDFNNAFLNGT
         E+ AL+ N TW+LVP+    N++ +KWIF+ K  +DGSI+R KARLVAKGF Q  G+DF ETFSPVVK+ST+RI+L++AV   WE+RQLD NNAFLNG 
Subjt:  TEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAVTRGWELRQLDFNNAFLNGT

Query:  LNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLLVYVDDVIVTGNNSKMINRLIVELDNR
        L E V+M QP GY+D  +PNH+CKL KAIYGLKQAPRAW  +L++ L++WGF N+++D SLF  +  +    LL+YVDD+IVTG+N K +     +L+  
Subjt:  LNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLLVYVDDVIVTGNNSKMINRLIVELDNR

Query:  FALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRSTIGALQYLTTTRPDIAYIINQLSQFLQ
        ++LKDLG L+YFLG++V    SG+ L Q KYI D+L K ++ +    P+P V G++  I +G+ + +P +YR  IGALQYLT TRPDIA+ +N+LSQ++ 
Subjt:  FALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRSTIGALQYLTTTRPDIAYIINQLSQFLQ

Query:  TPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSVVARSSTESEYRAL-------------
        TPT  HWQ +KR+LRYL GTK+  L  +P +NL ++ F DADWA++ DDRKS    CVFLG  LVSW+S+KQ VV+RSSTESEYR+L             
Subjt:  TPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSVVARSSTESEYRAL-------------

Query:  SLASAEIIWLQQ------LLKELGCH-SSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTHTQFLYLR
        +L S+E   L        LL+EL      KP+LWCDN+SA ALA+NPV HAR+KHIE+D+H++RDQ+L   + + YVP+ DQ+ADCLTKPL HT+F  +R
Subjt:  SLASAEIIWLQQ------LLKELGCH-SSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTHTQFLYLR

Query:  SKLGLVDTPS
         KLG+  +PS
Subjt:  SKLGLVDTPS

A0A438HN11 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 0.0e+00 | %identity: 42.85
Query:  MANASSMSSTSVTNVG---NTTFTSPPLNQLLNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDG
        MA+A + SS+S  ++G   NTT  S P  Q+LN    +KLDR NY+LWK+    ++ +   E  + G+  CP +         E++SG            
Subjt:  MANASSMSSTSVTNVG---NTTFTSPPLNQLLNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDG

Query:  SGASTSEARLSMNPQYEAWVTVDQLLLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGL
                   +NP + AW   D+ +L WLY+S+TP +  Q++G  ++   W+A+++ F   SRA    LR   Q T+KG+  M DY+  +K  A++L  
Subjt:  SGASTSEARLSMNPQYEAWVTVDQLLLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGL

Query:  AGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGR-ASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNN
         G PVS ++ V  +L GL  +YNA+V  I  R   ++   + + LL FE RLE Q+S++  +  S N  ++  S  G    ++ N     G  + P  +N
Subjt:  AGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGR-ASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNN

Query:  YNQRGSGNRGR--GRGRGYNNYNNRQICQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGN-GNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSG
        Y  RG G  GR    GR  +N + +  CQ+CGK GH+  +CY+RF+  +   Q+      N GN N        SN             LAD  WY DSG
Subjt:  YNQRGSGNRGR--GRGRGYNNYNNRQICQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGN-GNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSG

Query:  ASNHVTSNYDNLSNPTDYEGNECVTIGNGDKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGA
        AS+H+T +  NL++ + Y G + VTIGNG  L I+  GS RL   +H   L+ V  VP I+ NL+S++K   DNN  IEF  N   VKD  T +V+ +G 
Subjt:  ASNHVTSNYDNLSNPTDYEGNECVTIGNGDKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGA

Query:  LKDGLYQLQGVNLRNLSFSASSSSMRQENKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPL---QFCESCQFG
        L++GLY+   +N + ++F  ++ S                    N   C N      +WH RLGH S  ++  I++ C +S + N+       C SCQ  
Subjt:  LKDGLYQLQGVNLRNLSFSASSSSMRQENKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPL---QFCESCQFG

Query:  KSHALKFPLSDSRASKRFDLIHTDIWGPAPVLSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCN
        KSH L   LS S ASK  +L+HTD+WGPAPV S  G RY++LFLDDYSRY W YPL+ K   L  F  F   V+ QF + IK +QSDNGGE+        
Subjt:  KSHALKFPLSDSRASKRFDLIHTDIWGPAPVLSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCN

Query:  QLGIQSRYSCPHTSAQNGRAERKHRHVVETGLTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNH
        Q GI  R+SCP+ SAQNGR ERKHRHVVETGL LLA AS+P+ +W  AF  A  LIN +P+ VL+  SP   +F K  D+ +L+ FGC CYP +RPY +H
Subjt:  QLGIQSRYSCPHTSAQNGRAERKHRHVVETGLTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNH

Query:  KFYFHTDQCVNLGLSASHKGYRCMN-KAGRVFVSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLTCVQPSPS
        K  + + Q + LG S  +KG+ C++   GRV+++ HV FDE  FP A     +      S  TL P I+  FP P                         
Subjt:  KFYFHTDQCVNLGLSASHKGYRCMN-KAGRVFVSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLTCVQPSPS

Query:  PAPLQQPTGQNNEPCSQTSPSPPPSQQPAVQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHP-MVTRGKAGIFKPKAWLSRQ
                      CS  SP+   S  P++   S           SVSSP       +P S   PE I    P   S  P M TR   GI + KA     
Subjt:  PAPLQQPTGQNNEPCSQTSPSPPPSQQPAVQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHP-MVTRGKAGIFKPKAWLSRQ

Query:  QVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIR
         V   ++EP  ++ AL  P W  AMD E +AL +NQTW LV   P  N++G KW++++K   DGSI+RYKARLVAKG++Q  G+D+FETFSPVVKA+TIR
Subjt:  QVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIR

Query:  IVLSLAVTRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLL
        I+L++A++  WE+RQLD +NAFLNG L E VYM QPPGY+D   P  VC+LKKA+YGLKQAPRAW   L + L+ WGF NSR+D+S+F++  E+  L++L
Subjt:  IVLSLAVTRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLL

Query:  VYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRST
        VYVDD+I+TG +S  I+ LI +L++ FAL+DLG+L+YFLGI+V+Y    + L+Q KY+ DLL +  +   KPA +P  +GK +S  DG P+++   YRS 
Subjt:  VYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRST

Query:  IGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSV
        +GALQYLT TRPDIA+ +N+  QF+Q PT  HW +VKR+LRYL GT   GLL  P +NL++  FSDADW +  DDR+S + Y V+LG NLVSWSS KQ V
Subjt:  IGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSV

Query:  VARSSTESEYRALSLASAEIIWLQQLLKELGCH-SSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTH
        V+RSS ESEYRAL+LA+AEIIW+Q LL+EL     + P+LW DNISA  +A NPVFHARTKHIE+D+HF+RDQ++ G +++ +VP+ DQ AD LTK LT 
Subjt:  VARSSTESEYRALSLASAEIIWLQQLLKELGCH-SSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTH

Query:  TQFLYLRSKLGLVDTPSRLRGDIK
        ++FL L+S+L +   P  LRGD K
Subjt:  TQFLYLRSKLGLVDTPSRLRGDIK

A0A803PM38 Uncharacterized protein | e value: 0.0e+00 | %identity: 42.98
Query:  PPLNQLLNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDGSGASTSEARLSMNPQYEAWVTVDQL
        P     LNQ   +KLDR N+ LW+ +   I+R ++L+G+L GT   P EF+                  S+  DGS +S  +    +NP +E W+  DQL
Subjt:  PPLNQLLNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDGSGASTSEARLSMNPQYEAWVTVDQL

Query:  LLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAI
        LLGWLY SMT  +A +VMG +++  LW+A++ELFG  S+A+ D  R   Q  RKG   M+DYLR  +  AD L LAG P     LVS VL GLD EY  +
Subjt:  LLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAI

Query:  VAMIQGRASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRGRGRGRGYNNYNNRQIC
        V +I+ R S TW +LQ  LL  + ++E  +S   ++  +     N ++S     P       ++ N NR  ++  N RGS NR RGRG        R  C
Subjt:  VAMIQGRASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRGRGRGRGYNNYNNRQIC

Query:  QVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGASNHVTSNYDNLSNPTDYEGNECVTIGNG
        QVCGK GHSA  CYNR                                                      GASNH+TS  + ++   +Y G E VT+ NG
Subjt:  QVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGASNHVTSNYDNLSNPTDYEGNECVTIGNG

Query:  DKLPITCIGSSRL-TDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDGLYQLQGVNLRNLSFSASSSSMRQE
        ++LPI  IG   L T     L L+ +L VP I KNL+S+SKL  DNNV +EF  + C VKDK TG+VVLKG LKDGLYQ           S +S S  + 
Subjt:  DKLPITCIGSSRL-TDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDGLYQLQGVNLRNLSFSASSSSMRQE

Query:  NKIEKSYNEGAVFVV-SNVV-PCANMAVS--KKIWHRRLGHPSEKVLNSIVKDCKLSVK-VNEPLQFCESCQFGKSHALKFPLSDSRASKRFDLIHTDIW
             S++   V  V SNV  P AN  +   K  WHRRLGHPS +VL++++   K++VK +N  L FC++CQ GKSH+L F ++  RA+   +L+HTDIW
Subjt:  NKIEKSYNEGAVFVV-SNVV-PCANMAVS--KKIWHRRLGHPSEKVLNSIVKDCKLSVK-VNEPLQFCESCQFGKSHALKFPLSDSRASKRFDLIHTDIW

Query:  GPAPVLSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCNQLGIQSRYSCPHTSAQNGRAERKHRH
        GP+P++S   +RYY+ F+DD+SRY W+YPLK KS+ L+AF  F  +V+ QF S +K +Q+D GGEY    R  +  GI  ++ CPHTS QNGRAERKHRH
Subjt:  GPAPVLSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCNQLGIQSRYSCPHTSAQNGRAERKHRH

Query:  VVETGLTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVNLGLSASHKGYRCMNK
        +VE GLTLLAQA +P  YWWDAF  A  LIN LPT VLK K+P E++F ++ D+  LK FG SC+PCLR YQNHKF FH+ +CVNLG S  HKGY+C++ 
Subjt:  VVETGLTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVNLGLSASHKGYRCMNK

Query:  AGRVFVSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLTCVQPSPSPAPLQQPTGQNNEPCSQTSPSPPPSQQ
         GR+++SR V F+E+ FPF +GF   +   +  +  +       F          FS  +                    T + +     TS   P    
Subjt:  AGRVFVSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLTCVQPSPSPAPLQQPTGQNNEPCSQTSPSPPPSQQ

Query:  PAVQNTSPSILPFPNQETSVS---SPDSNTSQTSPASEPSPETILNSN-PCPQSTHPMVTRGKAGIFKPKAWLSRQQVDWSLTEPTRVQDALATPQWKAA
            +T   I  F N +          ++T+    A++P   +  + N     STHPM+TR KAGIFKPK +L++ +   + +EP  +++AL    W  A
Subjt:  PAVQNTSPSILPFPNQETSVS---SPDSNTSQTSPASEPSPETILNSN-PCPQSTHPMVTRGKAGIFKPKAWLSRQQVDWSLTEPTRVQDALATPQWKAA

Query:  MDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAVTRGWELRQLDFNNAFLN
        M +E  AL +N TW LVP  P  +++ NKW+++ KRNADGS QR KARLVAKGF Q PGVDF ETFSPV+KAST+RIVLS+AVT+ WE+RQLD NNAFLN
Subjt:  MDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAVTRGWELRQLDFNNAFLN

Query:  GTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLLVYVDDVIVTGNNSKMINRLIVELD
        G + E +YMKQP G+ D N+PNHVCKL K+IYGL+QAPRAW   LKA L SW F NS++D+SLF  +T +  +L+L+YVDD+I+TGNNS ++   I +L+
Subjt:  GTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLLVYVDDVIVTGNNSKMINRLIVELD

Query:  NRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRSTIGALQYLTTTRPDIAYIINQLSQF
         +FALKDLG+L+YFLGI+V    +G+ L+Q KYI++LL K+++++LK  P+P   GK +SI DG  L +P  YR                          
Subjt:  NRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRSTIGALQYLTTTRPDIAYIINQLSQF

Query:  LQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSVVARSSTESEYRALSLASAEIIWLQ
                                                         +DR+SVA  CV+LG+ L+SWSS+KQ VV+RSSTESEYRAL+  +AE+ W+Q
Subjt:  LQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSVVARSSTESEYRALSLASAEIIWLQ

Query:  QLLKELGCH-SSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTHTQFLYLRSKLGLVDTPSRLRGDIK
         LLKEL     + PI+WCDN+ A ALA+NPV+HARTKHIE+D+HFVRD+I+   LEVRY+PS +Q+ADCLTK LTH    +L SKLG+V  P  LRG+++
Subjt:  QLLKELGCH-SSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTHTQFLYLRSKLGLVDTPSRLRGDIK

Query:  EPSHSVSSASPSKKHNP
           +  +  + S    P
Subjt:  EPSHSVSSASPSKKHNP

A5BFR8 Integrase catalytic domain-containing protein | e value: 0.0e+00 | %identity: 41.44
Query:  MANASSMSSTSVTNVG---NTTFTSPPLNQLLNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDG
        MA+  + SS+S  ++G   ++T  S P  Q+LN    +KLDR NY+LW++    ++ +   E  + GT  CP +         +++ G            
Subjt:  MANASSMSSTSVTNVG---NTTFTSPPLNQLLNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDG

Query:  SGASTSEARLSMNPQYEAWVTVDQLLLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGL
                   MNP + AW   D+ +L W+Y+S+TP +  Q++G   +   W+A++ +F   SRA    LR   Q T+KG+  M DY+  +K  ADNL  
Subjt:  SGASTSEARLSMNPQYEAWVTVDQLLLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGL

Query:  AGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGR-ASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNN
         G PVS ++ V  +L GL  +YNA+V  I  R   ++   + + LL FE RLE Q+S++  +       AN ASS       +       G G  P  NN
Subjt:  AGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGR-ASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNN

Query:  YNQRGSGNRGRG--RGRGYNNYNNRQICQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGA
        Y  RG G  GR    GR  ++ + +  CQ+CGK GH+A +CY+RF+  F        G    +H+ N G NQ +   M    +  P   AD +WY DSGA
Subjt:  YNQRGSGNRGRG--RGRGYNNYNNRQICQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGA

Query:  SNHVTSNYDNLSNPTDYEGNECVTIGNGDKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGAL
        S+H+T N  NL++ + Y G + VTIGNG  L I+ IGS +L    H  +L+ V  VP I+ NL+S++K   +NN  IEFH N   VKD  T  V+ +G L
Subjt:  SNHVTSNYDNLSNPTDYEGNECVTIGNGDKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGAL

Query:  KDGLYQLQGV-------NLRNLSFSASSSSMRQENKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQFCESC
        ++GLY+           ++ N S   S  S   ENK E                         +WH RLGH S  +++ ++  C ++    +    C  C
Subjt:  KDGLYQLQGV-------NLRNLSFSASSSSMRQENKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQFCESC

Query:  QFGKSHALKFPLSDSRASKRFDLIHTDIWGPAPVLSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHR
        Q  KSH L   LS+  ASK  +L++TDIWGPA + S  G RY++LF+DDYSRY W Y L+ K   L  F  F   ++ QF + IK +QSDNGGE+     
Subjt:  QFGKSHALKFPLSDSRASKRFDLIHTDIWGPAPVLSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHR

Query:  LCNQLGIQSRYSCPHTSAQNGRAERKHRHVVETGLTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPY
            +GI  R+SCP+ S QNGR ERKHRHVVETGL LL+ AS+P+ YW  AF     LIN +P+ VL+  SP   +F +  D+ + + FGC CYP +RPY
Subjt:  LCNQLGIQSRYSCPHTSAQNGRAERKHRHVVETGLTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPY

Query:  QNHKFYFHTDQCVNLGLSASHKGYRCMNKA-GRVFVSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLTCVQP
          HK  + + QC+ LG S +HKG+ C++ A GRV+++ HV FDE TFP A        S S SN T A               G     +  P   C+ P
Subjt:  QNHKFYFHTDQCVNLGLSASHKGYRCMNKA-GRVFVSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLTCVQP

Query:  SPSPAPLQQPTGQNNEPCSQTSPSPPPSQQPAVQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQST---HPMVTRGKAGIFKPKA
            + +   +  ++   +  SP P  S  P                        +TS +SPA + SP+++    P PQ T     M TR   GI K K 
Subjt:  SPSPAPLQQPTGQNNEPCSQTSPSPPPSQQPAVQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQST---HPMVTRGKAGIFKPKA

Query:  WLSRQQVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVK
         L    +   ++EP+ ++ A   P W  AM+ E +AL +N TW LV   P+ NV+G KW++++K   DGSI+RYKARLVAKG++Q  G+D+FETFSPVVK
Subjt:  WLSRQQVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVK

Query:  ASTIRIVLSLAVTRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENV
        A+TIRI+L++A++  WE+RQLD +NAFLNG L E VYM QPPGY DP  PN VC+LKKA+YGLKQAPRAW   L + LL WGF  SR+D+S+F+   +  
Subjt:  ASTIRIVLSLAVTRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENV

Query:  CLLLLVYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPF
         L++LVYVDD++VTG++S  I+ LI +LD+ FAL+DLG+L++FLGI+V+Y    + L+Q KYI DLL + +L   KPA +P  +GK +S  DG P+ D  
Subjt:  CLLLLVYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPF

Query:  IYRSTIGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSS
         YRS +GALQY+T TRPDIA+ +N+  QF+Q PT  HW +VKR+LRYL GT   GLLF P SNL++  F+DADW +++DDR+S + Y V+LG NLVSWSS
Subjt:  IYRSTIGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSS

Query:  KKQSVVARSSTESEYRALSLASAEIIWLQQLLKELGCH-SSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLT
         KQ VV+RSS ESEYR L  A+AEI+W+Q LL+EL     + P+LW DNISA  +A NPVFHARTKHIE+D+HF+RDQ++ G +++++VP+ +Q  D LT
Subjt:  KKQSVVARSSTESEYRALSLASAEIIWLQQLLKELGCH-SSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLT

Query:  KPLTHTQFLYLRSKLGLVDTPSRLRGDIKEPSHSVSSASPSKKHNPEEEQAKGS
        K LT ++FL L+S+L +   P  LRGD K  +              EE +  GS
Subjt:  KPLTHTQFLYLRSKLGLVDTPSRLRGDIKEPSHSVSSASPSKKHNPEEEQAKGS

SwissProt top hits | e value | %identity | Alignment
P04146 Copia protein | e value: 7.3e-115 | %identity: 27.91
Query:  AKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMS--DYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQ--GRASVTWAELQAE
        A+ +   +  ++  +S A +  LR+    + K +S+MS   +  +       L  AG+ +   + +S +L+ L   Y+ I+  I+     ++T A ++  
Subjt:  AKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMS--DYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQ--GRASVTWAELQAE

Query:  LLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRGRGRGRGYNNYNNRQICQVCGKVGHSALVCYNRFN
        LL  ++ ++++N   +T                  S K  N I  N N     Y N   +    + +   +G + Y  +  C  CG+ GH    C++   
Subjt:  LLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRGRGRGRGYNNYNNRQICQVCGKVGHSALVCYNRFN

Query:  KEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGASNHVTSNYDNLSNPTDYEGNECVTIG-NGDKLPITCIGSSRLTDGN
          +  I N  N     N  Q +       AFM  +   T   + +  +  DSGAS+H+ ++    ++  +      + +   G+ +  T  G  RL + +
Subjt:  KEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGASNHVTSNYDNLSNPTDYEGNECVTIG-NGDKLPITCIGSSRLTDGN

Query:  HVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDGLYQLQGVNLRNLSFSASSSSMRQENKIEKSYNEGAVFVVSNV
        H + LE VL   + A NL+S+ +L Q+  + IEF  +   +     G +V+K +   G+      N+  ++F A S + + +N                 
Subjt:  HVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDGLYQLQGVNLRNLSFSASSSSMRQENKIEKSYNEGAVFVVSNV

Query:  VPCANMAVSKKIWHRRLGHPSEKVL-----NSIVKDCKLSVKVNEPLQFCESCQFGKSHALKF-PLSDSRASKR-FDLIHTDIWGPAPVLSGDGYRYYVL
                  ++WH R GH S+  L      ++  D  L   +    + CE C  GK   L F  L D    KR   ++H+D+ GP   ++ D   Y+V+
Subjt:  VPCANMAVSKKIWHRRLGHPSEKVL-----NSIVKDCKLSVKVNEPLQFCESCQFGKSHALKF-PLSDSRASKR-FDLIHTDIWGPAPVLSGDGYRYYVL

Query:  FLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYV--KVHRLCNQLGIQSRYSCPHTSAQNGRAERKHRHVVETGLTLLAQASM
        F+D ++ Y   Y +K KSD  S F  F+   +  F   +  +  DNG EY+  ++ + C + GI    + PHT   NG +ER  R + E   T+++ A +
Subjt:  FLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYV--KVHRLCNQLGIQSRYSCPHTSAQNGRAERKHRHVVETGLTLLAQASM

Query:  PLAYWWDAFMAAARLINGLPTTVL--KGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVNLGLSASHKGYRCMNKAGRVF-VSRHVK
          ++W +A + A  LIN +P+  L    K+P E+   KK     L+ FG + Y  ++  Q  KF   + + + +G   +  G++  +     F V+R V 
Subjt:  PLAYWWDAFMAAARLINGLPTTVL--KGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVNLGLSASHKGYRCMNKAGRVF-VSRHVK

Query:  FDEETF--PFAAGFGTV---DSSMSGSNT--TLAPHILQW-FPQPNIPQSGIFSPPVNQPPLTCVQPSPSPAPLQQPTGQNNEPCSQTS-PSPPPSQQPA
         DE       A  F TV   DS  S +      +  I+Q  FP  +     I     ++       P+ S   +Q      ++ C               
Subjt:  FDEETF--PFAAGFGTV---DSSMSGSNT--TLAPHILQW-FPQPNIPQSGIFSPPVNQPPLTCVQPSPSPAPLQQPTGQNNEPCSQTS-PSPPPSQQPA

Query:  VQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHPMVTRGKAGIFKPKAWLSRQQVDWSLTEPT---------------RVQDA
          N S       +   S  S + N S+ S  +E   E  ++ NP       ++ R ++   K K  +S  + D SL +                  +Q  
Subjt:  VQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHPMVTRGKAGIFKPKAWLSRQQVDWSLTEPT---------------RVQDA

Query:  LATPQWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAVTRGWELRQ
             W+ A++TE +A   N TW++     + N+V ++W+F +K N  G+  RYKARLVA+GF Q   +D+ ETF+PV + S+ R +LSL +    ++ Q
Subjt:  LATPQWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAVTRGWELRQ

Query:  LDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENV--CLLLLVYVDDVIVTGNNS
        +D   AFLNGTL E +YM+ P G +  N  N VCKL KAIYGLKQA R W    +  L    F NS  D  ++I    N+   + +L+YVDDV++   + 
Subjt:  LDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENV--CLLLLVYVDDVIVTGNNS

Query:  KMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDL--LHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRSTIGALQY-LTTT
          +N     L  +F + DL  + +F+GI++      + L+Q+ Y+  +L+K ++   +    P P  I  ++ ++  +    P   RS IG L Y +  T
Subjt:  KMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDL--LHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRSTIGALQY-LTTT

Query:  RPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLS----VSAFSDADWASNIDDRKSVAAYCVFLGN-NLVSWSSKKQSVVARSS
        RPD+   +N LS++        WQ +KRVLRYL GT  + L+F+   NL+    +  + D+DWA +  DRKS   Y   + + NL+ W++K+Q+ VA SS
Subjt:  RPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLS----VSAFSDADWASNIDDRKSVAAYCVFLGN-NLVSWSSKKQSVVARSS

Query:  TESEYRALSLASAEIIWLQQLLKELGCHSSKPI-LWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTHTQFLY
        TE+EY AL  A  E +WL+ LL  +      PI ++ DN    ++A NP  H R KHI++  HF R+Q+    + + Y+P+ +QLAD  TKPL   +F+ 
Subjt:  TESEYRALSLASAEIIWLQQLLKELGCHSSKPI-LWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTHTQFLY

Query:  LRSKLGLV
        LR KLGL+
Subjt:  LRSKLGLV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 5.0e-140 | %identity: 27.97
Query:  EAWVTVDQLLLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFL-RQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVL
        E W  +D+     +   ++ +V   ++  + A+ +W+ ++ L+  ++   + +L +Q +       +    +L +       L   G  +   +    +L
Subjt:  EAWVTVDQLLLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFL-RQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVL

Query:  LGLDEEY-NAIVAMIQGRASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRGRGRGR
          L   Y N    ++ G+ ++   ++ + LL+ EK   ++   +N                      Q   + + G G     ++ N   SG RG+ + R
Subjt:  LGLDEEY-NAIVAMIQGRASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRGRGRGR

Query:  GYNNYNNRQICQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSN---AFMATQPTATPETLADPNWYADSGASNHVTSNYDNLSNP
          +   N   C  C + GH    C N       P + +G  +G  N +      Q ++    F+  +      +  +  W  D+ AS+H T   D     
Subjt:  GYNNYNNRQICQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSN---AFMATQPTATPETLADPNWYADSGASNHVTSNYDNLSNP

Query:  TDYEGNE--CVTIGNGDKLPITCIGSSRL-TDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTT--GRVVLKGALKDGLYQLQG
          Y   +   V +GN     I  IG   + T+    L L+ V  VPD+  NL+S   L +D      +   F   K + T    V+ KG  +  LY+   
Subjt:  TDYEGNE--CVTIGNGDKLPITCIGSSRL-TDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTT--GRVVLKGALKDGLYQLQG

Query:  VNLRNLSFSASSSSMRQENKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQFCESCQFGKSHALKFPLSDSR
                                       +    +  A   +S  +WH+R+GH SEK L  + K   +S      ++ C+ C FGK H + F  S  R
Subjt:  VNLRNLSFSASSSSMRQENKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQFCESCQFGKSHALKFPLSDSR

Query:  ASKRFDLIHTDIWGPAPVLSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYV--KVHRLCNQLGIQSRYSCP
             DL+++D+ GP  + S  G +Y+V F+DD SR +W+Y LK K      F  F  +V+ + G  +K ++SDNGGEY   +    C+  GI+   + P
Subjt:  ASKRFDLIHTDIWGPAPVLSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYV--KVHRLCNQLGIQSRYSCP

Query:  HTSAQNGRAERKHRHVVETGLTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVN
         T   NG AER +R +VE   ++L  A +P ++W +A   A  LIN  P+  L  + P  +   K++ ++ LK FGC  +  +   Q  K    +  C+ 
Subjt:  HTSAQNGRAERKHRHVVETGLTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVN

Query:  LGLSASHKGYRCMNKA-GRVFVSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLTCVQPSPSPAPLQQPTGQN
        +G      GYR  +    +V  SR V F E     AA     D S    N  +          PN               +T    S +P   +  T + 
Subjt:  LGLSASHKGYRCMNKA-GRVFVSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLTCVQPSPSPAPLQQPTGQN

Query:  NEPCSQTSPSPPPSQQPAVQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHPMVTRGKAGIFKPKAWLSRQQVDWSLTEPTRV
        +E   Q        +Q                   V  P     Q  P    S    + S   P + + +++  +                    EP  +
Subjt:  NEPCSQTSPSPPPSQQPAVQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHPMVTRGKAGIFKPKAWLSRQQVDWSLTEPTRV

Query:  QDALATP---QWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAVTR
        ++ L+ P   Q   AM  E  +L KN T+ LV        +  KW+F++K++ D  + RYKARLV KGF Q  G+DF E FSPVVK ++IR +LSLA + 
Subjt:  QDALATP---QWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAVTR

Query:  GWELRQLDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFR-TENVCLLLLVYVDDVIV
          E+ QLD   AFL+G L E +YM+QP G+    + + VCKL K++YGLKQAPR W     + + S  +  + SD  ++  R +EN  ++LL+YVDD+++
Subjt:  GWELRQLDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFR-TENVCLLLLVYVDDVIV

Query:  TGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSG--LLLTQAKYIDDLLTKLDLLHLKPAPSPCV----IGKKM--SIHDGKPLEDPFIYRST
         G +  +I +L  +L   F +KDLG     LG+++    +   L L+Q KYI+ +L + ++ + KP  +P      + KKM  +  + K       Y S 
Subjt:  TGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSG--LLLTQAKYIDDLLTKLDLLHLKPAPSPCV----IGKKM--SIHDGKPLEDPFIYRST

Query:  IGALQY-LTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQS
        +G+L Y +  TRPDIA+ +  +S+FL+ P   HW+AVK +LRYL GT    L F  GS+  +  ++DAD A +ID+RKS   Y        +SW SK Q 
Subjt:  IGALQY-LTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQS

Query:  VVARSSTESEYRALSLASAEIIWLQQLLKELGCHSSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTH
         VA S+TE+EY A +    E+IWL++ L+ELG H  + +++CD+ SA  L+ N ++HARTKHI+V  H++R+ +   +L+V  + +++  AD LTK +  
Subjt:  VVARSSTESEYRALSLASAEIIWLQQLLKELGCHSSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTH

Query:  TQFLYLRSKLGL
         +F   +  +G+
Subjt:  TQFLYLRSKLGL

P92519 Uncharacterized mitochondrial protein AtMg00810 | e value: 2.7e-53 | %identity: 46.46
Query:  LLLLVYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFI
        + LL+YVDD+++TG+++ ++N LI +L + F++KDLG ++YFLGIQ+   PSGL L+Q KY + +L    +L  KP  +P  +    S+   K   DP  
Subjt:  LLLLVYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFI

Query:  YRSTIGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSK
        +RS +GALQYLT TRPDI+Y +N + Q +  PT   +  +KRVLRY+ GT   GL     S L+V AF D+DWA     R+S   +C FLG N++SWS+K
Subjt:  YRSTIGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSK

Query:  KQSVVARSSTESEYRALSLASAEIIW
        +Q  V+RSSTE+EYRAL+L +AE+ W
Subjt:  KQSVVARSSTESEYRALSLASAEIIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 4.2e-280 | %identity: 38.16
Query:  LNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDGSGASTSEARLSMNPQYEAWVTVDQLLLGWLY
        +N     KL   NYL+W      +   Y+L G L G+ + PP  I  D  P                             +NP Y  W   D+L+   + 
Subjt:  LNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDGSGASTSEARLSMNPQYEAWVTVDQLLLGWLY

Query:  NSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQG
         +++  V   V     A  +W  +++++   S      LR   +Q  KG   + DY++ + T  D L L G P+ +   V +VL  L EEY  ++  I  
Subjt:  NSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQG

Query:  R-ASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRGRGRGRGYNNY--NNRQI----
        +    T  E+   LL  E ++ L  S       + NA+          S + T    +N NGNR   N Y+ R + N  +   +   N+  NN Q     
Subjt:  R-ASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRGRGRGRGYNNY--NNRQI----

Query:  --CQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTAT---PETLADPNWYADSGASNHVTSNYDNLSNPTDYEGNEC
          CQ+CG  GHSA  C ++     S + ++                Q  + F   QP A        +  NW  DSGA++H+TS+++NLS    Y G + 
Subjt:  --CQVCGKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTAT---PETLADPNWYADSGASNHVTSNYDNLSNPTDYEGNEC

Query:  VTIGNGDKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDGLYQLQGVNLRNLSFSASSS
        V + +G  +PI+  GS+ L+  +  L L ++L VP+I KNL+S+ +L   N V +EF      VKD  TG  +L+G  KD LY+    + + +S  AS S
Subjt:  VTIGNGDKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDGLYQLQGVNLRNLSFSASSS

Query:  SMRQENKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQF--CESCQFGKSHALKFPLSDSRASKRFDLIHTD
        S    +                             WH RLGHP+  +LNS++ +  LSV +N   +F  C  C   KS+ + F  S   +++  + I++D
Subjt:  SMRQENKIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQF--CESCQFGKSHALKFPLSDSRASKRFDLIHTD

Query:  IWGPAPVLSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCNQLGIQSRYSCPHTSAQNGRAERKH
        +W  +P+LS D YRYYV+F+D ++RY WLYPLK KS     F  F  +++ +F + I    SDNGGE+V +    +Q GI    S PHT   NG +ERKH
Subjt:  IWGPAPVLSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCNQLGIQSRYSCPHTSAQNGRAERKH

Query:  RHVVETGLTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVNLGLSASHKGYRCM
        RH+VETGLTLL+ AS+P  YW  AF  A  LIN LPT +L+ +SP + +F    ++  L+ FGC+CYP LRPY  HK    + QCV LG S +   Y C+
Subjt:  RHVVETGLTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVNLGLSASHKGYRCM

Query:  N-KAGRVFVSRHVKFDEETFPFA---AGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLTCVQPSPSPAPLQ----------------
        + +  R+++SRHV+FDE  FPF+   A    V      S+   +PH       P +P     +P  + P      PS   AP +                
Subjt:  N-KAGRVFVSRHVKFDEETFPFA---AGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPPLTCVQPSPSPAPLQ----------------

Query:  -----QPTG-QNNEPCSQTSPSPPPSQQPAVQNT--------SPS----ILPFPNQETSVSSPDSNTSQTSPASEPSPETIL------------NSNPCP
             +PT  + N P   T P+   +Q  + QNT        SPS     L  P Q +S SSP   TS +S ++ P+P +IL            N+N  P
Subjt:  -----QPTG-QNNEPCSQTSPSPPPSQQPAVQNT--------SPS----ILPFPNQETSVSSPDSNTSQTSPASEPSPETIL------------NSNPCP

Query:  QSTHPMVTRGKAGIFKPKAWLSRQQVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSLVPHAPS-FNVVGNKWIFRIKRNADGSIQRYKARLVA
         +TH M TR KAGI KP    S      + +EP     AL   +W+ AM +E +A I N TW LVP  PS   +VG +WIF  K N+DGS+ RYKARLVA
Subjt:  QSTHPMVTRGKAGIFKPKAWLSRQQVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSLVPHAPS-FNVVGNKWIFRIKRNADGSIQRYKARLVA

Query:  KGFHQYPGVDFFETFSPVVKASTIRIVLSLAVTRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLS
        KG++Q PG+D+ ETFSPV+K+++IRIVL +AV R W +RQLD NNAFL GTL + VYM QPPG++D +RPN+VCKL+KA+YGLKQAPRAW   L+  LL+
Subjt:  KGFHQYPGVDFFETFSPVVKASTIRIVLSLAVTRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLS

Query:  WGFHNSRSDNSLFIFRTENVCLLLLVYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPS
         GF NS SD SLF+ +     + +LVYVDD+++TGN+  +++  +  L  RF++KD   L+YFLGI+   +P+GL L+Q +YI DLL + +++  KP  +
Subjt:  WGFHNSRSDNSLFIFRTENVCLLLLVYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPS

Query:  PCVIGKKMSIHDGKPLEDPFIYRSTIGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDD
        P     K+S++ G  L DP  YR  +G+LQYL  TRPDI+Y +N+LSQF+  PT+ H QA+KR+LRYL GT + G+  + G+ LS+ A+SDADWA + DD
Subjt:  PCVIGKKMSIHDGKPLEDPFIYRSTIGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDD

Query:  RKSVAAYCVFLGNNLVSWSSKKQSVVARSSTESEYRALSLASAEIIWLQQLLKELGCHSSK-PILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQIL
          S   Y V+LG++ +SWSSKKQ  V RSSTE+EYR+++  S+E+ W+  LL ELG   ++ P+++CDN+ A  L ANPVFH+R KHI +D HF+R+Q+ 
Subjt:  RKSVAAYCVFLGNNLVSWSSKKQSVVARSSTESEYRALSLASAEIIWLQQLLKELGCHSSK-PILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQIL

Query:  WGALEVRYVPSHDQLADCLTKPLTHTQFLYLRSKLGLVDTP
         GAL V +V +HDQLAD LTKPL+ T F    SK+G+   P
Subjt:  WGALEVRYVPSHDQLADCLTKPLTHTQFLYLRSKLGLVDTP

Q9ZT94  Retrovirus-related Pol polyprotein from transposon RE2  (e-value 2.5e-272, identity 37.64%)
Query:  LNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDGSGASTSEARLSMNPQYEAWVTVDQLLLGWLY
        +N     KL   NYL+W      +   Y+L G L G+   PP  I  D  P                             +NP Y  W   D+L+   + 
Subjt:  LNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDGSGASTSEARLSMNPQYEAWVTVDQLLLGWLY

Query:  NSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQG
         +++  V   V     A  +W  +++++   S      LR                     T  D L L G P+ +   V +VL  L ++Y  ++  I  
Subjt:  NSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQG

Query:  R-ASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRGRGRGRGYNNYNNRQI--CQVC
        +    +  E+   L+  E +L   NS +          AN+ + +  +    TN+  +N   NR + NN N+  S        R  N      +  CQ+C
Subjt:  R-ASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRGRGRGRGYNNYNNRQI--CQVC

Query:  GKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLAD---PNWYADSGASNHVTSNYDNLSNPTDYEGNECVTIGNG
           GHSA  C      +    Q+  N            Q Q ++ F   QP A     +     NW  DSGA++H+TS+++NLS    Y G + V I +G
Subjt:  GKVGHSALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLAD---PNWYADSGASNHVTSNYDNLSNPTDYEGNECVTIGNG

Query:  DKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDGLYQLQGVNLRNLSFSASSSSMRQEN
          +PIT  GS+ L   +  L L  VL VP+I KNL+S+ +L   N V +EF      VKD  TG  +L+G  KD LY+    + + +S  AS        
Subjt:  DKLPITCIGSSRLTDGNHVLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDGLYQLQGVNLRNLSFSASSSSMRQEN

Query:  KIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSV-KVNEPLQFCESCQFGKSHALKFPLSDSRASKRFDLIHTDIWGPAPV
                          PC+    S   WH RLGHPS  +LNS++ +  L V   +  L  C  C   KSH + F  S   +SK  + I++D+W  +P+
Subjt:  KIEKSYNEGAVFVVSNVVPCANMAVSKKIWHRRLGHPSEKVLNSIVKDCKLSV-KVNEPLQFCESCQFGKSHALKFPLSDSRASKRFDLIHTDIWGPAPV

Query:  LSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCNQLGIQSRYSCPHTSAQNGRAERKHRHVVETG
        LS D YRYYV+F+D ++RY WLYPLK KS     F  F ++V+ +F + I  + SDNGGE+V +    +Q GI    S PHT   NG +ERKHRH+VE G
Subjt:  LSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFLTMVKTQFGSMIKAIQSDNGGEYVKVHRLCNQLGIQSRYSCPHTSAQNGRAERKHRHVVETG

Query:  LTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVNLGLSASHKGYRCMN-KAGRV
        LTLL+ AS+P  YW  AF  A  LIN LPT +L+ +SP + +F +  ++  LK FGC+CYP LRPY  HK    + QC  +G S +   Y C++   GR+
Subjt:  LTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDFTALKTFGCSCYPCLRPYQNHKFYFHTDQCVNLGLSASHKGYRCMN-KAGRV

Query:  FVSRHVKFDEETFPFA-AGFGTVDSSMSGSNTT---------------------LAPHILQWFPQP---------------NIPQSGIFSPPVNQPPLTC
        + SRHV+FDE  FPF+   FG   S    S++                      L PH L   P+P               N+P S I SP  ++P    
Subjt:  FVSRHVKFDEETFPFA-AGFGTVDSSMSGSNTT---------------------LAPHILQWFPQP---------------NIPQSGIFSPPVNQPPLTC

Query:  VQ-PSPSPAPLQQPTGQNNEPC----SQTSPSPPPSQQPAVQNTSPSILP-FPNQETSVSSPDSNTSQTS-----PASEPSPETILNSNPCPQSTHPMVT
           P P+  P Q     +N P     +  SPSP    Q +    SP   P  P   TS+S P+S +S ++     P   P+P  I  +   P +TH M T
Subjt:  VQ-PSPSPAPLQQPTGQNNEPC----SQTSPSPPPSQQPAVQNTSPSILP-FPNQETSVSSPDSNTSQTS-----PASEPSPETILNSNPCPQSTHPMVT

Query:  RGKAGIFKPKAWLSRQQVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSLV-PHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPG
        R K GI KP    S      + +EP     A+   +W+ AM +E +A I N TW LV P  PS  +VG +WIF  K N+DGS+ RYKARLVAKG++Q PG
Subjt:  RGKAGIFKPKAWLSRQQVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSLV-PHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPG

Query:  VDFFETFSPVVKASTIRIVLSLAVTRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRS
        +D+ ETFSPV+K+++IRIVL +AV R W +RQLD NNAFL GTL + VYM QPPG+VD +RP++VC+L+KAIYGLKQAPRAW   L+  LL+ GF NS S
Subjt:  VDFFETFSPVVKASTIRIVLSLAVTRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRS

Query:  DNSLFIFRTENVCLLLLVYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKM
        D SLF+ +     + +LVYVDD+++TGN++ ++   +  L  RF++K+   L+YFLGI+   +P GL L+Q +Y  DLL + ++L  KP  +P     K+
Subjt:  DNSLFIFRTENVCLLLLVYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKM

Query:  SIHDGKPLEDPFIYRSTIGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYC
        ++H G  L DP  YR  +G+LQYL  TRPD++Y +N+LSQ++  PTD HW A+KRVLRYL GT   G+  + G+ LS+ A+SDADWA + DD  S   Y 
Subjt:  SIHDGKPLEDPFIYRSTIGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYC

Query:  VFLGNNLVSWSSKKQSVVARSSTESEYRALSLASAEIIWLQQLLKELGCH-SSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRY
        V+LG++ +SWSSKKQ  V RSSTE+EYR+++  S+E+ W+  LL ELG   S  P+++CDN+ A  L ANPVFH+R KHI +D HF+R+Q+  GAL V +
Subjt:  VFLGNNLVSWSSKKQSVVARSSTESEYRALSLASAEIIWLQQLLKELGCH-SSKPILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRY

Query:  VPSHDQLADCLTKPLTHTQFLYLRSKLGLVDTPSRLRGDIK
        V +HDQLAD LTKPL+   F     K+G++  P    G ++
Subjt:  VPSHDQLADCLTKPLTHTQFLYLRSKLGLVDTPSRLRGDIK

Arabidopsis top hits (e-value, % identity, alignment)
AT1G34070.1  CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)  (e-value 4.9e-05, identity 27.45%)
Query:  WVTVDQLLLGWLYNSMTP-EVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLG
        W   D ++   LY ++TP +     +    ++D+W  I+  F     A    L    +    G+ +++DY R MK  AD+L     PV++RNLV  VL G
Subjt:  WVTVDQLLLGWLYNSMTP-EVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLG

Query:  LDEEYNAIVAMIQGRASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNR-GRGRGRGY
        L+ +++ I+ +I+ R      +  A  ++ E+   L+ ++K   T   ++ ++   +    +P  TN   S GN        Y  RG GN   RGRG  +
Subjt:  LDEEYNAIVAMIQGRASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNR-GRGRGRGY

Query:  NNYN
        + YN
Subjt:  NNYN

AT4G23160.1  cysteine-rich RLK (RECEPTOR-like protein kinase) 8  (e-value 8.0e-117, identity 43.36%)
Query:  EPTRVQDALATPQWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAV
        EP+   +A     W  AMD E  A+    TW +    P+   +G KW+++IK N+DG+I+RYKARLVAKG+ Q  G+DF ETFSPV K ++++++L+++ 
Subjt:  EPTRVQDALATPQWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAV

Query:  TRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYV----DPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLLVYV
           + L QLD +NAFLNG L+E +YMK PPGY     D   PN VC LKK+IYGLKQA R W       L+ +GF  S SD++ F+  T  + L +LVYV
Subjt:  TRGWELRQLDFNNAFLNGTLNEVVYMKQPPGYV----DPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLLVYV

Query:  DDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRSTIGA
        DD+I+  NN   ++ L  +L + F L+DLG L YFLG+++    +G+ + Q KY  DLL +  LL  KP+  P       S H G    D   YR  IG 
Subjt:  DDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRSTIGA

Query:  LQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSVVAR
        L YL  TR DI++ +N+LSQF + P   H QAV ++L Y+ GT   GL +   + + +  FSDA + S  D R+S   YC+FLG +L+SW SKKQ VV++
Subjt:  LQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSVVAR

Query:  SSTESEYRALSLASAEIIWLQQLLKELGCHSSKP-ILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTK---PLTH
        SS E+EYRALS A+ E++WL Q  +EL    SKP +L+CDN +A  +A N VFH RTKHIE D H VR++ ++ A       ++D+  D  T+   P+  
Subjt:  SSTESEYRALSLASAEIIWLQQLLKELGCHSSKP-ILWCDNISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTK---PLTH

Query:  TQFLYLRSKLGL
           +Y+ S  GL
Subjt:  TQFLYLRSKLGL

ATMG00240.1  Gag-Pol-related retrotransposon family protein  (e-value 2.8e-13, identity 46.15%)
Query:  YLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYC
        YLT TRPD+ + +N+LSQF         QAV +VL Y+ GT   GL +   S+L + AF+D+DWAS  D R+SV  +C
Subjt:  YLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYC
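
The % identity reported for each hit can be recomputed from the alignment midline. The sketch below is illustrative only and assumes the usual BLAST pairwise convention: a letter on the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is hypothetical and not part of the database; the alignment length is taken as the number of columns in the Query line.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity = identical columns / alignment columns, rounded to 2 places."""
    # Only letters on the midline count as identities; '+' and spaces do not.
    identical = sum(1 for ch in midline if ch.isalpha())
    # The Query line (including any '-' gap characters) spans every alignment column.
    return round(100.0 * identical / len(query), 2)


# Query and midline lines copied from the ATMG00240.1 alignment above.
query = "YLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYC"
midline = "YLT TRPD+ + +N+LSQF         QAV +VL Y+ GT   GL +   S+L + AF+D+DWAS  D R+SV  +C"

print(percent_identity(query, midline))  # 36 identities over 78 columns
```

For this hit the midline carries 36 identical residues over 78 alignment columns, which agrees with the 46.15% listed in the hit row.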

ATMG00810.1  DNA/RNA polymerases superfamily protein  (e-value 1.9e-54, identity 46.46%)
Query:  LLLLVYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFI
        + LL+YVDD+++TG+++ ++N LI +L + F++KDLG ++YFLGIQ+   PSGL L+Q KY + +L    +L  KP  +P  +    S+   K   DP  
Subjt:  LLLLVYVDDVIVTGNNSKMINRLIVELDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFI

Query:  YRSTIGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSK
        +RS +GALQYLT TRPDI+Y +N + Q +  PT   +  +KRVLRY+ GT   GL     S L+V AF D+DWA     R+S   +C FLG N++SWS+K
Subjt:  YRSTIGALQYLTTTRPDIAYIINQLSQFLQTPTDIHWQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSK

Query:  KQSVVARSSTESEYRALSLASAEIIW
        +Q  V+RSSTE+EYRAL+L +AE+ W
Subjt:  KQSVVARSSTESEYRALSLASAEIIW

ATMG00820.1  Reverse transcriptase (RNA-dependent DNA polymerase)  (e-value 2.5e-25, identity 48%)
Query:  MVTRGKAGIFKPKAWLSRQQVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQY
        M+TR KAGI K     S         EP  V  AL  P W  AM  E  AL +N+TW LVP   + N++G KW+F+ K ++DG++ R KARLVAKGFHQ 
Subjt:  MVTRGKAGIFKPKAWLSRQQVDWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQY

Query:  PGVDFFETFSPVVKASTIRIVLSLA
         G+ F ET+SPVV+ +TIR +L++A
Subjt:  PGVDFFETFSPVVKASTIRIVLSLA


Sequences
CDS sequence
ATGGCCAACGCCTCATCAATGTCGTCTACATCAGTGACCAACGTAGGCAATACAACATTCACCAGTCCACCGCTCAATCAATTATTGAATCAGATTACCACTATCAAGCT
GGATCGTGGAAATTACCTCCTCTGGAAGAATCTGGCAATGCCCATCCTTCGCAGCTATAAACTCGAAGGTCATCTACTGGGAACAAAATCATGTCCCCCAGAATTTATTC
GACAAGATGGTGAACCAGTCGAAGTTACTTCTGGAGCAGCTATCGGAGCACCCAGCTCTCAAACTGATGGAAGTGGTGCTTCCACATCTGAAGCAAGACTATCGATGAAT
CCTCAATATGAAGCCTGGGTTACGGTTGATCAACTCCTTCTCGGATGGTTATACAACTCCATGACACCAGAGGTTGCGACTCAGGTAATGGGAATAGAAAATGCGAAAGA
TCTCTGGAGTGCTATTCAGGAACTTTTTGGAGTACAGTCAAGAGCTGAAGAGGATTTCCTTCGCCAAACCTTCCAACAGACCAGGAAAGGTAACTCGAAAATGTCTGATT
ATCTTCGTTTAATGAAAACCCATGCTGATAACCTGGGATTGGCTGGGAGTCCTGTATCGAATAGAAATTTGGTCTCTCAAGTTTTGTTAGGTCTGGATGAAGAGTACAAT
GCTATTGTTGCAATGATACAAGGTCGAGCAAGCGTGACCTGGGCGGAACTACAAGCTGAGCTTCTGGTCTTTGAGAAGCGGTTAGAGTTACAAAACTCAGTAAAAAATAC
AACTACCTTCAGTCAAAATGCTTTAGCCAACATGGCTTCCAGCAAAGGAGTAAGTTCTCCAAAGCAAACTAATCAAATCACTAGCAATGGAAATGGAAATCGACCATGGT
ACAATAACTACAATCAGAGAGGCAGTGGTAATCGTGGTCGAGGCAGAGGGCGAGGTTACAACAATTACAACAATAGGCAGATTTGTCAAGTATGCGGAAAGGTAGGTCAC
TCAGCCCTTGTATGTTATAATAGGTTTAATAAGGAATTTTCTCCTATTCAGAACAGGGGAAATGGAAATGGAAATGGAAATCATAATCAGAACAGGGGACAGAATCAACA
ATCCAATGCGTTCATGGCCACTCAACCAACTGCCACCCCTGAGACATTAGCGGATCCCAATTGGTATGCGGACAGTGGAGCTTCAAATCATGTGACAAGCAACTATGACA
ACCTCTCCAACCCCACTGACTATGAAGGTAATGAGTGTGTGACCATAGGCAATGGGGATAAATTACCTATAACCTGCATAGGATCATCAAGATTGACTGATGGAAACCAT
GTTTTACAATTAGAACATGTTTTATGTGTACCTGACATAGCTAAAAACCTAGTGAGCATGTCTAAGTTGGCACAAGATAATAATGTGTTCATTGAGTTTCATGGTAACTT
TTGCCTTGTTAAGGACAAGACTACGGGTCGTGTGGTGCTGAAAGGAGCTCTTAAAGATGGTCTTTATCAATTACAAGGAGTCAACTTGAGGAACCTCTCATTTTCTGCTA
GTTCAAGTTCAATGCGGCAAGAGAATAAAATTGAGAAAAGTTACAATGAAGGAGCTGTTTTTGTTGTGTCCAATGTAGTTCCTTGTGCCAATATGGCTGTGTCTAAGAAA
ATATGGCATAGACGTCTTGGTCACCCCTCTGAAAAAGTGTTGAACTCTATAGTGAAGGATTGTAAACTCTCAGTTAAAGTTAATGAACCTCTTCAGTTTTGTGAATCTTG
CCAGTTTGGAAAGTCACATGCTCTTAAATTTCCTCTATCTGATTCTAGGGCCTCGAAACGATTTGATCTTATTCACACTGATATCTGGGGTCCAGCTCCTGTATTGTCTG
GTGATGGTTATCGTTATTATGTCTTATTTTTGGATGATTATAGTCGGTATGTTTGGTTATATCCACTAAAATTGAAAAGTGATACACTGTCAGCTTTTAATCACTTTCTC
ACAATGGTTAAGACTCAGTTTGGTAGCATGATCAAGGCAATTCAGTCTGATAATGGTGGAGAGTATGTGAAGGTTCACAGGTTGTGTAATCAGTTGGGCATTCAGTCTCG
ATACTCATGTCCACATACATCGGCACAAAATGGGAGAGCAGAGAGAAAGCACCGCCATGTGGTAGAAACCGGACTCACTCTACTCGCACAAGCGTCTATGCCTTTGGCTT
ACTGGTGGGATGCCTTTATGGCAGCTGCAAGGTTGATTAATGGTTTACCAACCACTGTTCTGAAAGGTAAGTCTCCAATGGAGCTCATGTTTTTAAAGAAACTTGACTTC
ACTGCTTTAAAAACATTTGGTTGTTCCTGTTATCCATGCTTGAGGCCATATCAAAACCATAAGTTTTATTTTCATACTGATCAGTGTGTGAATCTTGGCTTGAGTGCTTC
TCATAAAGGGTACAGGTGTATGAACAAGGCTGGGAGAGTTTTTGTCTCCAGACATGTGAAATTTGATGAAGAAACTTTTCCATTTGCGGCTGGATTTGGGACTGTCGATT
CCTCAATGTCAGGGTCCAATACAACATTAGCCCCACATATCCTACAGTGGTTTCCCCAACCAAATATTCCTCAATCTGGTATCTTTTCACCACCTGTAAATCAGCCTCCC
CTGACATGTGTTCAACCATCTCCGTCTCCTGCTCCTTTACAACAACCTACAGGCCAAAATAACGAACCTTGCTCACAAACCAGTCCATCTCCTCCTCCATCACAACAACC
TGCAGTCCAAAATACGTCTCCCTCAATACTGCCATTTCCTAACCAAGAAACCTCAGTATCATCACCAGATTCTAACACATCTCAAACCTCCCCTGCTTCTGAGCCATCAC
CAGAGACCATCCTAAATTCGAATCCTTGTCCTCAATCCACTCATCCCATGGTTACTCGTGGGAAAGCTGGAATTTTCAAGCCCAAAGCCTGGTTATCTCGACAACAAGTT
GATTGGTCTTTGACTGAGCCCACTCGTGTGCAAGATGCGTTAGCTACCCCTCAGTGGAAAGCTGCAATGGATACTGAATTCTCAGCCTTGATCAAGAATCAAACCTGGTC
CCTAGTTCCGCATGCTCCCTCCTTCAACGTAGTCGGCAACAAATGGATATTCCGAATAAAACGGAATGCAGATGGCTCTATTCAAAGGTACAAGGCCAGACTTGTAGCTA
AGGGCTTTCACCAATATCCTGGGGTTGATTTCTTTGAGACATTCAGTCCGGTGGTAAAAGCCTCCACCATTAGAATTGTTTTGAGCTTGGCAGTAACAAGGGGCTGGGAA
CTTCGTCAGTTGGACTTCAATAATGCTTTTTTGAACGGTACTCTTAATGAGGTTGTGTACATGAAGCAGCCTCCTGGCTATGTGGATCCCAACCGTCCTAATCACGTGTG
CAAACTAAAAAAGGCCATTTATGGCCTTAAACAGGCGCCAAGAGCATGGAACACAACCCTCAAAGCAGTTCTCTTGTCATGGGGATTTCATAACTCGAGGTCAGATAATT
CTCTTTTCATCTTTCGCACTGAGAATGTATGCTTGTTGCTGTTGGTATATGTCGATGATGTAATCGTGACTGGTAATAACTCAAAAATGATTAATCGACTGATTGTTGAG
CTGGATAACCGATTTGCACTCAAAGATCTGGGGCGACTGAATTATTTTCTGGGGATTCAAGTAACATATATACCCTCCGGGTTACTATTGACTCAGGCCAAATATATAGA
TGATCTTCTGACTAAGCTGGACCTGTTGCATCTTAAACCAGCACCATCTCCTTGTGTTATTGGTAAGAAGATGTCTATTCATGATGGTAAACCTTTGGAGGATCCATTCA
TTTACAGAAGTACAATTGGGGCTCTCCAATATCTTACCACTACACGTCCTGACATCGCTTATATAATCAACCAACTGAGTCAATTCCTTCAAACACCAACTGATATACAC
TGGCAAGCTGTAAAGAGAGTTCTTCGTTATCTCACCGGCACCAAACACCTAGGTCTGTTGTTTCAACCAGGTTCAAACCTTTCTGTTTCAGCATTCTCGGATGCTGATTG
GGCCTCCAATATTGATGATCGTAAGTCAGTTGCCGCCTACTGTGTGTTCCTTGGAAATAACTTGGTGTCATGGTCATCAAAGAAGCAATCAGTTGTTGCACGCTCAAGTA
CAGAGTCAGAATATCGAGCATTATCTCTTGCTTCAGCAGAAATCATCTGGCTTCAACAACTTCTCAAGGAGCTTGGCTGTCACTCCTCAAAACCAATCCTCTGGTGCGAC
AATATAAGTGCAGGAGCGCTAGCAGCTAATCCTGTGTTTCATGCCCGAACGAAACATATAGAAGTTGACGTCCACTTTGTTCGAGATCAAATACTTTGGGGGGCTTTAGA
AGTTCGCTATGTGCCATCTCATGACCAGCTCGCAGATTGTCTTACGAAACCACTCACTCACACGCAGTTTCTATATCTTAGATCCAAACTCGGGCTTGTTGACACTCCCT
CTCGTTTGAGGGGGGATATTAAGGAACCGAGTCACAGTGTCAGCTCAGCATCACCATCCAAGAAACATAACCCAGAGGAAGAACAGGCCAAAGGGTCGGGCCAAGGCCGA
AGGGATCAAGTTTTTGGCCCGGCCCCTGGGCCTAGGTCAAGCTCTTCCGCCCCCGTTTGGTCCCCGATGCTCTTGGACGCCTCGTTTCCACCTGGTTCAGCCCTGGATCA
CCTCCGAACACCTAGAAACCCTAGAGCAGGAACATCTGACTTAAGCATCGGAGGCGGTGTGGCAAGCACCACACCGATGTGCAGGTTTCTCTGTCTTGAATTATGGCCAC
GTCTTCCTCCCTCTCAAACAAATTTACCGTTGGTGACACGTGAAAGTCAGACATTTGCTCCAATATACTCCCTTGTTAGAATGATAGACATAAGCATACTAAAAACAGGA
CAACGAAAAGAGTTTCTATATACGGAGTCTGATCTGAGGGCTGACGAGCTTGATCTCAGATGGAGCAAGGTTGAATCTTCAACTTACTCTCTAACTTCTGAAGACTCTGG
ACCTTGGGAGAGTGCTCTGGACTCTAGCTTCTCTGACTCTGTCTTGAATTCGAGCAATACCCTCCCTTCGTGCCTTTTTGTTGTGCTATCAGAATTGGAATTAACAAATT
CATCTGACGATCTCTTGGACACAACATCAACCTTTGCCATCATTTGGAAGCAATCGTTTCCAAATTCACTTGCAGTGATTTCACTCCTGCTAGACCAGTCCCCTATTTCA
TTGGGTGATCTGAGTAACCTGCAGGGGTAG
mRNA sequence
ATGGCCAACGCCTCATCAATGTCGTCTACATCAGTGACCAACGTAGGCAATACAACATTCACCAGTCCACCGCTCAATCAATTATTGAATCAGATTACCACTATCAAGCT
GGATCGTGGAAATTACCTCCTCTGGAAGAATCTGGCAATGCCCATCCTTCGCAGCTATAAACTCGAAGGTCATCTACTGGGAACAAAATCATGTCCCCCAGAATTTATTC
GACAAGATGGTGAACCAGTCGAAGTTACTTCTGGAGCAGCTATCGGAGCACCCAGCTCTCAAACTGATGGAAGTGGTGCTTCCACATCTGAAGCAAGACTATCGATGAAT
CCTCAATATGAAGCCTGGGTTACGGTTGATCAACTCCTTCTCGGATGGTTATACAACTCCATGACACCAGAGGTTGCGACTCAGGTAATGGGAATAGAAAATGCGAAAGA
TCTCTGGAGTGCTATTCAGGAACTTTTTGGAGTACAGTCAAGAGCTGAAGAGGATTTCCTTCGCCAAACCTTCCAACAGACCAGGAAAGGTAACTCGAAAATGTCTGATT
ATCTTCGTTTAATGAAAACCCATGCTGATAACCTGGGATTGGCTGGGAGTCCTGTATCGAATAGAAATTTGGTCTCTCAAGTTTTGTTAGGTCTGGATGAAGAGTACAAT
GCTATTGTTGCAATGATACAAGGTCGAGCAAGCGTGACCTGGGCGGAACTACAAGCTGAGCTTCTGGTCTTTGAGAAGCGGTTAGAGTTACAAAACTCAGTAAAAAATAC
AACTACCTTCAGTCAAAATGCTTTAGCCAACATGGCTTCCAGCAAAGGAGTAAGTTCTCCAAAGCAAACTAATCAAATCACTAGCAATGGAAATGGAAATCGACCATGGT
ACAATAACTACAATCAGAGAGGCAGTGGTAATCGTGGTCGAGGCAGAGGGCGAGGTTACAACAATTACAACAATAGGCAGATTTGTCAAGTATGCGGAAAGGTAGGTCAC
TCAGCCCTTGTATGTTATAATAGGTTTAATAAGGAATTTTCTCCTATTCAGAACAGGGGAAATGGAAATGGAAATGGAAATCATAATCAGAACAGGGGACAGAATCAACA
ATCCAATGCGTTCATGGCCACTCAACCAACTGCCACCCCTGAGACATTAGCGGATCCCAATTGGTATGCGGACAGTGGAGCTTCAAATCATGTGACAAGCAACTATGACA
ACCTCTCCAACCCCACTGACTATGAAGGTAATGAGTGTGTGACCATAGGCAATGGGGATAAATTACCTATAACCTGCATAGGATCATCAAGATTGACTGATGGAAACCAT
GTTTTACAATTAGAACATGTTTTATGTGTACCTGACATAGCTAAAAACCTAGTGAGCATGTCTAAGTTGGCACAAGATAATAATGTGTTCATTGAGTTTCATGGTAACTT
TTGCCTTGTTAAGGACAAGACTACGGGTCGTGTGGTGCTGAAAGGAGCTCTTAAAGATGGTCTTTATCAATTACAAGGAGTCAACTTGAGGAACCTCTCATTTTCTGCTA
GTTCAAGTTCAATGCGGCAAGAGAATAAAATTGAGAAAAGTTACAATGAAGGAGCTGTTTTTGTTGTGTCCAATGTAGTTCCTTGTGCCAATATGGCTGTGTCTAAGAAA
ATATGGCATAGACGTCTTGGTCACCCCTCTGAAAAAGTGTTGAACTCTATAGTGAAGGATTGTAAACTCTCAGTTAAAGTTAATGAACCTCTTCAGTTTTGTGAATCTTG
CCAGTTTGGAAAGTCACATGCTCTTAAATTTCCTCTATCTGATTCTAGGGCCTCGAAACGATTTGATCTTATTCACACTGATATCTGGGGTCCAGCTCCTGTATTGTCTG
GTGATGGTTATCGTTATTATGTCTTATTTTTGGATGATTATAGTCGGTATGTTTGGTTATATCCACTAAAATTGAAAAGTGATACACTGTCAGCTTTTAATCACTTTCTC
ACAATGGTTAAGACTCAGTTTGGTAGCATGATCAAGGCAATTCAGTCTGATAATGGTGGAGAGTATGTGAAGGTTCACAGGTTGTGTAATCAGTTGGGCATTCAGTCTCG
ATACTCATGTCCACATACATCGGCACAAAATGGGAGAGCAGAGAGAAAGCACCGCCATGTGGTAGAAACCGGACTCACTCTACTCGCACAAGCGTCTATGCCTTTGGCTT
ACTGGTGGGATGCCTTTATGGCAGCTGCAAGGTTGATTAATGGTTTACCAACCACTGTTCTGAAAGGTAAGTCTCCAATGGAGCTCATGTTTTTAAAGAAACTTGACTTC
ACTGCTTTAAAAACATTTGGTTGTTCCTGTTATCCATGCTTGAGGCCATATCAAAACCATAAGTTTTATTTTCATACTGATCAGTGTGTGAATCTTGGCTTGAGTGCTTC
TCATAAAGGGTACAGGTGTATGAACAAGGCTGGGAGAGTTTTTGTCTCCAGACATGTGAAATTTGATGAAGAAACTTTTCCATTTGCGGCTGGATTTGGGACTGTCGATT
CCTCAATGTCAGGGTCCAATACAACATTAGCCCCACATATCCTACAGTGGTTTCCCCAACCAAATATTCCTCAATCTGGTATCTTTTCACCACCTGTAAATCAGCCTCCC
CTGACATGTGTTCAACCATCTCCGTCTCCTGCTCCTTTACAACAACCTACAGGCCAAAATAACGAACCTTGCTCACAAACCAGTCCATCTCCTCCTCCATCACAACAACC
TGCAGTCCAAAATACGTCTCCCTCAATACTGCCATTTCCTAACCAAGAAACCTCAGTATCATCACCAGATTCTAACACATCTCAAACCTCCCCTGCTTCTGAGCCATCAC
CAGAGACCATCCTAAATTCGAATCCTTGTCCTCAATCCACTCATCCCATGGTTACTCGTGGGAAAGCTGGAATTTTCAAGCCCAAAGCCTGGTTATCTCGACAACAAGTT
GATTGGTCTTTGACTGAGCCCACTCGTGTGCAAGATGCGTTAGCTACCCCTCAGTGGAAAGCTGCAATGGATACTGAATTCTCAGCCTTGATCAAGAATCAAACCTGGTC
CCTAGTTCCGCATGCTCCCTCCTTCAACGTAGTCGGCAACAAATGGATATTCCGAATAAAACGGAATGCAGATGGCTCTATTCAAAGGTACAAGGCCAGACTTGTAGCTA
AGGGCTTTCACCAATATCCTGGGGTTGATTTCTTTGAGACATTCAGTCCGGTGGTAAAAGCCTCCACCATTAGAATTGTTTTGAGCTTGGCAGTAACAAGGGGCTGGGAA
CTTCGTCAGTTGGACTTCAATAATGCTTTTTTGAACGGTACTCTTAATGAGGTTGTGTACATGAAGCAGCCTCCTGGCTATGTGGATCCCAACCGTCCTAATCACGTGTG
CAAACTAAAAAAGGCCATTTATGGCCTTAAACAGGCGCCAAGAGCATGGAACACAACCCTCAAAGCAGTTCTCTTGTCATGGGGATTTCATAACTCGAGGTCAGATAATT
CTCTTTTCATCTTTCGCACTGAGAATGTATGCTTGTTGCTGTTGGTATATGTCGATGATGTAATCGTGACTGGTAATAACTCAAAAATGATTAATCGACTGATTGTTGAG
CTGGATAACCGATTTGCACTCAAAGATCTGGGGCGACTGAATTATTTTCTGGGGATTCAAGTAACATATATACCCTCCGGGTTACTATTGACTCAGGCCAAATATATAGA
TGATCTTCTGACTAAGCTGGACCTGTTGCATCTTAAACCAGCACCATCTCCTTGTGTTATTGGTAAGAAGATGTCTATTCATGATGGTAAACCTTTGGAGGATCCATTCA
TTTACAGAAGTACAATTGGGGCTCTCCAATATCTTACCACTACACGTCCTGACATCGCTTATATAATCAACCAACTGAGTCAATTCCTTCAAACACCAACTGATATACAC
TGGCAAGCTGTAAAGAGAGTTCTTCGTTATCTCACCGGCACCAAACACCTAGGTCTGTTGTTTCAACCAGGTTCAAACCTTTCTGTTTCAGCATTCTCGGATGCTGATTG
GGCCTCCAATATTGATGATCGTAAGTCAGTTGCCGCCTACTGTGTGTTCCTTGGAAATAACTTGGTGTCATGGTCATCAAAGAAGCAATCAGTTGTTGCACGCTCAAGTA
CAGAGTCAGAATATCGAGCATTATCTCTTGCTTCAGCAGAAATCATCTGGCTTCAACAACTTCTCAAGGAGCTTGGCTGTCACTCCTCAAAACCAATCCTCTGGTGCGAC
AATATAAGTGCAGGAGCGCTAGCAGCTAATCCTGTGTTTCATGCCCGAACGAAACATATAGAAGTTGACGTCCACTTTGTTCGAGATCAAATACTTTGGGGGGCTTTAGA
AGTTCGCTATGTGCCATCTCATGACCAGCTCGCAGATTGTCTTACGAAACCACTCACTCACACGCAGTTTCTATATCTTAGATCCAAACTCGGGCTTGTTGACACTCCCT
CTCGTTTGAGGGGGGATATTAAGGAACCGAGTCACAGTGTCAGCTCAGCATCACCATCCAAGAAACATAACCCAGAGGAAGAACAGGCCAAAGGGTCGGGCCAAGGCCGA
AGGGATCAAGTTTTTGGCCCGGCCCCTGGGCCTAGGTCAAGCTCTTCCGCCCCCGTTTGGTCCCCGATGCTCTTGGACGCCTCGTTTCCACCTGGTTCAGCCCTGGATCA
CCTCCGAACACCTAGAAACCCTAGAGCAGGAACATCTGACTTAAGCATCGGAGGCGGTGTGGCAAGCACCACACCGATGTGCAGGTTTCTCTGTCTTGAATTATGGCCAC
GTCTTCCTCCCTCTCAAACAAATTTACCGTTGGTGACACGTGAAAGTCAGACATTTGCTCCAATATACTCCCTTGTTAGAATGATAGACATAAGCATACTAAAAACAGGA
CAACGAAAAGAGTTTCTATATACGGAGTCTGATCTGAGGGCTGACGAGCTTGATCTCAGATGGAGCAAGGTTGAATCTTCAACTTACTCTCTAACTTCTGAAGACTCTGG
ACCTTGGGAGAGTGCTCTGGACTCTAGCTTCTCTGACTCTGTCTTGAATTCGAGCAATACCCTCCCTTCGTGCCTTTTTGTTGTGCTATCAGAATTGGAATTAACAAATT
CATCTGACGATCTCTTGGACACAACATCAACCTTTGCCATCATTTGGAAGCAATCGTTTCCAAATTCACTTGCAGTGATTTCACTCCTGCTAGACCAGTCCCCTATTTCA
TTGGGTGATCTGAGTAACCTGCAGGGGTAG
Protein sequence
MANASSMSSTSVTNVGNTTFTSPPLNQLLNQITTIKLDRGNYLLWKNLAMPILRSYKLEGHLLGTKSCPPEFIRQDGEPVEVTSGAAIGAPSSQTDGSGASTSEARLSMN
PQYEAWVTVDQLLLGWLYNSMTPEVATQVMGIENAKDLWSAIQELFGVQSRAEEDFLRQTFQQTRKGNSKMSDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYN
AIVAMIQGRASVTWAELQAELLVFEKRLELQNSVKNTTTFSQNALANMASSKGVSSPKQTNQITSNGNGNRPWYNNYNQRGSGNRGRGRGRGYNNYNNRQICQVCGKVGH
SALVCYNRFNKEFSPIQNRGNGNGNGNHNQNRGQNQQSNAFMATQPTATPETLADPNWYADSGASNHVTSNYDNLSNPTDYEGNECVTIGNGDKLPITCIGSSRLTDGNH
VLQLEHVLCVPDIAKNLVSMSKLAQDNNVFIEFHGNFCLVKDKTTGRVVLKGALKDGLYQLQGVNLRNLSFSASSSSMRQENKIEKSYNEGAVFVVSNVVPCANMAVSKK
IWHRRLGHPSEKVLNSIVKDCKLSVKVNEPLQFCESCQFGKSHALKFPLSDSRASKRFDLIHTDIWGPAPVLSGDGYRYYVLFLDDYSRYVWLYPLKLKSDTLSAFNHFL
TMVKTQFGSMIKAIQSDNGGEYVKVHRLCNQLGIQSRYSCPHTSAQNGRAERKHRHVVETGLTLLAQASMPLAYWWDAFMAAARLINGLPTTVLKGKSPMELMFLKKLDF
TALKTFGCSCYPCLRPYQNHKFYFHTDQCVNLGLSASHKGYRCMNKAGRVFVSRHVKFDEETFPFAAGFGTVDSSMSGSNTTLAPHILQWFPQPNIPQSGIFSPPVNQPP
LTCVQPSPSPAPLQQPTGQNNEPCSQTSPSPPPSQQPAVQNTSPSILPFPNQETSVSSPDSNTSQTSPASEPSPETILNSNPCPQSTHPMVTRGKAGIFKPKAWLSRQQV
DWSLTEPTRVQDALATPQWKAAMDTEFSALIKNQTWSLVPHAPSFNVVGNKWIFRIKRNADGSIQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRIVLSLAVTRGWE
LRQLDFNNAFLNGTLNEVVYMKQPPGYVDPNRPNHVCKLKKAIYGLKQAPRAWNTTLKAVLLSWGFHNSRSDNSLFIFRTENVCLLLLVYVDDVIVTGNNSKMINRLIVE
LDNRFALKDLGRLNYFLGIQVTYIPSGLLLTQAKYIDDLLTKLDLLHLKPAPSPCVIGKKMSIHDGKPLEDPFIYRSTIGALQYLTTTRPDIAYIINQLSQFLQTPTDIH
WQAVKRVLRYLTGTKHLGLLFQPGSNLSVSAFSDADWASNIDDRKSVAAYCVFLGNNLVSWSSKKQSVVARSSTESEYRALSLASAEIIWLQQLLKELGCHSSKPILWCD
NISAGALAANPVFHARTKHIEVDVHFVRDQILWGALEVRYVPSHDQLADCLTKPLTHTQFLYLRSKLGLVDTPSRLRGDIKEPSHSVSSASPSKKHNPEEEQAKGSGQGR
RDQVFGPAPGPRSSSSAPVWSPMLLDASFPPGSALDHLRTPRNPRAGTSDLSIGGGVASTTPMCRFLCLELWPRLPPSQTNLPLVTRESQTFAPIYSLVRMIDISILKTG
QRKEFLYTESDLRADELDLRWSKVESSTYSLTSEDSGPWESALDSSFSDSVLNSSNTLPSCLFVVLSELELTNSSDDLLDTTSTFAIIWKQSFPNSLAVISLLLDQSPIS
LGDLSNLQG
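
The protein sequence above corresponds to a translation of the CDS sequence. As a minimal sketch of how that relationship can be checked, the code below translates a CDS using the standard genetic code (an assumption; the page does not state which translation table was used). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS (line breaks allowed) and drop one trailing stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 24 and last 15 nucleotides of the CDS listed above.
print(translate("ATGGCCAACGCCTCATCAATGTCG"))  # start of the protein
print(translate("AACCTGCAGGGGTAG"))           # end of the protein, stop removed
```

The two fragments translate to `MANASSMS` and `NLQG`, matching the first and last residues of the listed protein sequence. Pasting the full CDS block into `translate` would allow the whole protein to be compared in the same way.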