; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005866 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005866
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr6:32540165..32544525
RNA-Seq ExpressionLag0005866
SyntenyLag0005866
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036397 - Ribonuclease H superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]0.0e+0046.36Show/hide
Query:  NPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPERE--KWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWK
        N   S ++VKLD  NY LWKS+V+ ++RG KL+G++LGT   P   ++  T+S ++ N      +W   DQ LLGW+ ++M+  +A+++++ +TS+++W 
Subjt:  NPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPERE--KWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWK

Query:  ALEELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQGPMSWQEFHATLVTFENTLLR
          + L GA  +S++  L++   + RK  M+M +YL +MK   + L LAG P+S S LI   L GLDSE+ P++ +++ Q  +SW +  A L+TFE+ + +
Subjt:  ALEELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQGPMSWQEFHATLVTFENTLLR

Query:  L-KVINLDGDASANFVVANQ--------NNNIQSSNS-----SQERNSEETIPLKTVSEDMDEIEGGVMGHSALVCYYRYDPPTVGQQHNSPRPA-----
        L  + NL  +A+AN  VAN+        NNN + SNS      + R      P +           G+  H A+ C++R+D       H++         
Subjt:  L-KVINLDGDASANFVVANQ--------NNNIQSSNS-----SQERNSEETIPLKTVSEDMDEIEGGVMGHSALVCYYRYDPPTVGQQHNSPRPA-----

Query:  ANIATAEVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVE
        A +A+   + D  WY DSGA+NH T +       +E+ GK SLVVGNG KL I   G S +      K + L  IL+VP+I KNL+S+SKL ADNN+ VE
Subjt:  ANIATAEVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVE

Query:  FHSHMCFVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPSNKILHSVVKSCNLQCPINEKP
        F  + CFVKDK TGKV+L+G+LKDGLYQL+     P +                 SVK S          WH+RLGHP+NK+L  V++SC ++ P ++  
Subjt:  FHSHMCFVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPSNKILHSVVKSCNLQCPINEKP

Query:  PFCDACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGE
         FC+AC YGK HLLPF ++ +    PLELV++D+WGP+P+ + +G+++Y+ F+DDF+R T+I+PLK KSE    F+ FK L EN+F+++IK +Q D GGE
Subjt:  PFCDACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGE

Query:  YRPLTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCF
        Y+P+       GI FR  CP+TS+QNGR ERKHRHI E GLTLLAQA +PL +WWEAFS+A YLINRLP+      SP+     ++PDYK LK FGC+C+
Subjt:  YRPLTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCF

Query:  PCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLSPQGRRYISRHVLFNEVEFPY------ISQPTTQPISSPNNSMSYCLVPPPTNHFNPVPTISTSNVQ
        PCL+PY +HK  +H+ +C FLGY  S+KGY CL+  GR +ISRHV+FNE  FP+         P    I+ P+ S   C             T       
Subjt:  PCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLSPQGRRYISRHVLFNEVEFPY------ISQPTTQPISSPNNSMSYCLVPPPTNHFNPVPTISTSNVQ

Query:  AAPPPSLVDTPPASNIVVSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIPTHISTHPMQTRAKCGIFKPKI-FQAQISPPTDLCEPTSV
        A+ P    + P  +N   +  S  V+  T    +  S         +  +    +   S  T+ S H + TR+K GI KPK+ +        D  EP + 
Subjt:  AAPPPSLVDTPPASNIVVSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIPTHISTHPMQTRAKCGIFKPKI-FQAQISPPTDLCEPTSV

Query:  KQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWI
        K+AL    WK AM  E++AL+ N TW LVP     N+V +KWVF+ K+ PDGS++R KARLVAKGF QT  IDY ETFSPVIK  TV++IL++AV  NW 
Subjt:  KQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWI

Query:  VRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGHQIFLLVYVDDILITGTS
        VRQLDINNAFLNGHL E V+M QPEG+VD TKP ++CKL KA+YGLKQAPRAW+D LK  L  W FQN+K+DSSLF      H  FLL+YVDDI++TG++
Subjt:  VRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGHQIFLLVYVDDILITGTS

Query:  PTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTIGALQYLSHTRP
           +  FI +L + F+LKDLGHLHYFLGIEV R++SGM L Q +YI DLL +  +++  PCPTP + G   +  +G  L  PT +R  IG LQYL+HT P
Subjt:  PTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTIGALQYLSHTRP

Query:  DIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVVARSSAESEYRA
        DIAF VNKLSQ++ +P+ DHWQ +KR+LRYLQGT++Y LHI P ++L +TG+SDADWA   DDR+S+   C+F G +L+SWSS+KQ VV+RSS ESEYRA
Subjt:  DIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVVARSSAESEYRA

Query:  MAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTKPLSSAVFQVFRSKLGV
        +A   AEI W+ +LL EL L  P   +LW DN+SA +LASNPV H+R+KHIE+DVH+IRD+VL   + V YVP+ DQ+AD LTKPLS   F   R KLGV
Subjt:  MAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTKPLSSAVFQVFRSKLGV

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]0.0e+0044.9Show/hide
Query:  NPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWKAL
        N   SI++VKLD  NY LWKS+V++++RG KL+G+ILGTT  P      A  S    NP+   W   DQ LLGWL ++M+  +A+++++ +TS+++W   
Subjt:  NPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWKAL

Query:  EELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQGPMSWQEFHATLVTFENTLLRLK
        + L GA  KSR+  L++    TRK  M+M EYL +MK  S+ L LAG PISNS L+   L GLD+E+ P++ +++ Q  +SW +  A L+ FE+ L +  
Subjt:  EELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQGPMSWQEFHATLVTFENTLLRLK

Query:  VIN-LDGDASANFV-----VANQNNNIQSSNSSQERNSEETIPLKTVSEDMDEIEGGVMGHSALVCYYRYDPPTVGQQHNSPRP-----AANIATAEVIA
          + L  +ASANF        N+ N+  +   S  R          +S    ++  G  GH A+ C YR+D P  G+ +++        +A IA+     
Subjt:  VIN-LDGDASANFV-----VANQNNNIQSSNSSQERNSEETIPLKTVSEDMDEIEGGVMGHSALVCYYRYDPPTVGQQHNSPRP-----AANIATAEVIA

Query:  DQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVEFHSHMCFVKD
        D  WY DSGA NH T +       +E+ GK SL+VGNG KLKI   G + + N      + L  +L+VP I KNL+S+SKLTADNN+ VEF ++ C VKD
Subjt:  DQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVEFHSHMCFVKD

Query:  KQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPSNKILHSVVKSCNLQCPINEKPPFCDACVYGK
        K TG+ +L+G LKDGLYQL+ N  P                 +  SVK S          WH++LGHP+NK+L  V+K CN++   +++  FC+AC +GK
Subjt:  KQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPSNKILHSVVKSCNLQCPINEKPPFCDACVYGK

Query:  AHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGEYRPLTSHLKA
         HLLPF  + +    PL L++SD+WGP+P+ S +G+++Y+ F+DDF+R T+IFPLK KS+    F+ FK L EN+F++KIK +Q D GGEY+ +      
Subjt:  AHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGEYRPLTSHLKA

Query:  LGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCFPCLRPYQKHK
         GI FR  CP+TS+QNGR ERKHRH+ E+GLTLLAQA +PL++WWEAFS+A YLINRLP+      SP+   F R+PDY  LK FGC+C+PCL+PY +HK
Subjt:  LGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCFPCLRPYQKHK

Query:  FDFHSIKCTFLGYDESYKGYICLSPQGRRYISRHVLFNEVEFPYIS--QPTTQPISSPNNSMSYCLVPPPTNHFNPVPTISTSNVQAAPPPSLVDTPPAS
          FH+ +C F+GY  S+KGY C++  GR ++SRHV+FNE  FP+      T  P+ +  ++ S  L         P  +   +   A  P +   +   +
Subjt:  FDFHSIKCTFLGYDESYKGYICLSPQGRRYISRHVLFNEVEFPYIS--QPTTQPISSPNNSMSYCLVPPPTNHFNPVPTISTSNVQAAPPPSLVDTPPAS

Query:  NIV-----------VSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIPTHISTHPMQTRAKCGIFKPKI-FQAQISPPTDLCEPTSVKQA
        + +           V  S   V+      + + +   V       ++ +  +Q  +   + +TH M+TR+K GI KPKI +       ++  EP SVK+A
Subjt:  NIV-----------VSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIPTHISTHPMQTRAKCGIFKPKI-FQAQISPPTDLCEPTSVKQA

Query:  LLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWIVRQ
        L    WK AMD EY+AL+ N TW+LVP     N++ +KW+F+ K+  DGSI+R KARLVAKGF QT  +D+ ETFSPV+K  TV++ILT+AV  NW VRQ
Subjt:  LLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWIVRQ

Query:  LDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGHQIFLLVYVDDILITGTSPTL
        LDINNAFLNG L E V+M QPEGY+D  KP ++CKL KA+YGLKQAPRAWYD L+  L  W FQN+K D+SLF+     H  FLL+YVDDI++TG++   
Subjt:  LDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGHQIFLLVYVDDILITGTSPTL

Query:  IDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTIGALQYLSHTRPDIA
        ++ F  +L   ++LKDLG LHYFLG+EV R+ SGM L Q +YI D+L +  +E+   CPTP V G    A +G  +++PT YR  IGALQYL++TRPDIA
Subjt:  IDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTIGALQYLSHTRPDIA

Query:  FIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVVARSSAESEYRAMAQ
        F VNKLSQ++  PT +HWQ +KR+LRYLQGT ++ LHI P +NL + G+ DADWA   DDR+S G  C+F G +LVSW+S+KQ VV+RSS ESEYR++A 
Subjt:  FIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVVARSSAESEYRAMAQ

Query:  ATAEI-------------------TWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTK
          AE+                   +    LL EL L      VLW DN+SA +LASNPV H+R+KHIE+D+H+IRD+VL   + + YVP+ DQ+AD LTK
Subjt:  ATAEI-------------------TWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTK

Query:  PLSSAVFQVFRSKLGV-FSPS
        PL    F + R KLGV  SPS
Subjt:  PLSSAVFQVFRSKLGV-FSPS

RVW64314.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0043.17Show/hide
Query:  SVTSVPAIQTG----NNGSPSLGNPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFS
        S T    +Q G    + G+       N  L +KLD  NYILW++ +  ++     E  I G    PP       TS+  TNP+   W   D+ +L W++S
Subjt:  SVTSVPAIQTG----NNGSPSLGNPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFS

Query:  TMSPSVASEVINLQTSREVWKALEELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINS-
        +++P +  +++  Q+S   W ALE ++ A++++RV QLR   QTTRK ++ M EY+ ++K  ++NLA  GEP+++   I  +L GL +++  I+  + + 
Subjt:  TMSPSVASEVINLQTSREVWKALEELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINS-

Query:  QGPMSWQEFHATLVTFENTLLRLKVINLDGDASANFVVANQNNNIQSSNSSQERNSEETIPLKT------VSEDMDEIE-GGVMGHSALVCYYRYD----
        +  MS    H+ L+T E  L     +  D   SAN       +     +S Q R S       T       S+   + +  G  GH+ + CY+R+D    
Subjt:  QGPMSWQEFHATLVTFENTLLRLKVINLDGDASANFVVANQNNNIQSSNSSQERNSEETIPLKT------VSEDMDEIE-GGVMGHSALVCYYRYD----

Query:  ---PPTVGQQHNSP----RPAANIATAEVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVP
           P     Q N P    +  A +A+   I+D++W+ D+GAT+H +  +  L+    Y G   ++VGNG  L+I H G +   ++  SK   L+++LHVP
Subjt:  ---PPTVGQQHNSP----RPAANIATAEVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVP

Query:  SIQKNLISISKLTADNNLFVEFHSHMCFVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPS
         I  NLIS+S+  ADNN F EFH    FVKD+ T K++LQG L+ GLY+      P +      F  +    V S+  + S  S + + +LWH RLGHP+
Subjt:  SIQKNLISISKLTADNNLFVEFHSHMCFVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPS

Query:  NKILHSVVKSCNL--QCPINEKPPFCDACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLH
        N IL  ++ SCN+  QC    K   C AC + K+H LPF+ + +  + PL L+++DLWGP+ + S  G R++I F+DDF+R ++I+PL +K +A S F+ 
Subjt:  NKILHSVVKSCNL--QCPINEKPPFCDACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLH

Query:  FKKLIENKFDRKIKTLQTDWGGEYRPLTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLS
        FK L+EN+F+ +I+ L++D GGE++  +S+L   GI  +  CP+T EQNGR ERK RHI+E GL LLA ASLP KFW  AF +A +LINRLPT  L+  S
Subjt:  FKKLIENKFDRKIKTLQTDWGGEYRPLTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLS

Query:  PFEKAFGRKPDYKFLKVFGCSCFPCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLSP-QGRRYISRHVLFNEVEFPYISQPTTQPISSPNNSMSYCLVP
        PF+  FG+ P+Y   K+FGC C+P +RPY K+K  + S +C FLGY  ++KGY+CL+P  GR Y++RHV+F+E  FP+         S+P+ S S   +P
Subjt:  PFEKAFGRKPDYKFLKVFGCSCFPCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLSP-QGRRYISRHVLFNEVEFPYISQPTTQPISSPNNSMSYCLVP

Query:  PPTNHFNPVPTISTSNVQAAPPPSLVDTPPASNIVVSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIPTHISTHPMQTRAKCGIFKPKI
         P   F P  +   S++++   PS   +PP +N+   PSS           ++S   ++ +    +++  PH      PT  + HPM TRAK GI K K+
Subjt:  PPTNHFNPVPTISTSNVQAAPPPSLVDTPPASNIVVSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIPTHISTHPMQTRAKCGIFKPKI

Query:  FQAQISPPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIK
        + +     + + EPT+  QA+    W  AM+ E+ AL  N TW LVPP  + N++G KWV++LK+ PDG++ RYKARLVA+GF QT  +DY ETFSPV+K
Subjt:  FQAQISPPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIK

Query:  PPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGH
          T+++IL +A+S NW V QLD+ NAFL+G L E V+M QP G+++   P++VCKL+KALYGLKQAPRAWY++L   L  W FQ S+ADSS+F + +   
Subjt:  PPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGH

Query:  QIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPT
         + LL+YVDDIL+TG+S   +  FIT+L + FAL+DLG+++YFLGIEV R+ +   L+Q +Y  DLL R  +   KP  TP ++G  LS   G P +  T
Subjt:  QIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPT

Query:  QYRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSS
         YRST+GALQYL+ TRPDI+F VNK  QF+  PTT HW AVKR+LRYL+GT+ YG+ +   ++L + GY+DADWA CPDDRRS G Y IF G +LVSWSS
Subjt:  QYRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSS

Query:  KKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLT
         KQ VV+RSSAESEYRA+A AT+E+ W+  +L ELCL   +P +LW DN SA  LA+NPVFH+RTKHIE+D+HFIRD VL K L ++Y+PS +Q+AD  T
Subjt:  KKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLT

Query:  KPLSSAVFQVFRSKLGVF-SPSRLRGD
        K +SS+ F  FR+KL V  SP  LRGD
Subjt:  KPLSSAVFQVFRSKLGVF-SPSRLRGD

RVW85836.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0043.07Show/hide
Query:  SQGSSMNDSVTSVPAIQTGNNGSPSLGNPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLG
        S GS  N ++ S PA Q             N  L VKLD  NYILWKS +  ++     E FI G++  P     +   S+   NP    W   D+T+L 
Subjt:  SQGSSMNDSVTSVPAIQTGNNGSPSLGNPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLG

Query:  WLFSTMSPSVASEVINLQTSREVWKALEELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQ
        WL+S+++P++ +++I   +S   W ALE+ + +++++R+ QLR  LQ+T+K ++ M +Y+ ++K  + +LA  GEP+S    + ++L GL S++  ++  
Subjt:  WLFSTMSPSVASEVINLQTSREVWKALEELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQ

Query:  IN-SQGPMSWQEFHATLVTFENTLLRLKVINLDGDASANFVVA--------------NQNNNIQSSNSSQE------RNSEETIPLKTVSEDMDEIEGGV
        IN     +S +  H+ L+ FE+ L +   I      SAN+  +               QN+   +SN +        R  +        SE       G 
Subjt:  IN-SQGPMSWQEFHATLVTFENTLLRLKVINLDGDASANFVVA--------------NQNNNIQSSNSSQE------RNSEETIPLKTVSEDMDEIEGGV

Query:  MGHSALVCYYRYDPPTVGQQHNSPRPA---------ANIATAEVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNN
         GH+  +CY+R+D      Q ++  P+         A +A++  +AD +WY DSGA++H T  ++NL   S Y G   + +GNG  L I++ G   + +N
Subjt:  MGHSALVCYYRYDPPTVGQQHNSPRPA---------ANIATAEVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNN

Query:  CLSKPIALKKILHVPSIQKNLISISKLTADNNLFVEFHSHMCFVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQ
          S    LKK+ HVP I  NLIS++K  +DNN  +EF S+  FVKD  T KV+ QG L++GLY+      P  +  ++ F  A+  H  ++S+  +    
Subjt:  CLSKPIALKKILHVPSIQKNLISISKLTADNNLFVEFHSHMCFVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQ

Query:  SLSFSLWHKRLGHPSNKILHSVVKSCNLQCPINEK---PPFCDACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTY
            +LWH RLGH S  I+  +++SCN+    N+       C +C   K+H LP H + +  + PLELV++DLWGP+PV S +G R++I FLDD++R T+
Subjt:  SLSFSLWHKRLGHPSNKILHSVVKSCNLQCPINEK---PPFCDACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTY

Query:  IFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGEYRPLTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSA
         +PL+ K +A   F  FK  +EN+FD KIK LQ+D GGE+R   + L+  GI  R  CP+ S QNGRVERKHRH+VE GL LLA ASLP++FW  AF +A
Subjt:  IFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGEYRPLTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSA

Query:  TYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCFPCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLS-PQGRRYISRHVLFNEVEFPYISQPTT
        T+LINR+P+  L   SP+   F + PDYK L+VFGC C+P +RPY  HK  + S++  FLGY    KG++CL    GR YI+ HV+F+E +FP      T
Subjt:  TYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCFPCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLS-PQGRRYISRHVLFNEVEFPYISQPTT

Query:  QPISSPNNSMSYCLVPPPTNHFNPVPTISTSNVQAAPPPSLVDTPPASNIVVSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIPTHIST
         P+S   ++ +  L P     F P PT  +     +P  SL  +P      +S +S +VS PT        +P+   L + +    P       P+    
Subjt:  QPISSPNNSMSYCLVPPPTNHFNPVPTISTSNVQAAPPPSLVDTPPASNIVVSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIPTHIST

Query:  HPMQTRAKCGIFKPK-IFQAQISPPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGF
          M TR   GI + K IF         + EP ++KQAL    W  AMD E  AL  N TW LV   P  N++G KWV++LK  PDGSI+RYKARLVAKG+
Subjt:  HPMQTRAKCGIFKPK-IFQAQISPPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGF

Query:  HQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKF
        +QT  +DY ETFSPV+K  T+++ILT+A+S  W +RQLD++NAFLNG L E VYM QP GY+D T PT VC+L KALYGLKQAPRAW+ RL   L +W F
Subjt:  HQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKF

Query:  QNSKADSSLFYYFANGHQIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAV
         NS+ DSS+F YF     + +LVYVDDI+ITG S T I   I KL ++FAL+DLG L YFLGIEV+ +   M L+Q +Y+ DLLHR G+   KP  TP  
Subjt:  QNSKADSSLFYYFANGHQIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAV

Query:  VGSHLSATQGTPLASPTQYRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRS
        VG +LS   G P+   TQYRS +GALQYL+ TRPDIAF VNK  QF+Q PT+ HW +VKR+LRYL+GT+  GL +SP +NLT+ G+SDADW   PDDRRS
Subjt:  VGSHLSATQGTPLASPTQYRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRS

Query:  IGAYCIFYGTSLVSWSSKKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKT
           Y ++ G +LVSWSS KQ VV+RSSAESEYRA+A ATAEI W+  LL ELC+  PA  +LW DN+SA  +A NPVFH+RTKHIE+D+HFIRD+V+   
Subjt:  IGAYCIFYGTSLVSWSSKKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKT

Query:  LEVRYVPSHDQLADFLTKPLSSAVFQVFRSKLGVF-SPSRLRGDVKDCNET
        +++ +VP+ DQ AD LTK L+S+ F   +S+L +   P  LRGD K  + T
Subjt:  LEVRYVPSHDQLADFLTKPLSSAVFQVFRSKLGVF-SPSRLRGDVKDCNET

RVX06084.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0042.96Show/hide
Query:  SVTSVPAIQTG----NNGSPSLGNPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFS
        S T    +Q G    + G+       N  L +KLD  NYILW++ +  ++     E  I G    PP       TS+  TNP+   W   D+ +L W++S
Subjt:  SVTSVPAIQTG----NNGSPSLGNPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFS

Query:  TMSPSVASEVINLQTSREVWKALEELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINS-
        +++P +  +++  Q+S   W ALE ++ A++++RV QLR   QTTRK ++ M EY+ ++K  ++NLA  GEP+++   I  +L GL +++  I+  + + 
Subjt:  TMSPSVASEVINLQTSREVWKALEELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINS-

Query:  QGPMSWQEFHATLVTFENTLLRLKVINLDGDASANFVVANQNNNIQSSNSSQERNSEETIPLKT------VSEDMDEIE-GGVMGHSALVCYYRYD----
        +  MS    H+ L+T E  L     +  D   SAN       +     +S Q R S       T       S+   + +  G  GH+ + CY+R+D    
Subjt:  QGPMSWQEFHATLVTFENTLLRLKVINLDGDASANFVVANQNNNIQSSNSSQERNSEETIPLKT------VSEDMDEIE-GGVMGHSALVCYYRYD----

Query:  ---PPTVGQQHNSP----RPAANIATAEVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVP
           P     Q N P    +  A +A+   I+D++W+ D+GAT+H +  +  L+    Y G   ++VGNG  L+I H G +   ++  SK   L+++LHVP
Subjt:  ---PPTVGQQHNSP----RPAANIATAEVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVP

Query:  SIQKNLISISKLTADNNLFVEFHSHMCFVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPS
         I  NLIS+S+  ADNN F EFH    FVKD+ T K++LQG L+ GLY+      P +      F  +    V S+  + S  S + + +LWH RLGHP+
Subjt:  SIQKNLISISKLTADNNLFVEFHSHMCFVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPS

Query:  NKILHSVVKSCNL--QCPINEKPPFCDACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLH
        + IL  ++ SCN+  QC    K   C AC + K+H LPF+ + +  + PL L+++DLWGP+ + S  G R++I F+DDF+R ++I+PL +K +A S F+ 
Subjt:  NKILHSVVKSCNL--QCPINEKPPFCDACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLH

Query:  FKKLIENKFDRKIKTLQTDWGGEYRPLTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLS
        FK L+EN+F+ +I+ L++D GGE++  +S+L   GI  +  CP+T EQNGR ERK RHI+E GL LLA ASLP KFW  AF +  +LINRLPT  L+  S
Subjt:  FKKLIENKFDRKIKTLQTDWGGEYRPLTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLS

Query:  PFEKAFGRKPDYKFLKVFGCSCFPCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLSP-QGRRYISRHVLFNEVEFPYISQPTTQPISSPNNSMSYCLVP
        PF+  FG+ P+Y   K+FGC C+P +RPY K+K  + S +C FLGY  ++KGY+CL+P  GR Y++RHV+F+E  FP+         S+P+ S S   +P
Subjt:  PFEKAFGRKPDYKFLKVFGCSCFPCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLSP-QGRRYISRHVLFNEVEFPYISQPTTQPISSPNNSMSYCLVP

Query:  PPTNHFNPVPTISTSNVQAAPPPSLVDTPPASNIVVSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIPTHISTHPMQTRAKCGIFKPKI
         P   F P  +   S++++   PS   +PP +N+   PSS           ++S   ++ +    +++  PH      PT  + HPM TRAK GI K K+
Subjt:  PPTNHFNPVPTISTSNVQAAPPPSLVDTPPASNIVVSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIPTHISTHPMQTRAKCGIFKPKI

Query:  FQAQISPPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIK
        + +     + + EPT+  QA+    W  AM+ E+ AL  N TW LVPP  + N++G KWV++LK+ PDG++ RYKARLVA+GF QT  +DY ETFSPV+K
Subjt:  FQAQISPPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIK

Query:  PPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGH
          T+++IL +A+S NW V QLD+ NAFL+G L E V+M QP G+++   P++VCKL+KALYGLKQAPRAWY++L   L  W FQ S+ADSS+F + +   
Subjt:  PPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGH

Query:  QIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPT
         + LL+YVDDIL+TG+S   +  FIT+L   FAL+DLG+++YFLGIEV  + +   L+Q +Y  DLL R  +   KP  TP ++G  LS   G P +  T
Subjt:  QIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPT

Query:  QYRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSS
         YRST+GALQYL+ TRPDI+F VNK  QF+  PTT HW AVKR+LRYL+GT+ YG+ +   ++L + GY+DADWA CPDDRRS G Y IF G +LVSWSS
Subjt:  QYRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSS

Query:  KKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLT
         KQ VV+RSSAESEYRA+A AT+E+ W+  +L ELCL   +P +LW DN SA  LA+NPVFH+RTKHIE+D+HFIRD VL K L ++Y+PS +Q+AD  T
Subjt:  KKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLT

Query:  KPLSSAVFQVFRSKLGVF-SPSRLRGD
        K +SS+ F  FR+KL V  SP  LRGD
Subjt:  KPLSSAVFQVFRSKLGVF-SPSRLRGD

TrEMBL top hitse value%identityAlignment
A0A2N9IMQ9 Integrase catalytic domain-containing protein0.0e+0043.96Show/hide
Query:  ILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATT--NPEREKWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWKALEEL
        ++ +KL  +NY+LWK+ V+  LRGQ L  F+ G++  P  I   ++   +TT  NPE  +W   DQ +L  L S++S  V + V+   TSR++W  LE +
Subjt:  ILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATT--NPEREKWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWKALEEL

Query:  YGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQ-GPMSWQEFHATLVTFENTLLRLKVI
        + A +++R+ Q+   L T RK +  + ++       ++ LA   +P+    L+S +LAGL  E+   +  +  +  P++    +  L+T E  L + +  
Subjt:  YGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQ-GPMSWQEFHATLVTFENTLLRLKVI

Query:  NLDGDASANFV----VANQNNNIQSSNSSQERNSEETIPLKTVSEDMDEIEGGV------------MGHSALVCYYRYDPPTVGQQHNSPRPAANIATAE
             ASANFV     +      ++ +SS     + T P    +                       GH AL CY+R+D     ++  S    A  +T +
Subjt:  NLDGDASANFV----VANQNNNIQSSNSSQERNSEETIPLKTVSEDMDEIEGGV------------MGHSALVCYYRYDPPTVGQQHNSPRPAANIATAE

Query:  VIADQSWYADSGATNHTTAELANLNIQS-EYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVEFHSHMC
           D +WY D+GATNH T++LANLN+ S EY G   + VGNG  L +AH G S +     S    L  +LHVP I KNLIS+ K T+D + F+EFH    
Subjt:  VIADQSWYADSGATNHTTAELANLNIQS-EYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVEFHSHMC

Query:  FVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPSNKILHSVVKSCNLQCPI---NEKPPFC
         VKD+ T K++ +G  K GLY    +S    + P  L  + +                  S   WH RLGHP+ K++  ++   +L  P+   N     C
Subjt:  FVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPSNKILHSVVKSCNLQCPI---NEKPPFC

Query:  DACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGEYRP
         AC+  K+  L F  +P     PLEL+Y+D+WGPSP+ S NG+++Y+SFLD ++R  ++FP+  K+E  S F+ F+K +E  FD KIK +Q+DWGGE+R 
Subjt:  DACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGEYRP

Query:  LTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCFPCL
        L     +LGI  R  CPHT +QNG +ERKHRHIVE GL LL+ A +PL++W +AFS+A YLINRLPTP L   +P+E  F  KP+Y FLKVFGC+C+P L
Subjt:  LTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCFPCL

Query:  RPYQKHKFDFHSIKCTFLGYDESYKGYICL-SPQGRRYISRHVLFNEVEFPYISQP--TTQPISSPNNSMSYCLVPPPTNHFNPVPTISTSNVQAAPPPS
        RPY KHK    S++C FLGY   +KGY CL  P GR YISR V+F E  FP  + P   T P  S ++ +   L           PTIS       PPP 
Subjt:  RPYQKHKFDFHSIKCTFLGYDESYKGYICL-SPQGRRYISRHVLFNEVEFPYISQP--TTQPISSPNNSMSYCLVPPPTNHFNPVPTISTSNVQAAPPPS

Query:  LVDTPPASNIVVSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIP--TH---ISTHPMQTRAKCGIFKPKIFQ-------------AQIS
        ++ +P          S+ +SP   ++ S  +S          T PS     PS+P  TH   +S+HPM TR+K  I KPK F              A+  
Subjt:  LVDTPPASNIVVSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIP--TH---ISTHPMQTRAKCGIFKPKIFQ-------------AQIS

Query:  PPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKL
        P   L EPT    A+   +W+ AM++E+ ALL N TW+LVP +   N+VGNKWVFR+K   DGS++RYKARLVAKGFHQ P IDY ETFSPV+KP TV+ 
Subjt:  PPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKL

Query:  ILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGHQIFLLV
        +L+LA+SKNW VRQLD+ NAFL+G L+E VYM QP G+  P  P +VCKLHKALYGLKQAPRAW+ RL   L  + F  S++DSSLF Y    + ++ L+
Subjt:  ILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGHQIFLLV

Query:  YVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTI
        YVDDI+IT +  + I   + +LG+ FA+KDLG L+YFLGIEV   + G++L+Q++YI D+L R  +   KP  +P    +HLS  +G P   PT YRST+
Subjt:  YVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTI

Query:  GALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVV
        GALQYLS TRPDIAF VNKLSQF+  PTT HWQ+VKR+LRYL+ T+ +GLHI P S   L G++DADWA   DDRRS G YCIF G++LVSWS KKQ  V
Subjt:  GALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVV

Query:  ARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTKPLSSA
        ARSS E+EY+A+A A AEITW   LL EL +   +P +LW DN+ A  L+SNPVFH+RTKH+E+D HF+RD V  +T+++R++ S DQLAD  TKPLS+A
Subjt:  ARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTKPLSSA

Query:  VFQVFRSKLGVFS-PSRLRGDVKD
         F + R+KL V   P  LRG VKD
Subjt:  VFQVFRSKLGVFS-PSRLRGDVKD

A0A2Z6MBG6 Integrase catalytic domain-containing protein0.0e+0046.36Show/hide
Query:  NPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPERE--KWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWK
        N   S ++VKLD  NY LWKS+V+ ++RG KL+G++LGT   P   ++  T+S ++ N      +W   DQ LLGW+ ++M+  +A+++++ +TS+++W 
Subjt:  NPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPERE--KWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWK

Query:  ALEELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQGPMSWQEFHATLVTFENTLLR
          + L GA  +S++  L++   + RK  M+M +YL +MK   + L LAG P+S S LI   L GLDSE+ P++ +++ Q  +SW +  A L+TFE+ + +
Subjt:  ALEELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQGPMSWQEFHATLVTFENTLLR

Query:  L-KVINLDGDASANFVVANQ--------NNNIQSSNS-----SQERNSEETIPLKTVSEDMDEIEGGVMGHSALVCYYRYDPPTVGQQHNSPRPA-----
        L  + NL  +A+AN  VAN+        NNN + SNS      + R      P +           G+  H A+ C++R+D       H++         
Subjt:  L-KVINLDGDASANFVVANQ--------NNNIQSSNS-----SQERNSEETIPLKTVSEDMDEIEGGVMGHSALVCYYRYDPPTVGQQHNSPRPA-----

Query:  ANIATAEVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVE
        A +A+   + D  WY DSGA+NH T +       +E+ GK SLVVGNG KL I   G S +      K + L  IL+VP+I KNL+S+SKL ADNN+ VE
Subjt:  ANIATAEVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVE

Query:  FHSHMCFVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPSNKILHSVVKSCNLQCPINEKP
        F  + CFVKDK TGKV+L+G+LKDGLYQL+     P +                 SVK S          WH+RLGHP+NK+L  V++SC ++ P ++  
Subjt:  FHSHMCFVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPSNKILHSVVKSCNLQCPINEKP

Query:  PFCDACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGE
         FC+AC YGK HLLPF ++ +    PLELV++D+WGP+P+ + +G+++Y+ F+DDF+R T+I+PLK KSE    F+ FK L EN+F+++IK +Q D GGE
Subjt:  PFCDACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGE

Query:  YRPLTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCF
        Y+P+       GI FR  CP+TS+QNGR ERKHRHI E GLTLLAQA +PL +WWEAFS+A YLINRLP+      SP+     ++PDYK LK FGC+C+
Subjt:  YRPLTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCF

Query:  PCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLSPQGRRYISRHVLFNEVEFPY------ISQPTTQPISSPNNSMSYCLVPPPTNHFNPVPTISTSNVQ
        PCL+PY +HK  +H+ +C FLGY  S+KGY CL+  GR +ISRHV+FNE  FP+         P    I+ P+ S   C             T       
Subjt:  PCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLSPQGRRYISRHVLFNEVEFPY------ISQPTTQPISSPNNSMSYCLVPPPTNHFNPVPTISTSNVQ

Query:  AAPPPSLVDTPPASNIVVSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIPTHISTHPMQTRAKCGIFKPKI-FQAQISPPTDLCEPTSV
        A+ P    + P  +N   +  S  V+  T    +  S         +  +    +   S  T+ S H + TR+K GI KPK+ +        D  EP + 
Subjt:  AAPPPSLVDTPPASNIVVSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIPTHISTHPMQTRAKCGIFKPKI-FQAQISPPTDLCEPTSV

Query:  KQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWI
        K+AL    WK AM  E++AL+ N TW LVP     N+V +KWVF+ K+ PDGS++R KARLVAKGF QT  IDY ETFSPVIK  TV++IL++AV  NW 
Subjt:  KQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWI

Query:  VRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGHQIFLLVYVDDILITGTS
        VRQLDINNAFLNGHL E V+M QPEG+VD TKP ++CKL KA+YGLKQAPRAW+D LK  L  W FQN+K+DSSLF      H  FLL+YVDDI++TG++
Subjt:  VRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGHQIFLLVYVDDILITGTS

Query:  PTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTIGALQYLSHTRP
           +  FI +L + F+LKDLGHLHYFLGIEV R++SGM L Q +YI DLL +  +++  PCPTP + G   +  +G  L  PT +R  IG LQYL+HT P
Subjt:  PTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTIGALQYLSHTRP

Query:  DIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVVARSSAESEYRA
        DIAF VNKLSQ++ +P+ DHWQ +KR+LRYLQGT++Y LHI P ++L +TG+SDADWA   DDR+S+   C+F G +L+SWSS+KQ VV+RSS ESEYRA
Subjt:  DIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVVARSSAESEYRA

Query:  MAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTKPLSSAVFQVFRSKLGV
        +A   AEI W+ +LL EL L  P   +LW DN+SA +LASNPV H+R+KHIE+DVH+IRD+VL   + V YVP+ DQ+AD LTKPLS   F   R KLGV
Subjt:  MAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTKPLSSAVFQVFRSKLGV

A0A2Z6P4D5 Integrase catalytic domain-containing protein0.0e+0044.9Show/hide
Query:  NPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWKAL
        N   SI++VKLD  NY LWKS+V++++RG KL+G+ILGTT  P      A  S    NP+   W   DQ LLGWL ++M+  +A+++++ +TS+++W   
Subjt:  NPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWKAL

Query:  EELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQGPMSWQEFHATLVTFENTLLRLK
        + L GA  KSR+  L++    TRK  M+M EYL +MK  S+ L LAG PISNS L+   L GLD+E+ P++ +++ Q  +SW +  A L+ FE+ L +  
Subjt:  EELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQGPMSWQEFHATLVTFENTLLRLK

Query:  VIN-LDGDASANFV-----VANQNNNIQSSNSSQERNSEETIPLKTVSEDMDEIEGGVMGHSALVCYYRYDPPTVGQQHNSPRP-----AANIATAEVIA
          + L  +ASANF        N+ N+  +   S  R          +S    ++  G  GH A+ C YR+D P  G+ +++        +A IA+     
Subjt:  VIN-LDGDASANFV-----VANQNNNIQSSNSSQERNSEETIPLKTVSEDMDEIEGGVMGHSALVCYYRYDPPTVGQQHNSPRP-----AANIATAEVIA

Query:  DQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVEFHSHMCFVKD
        D  WY DSGA NH T +       +E+ GK SL+VGNG KLKI   G + + N      + L  +L+VP I KNL+S+SKLTADNN+ VEF ++ C VKD
Subjt:  DQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVEFHSHMCFVKD

Query:  KQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPSNKILHSVVKSCNLQCPINEKPPFCDACVYGK
        K TG+ +L+G LKDGLYQL+ N  P                 +  SVK S          WH++LGHP+NK+L  V+K CN++   +++  FC+AC +GK
Subjt:  KQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPSNKILHSVVKSCNLQCPINEKPPFCDACVYGK

Query:  AHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGEYRPLTSHLKA
         HLLPF  + +    PL L++SD+WGP+P+ S +G+++Y+ F+DDF+R T+IFPLK KS+    F+ FK L EN+F++KIK +Q D GGEY+ +      
Subjt:  AHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGEYRPLTSHLKA

Query:  LGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCFPCLRPYQKHK
         GI FR  CP+TS+QNGR ERKHRH+ E+GLTLLAQA +PL++WWEAFS+A YLINRLP+      SP+   F R+PDY  LK FGC+C+PCL+PY +HK
Subjt:  LGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCFPCLRPYQKHK

Query:  FDFHSIKCTFLGYDESYKGYICLSPQGRRYISRHVLFNEVEFPYIS--QPTTQPISSPNNSMSYCLVPPPTNHFNPVPTISTSNVQAAPPPSLVDTPPAS
          FH+ +C F+GY  S+KGY C++  GR ++SRHV+FNE  FP+      T  P+ +  ++ S  L         P  +   +   A  P +   +   +
Subjt:  FDFHSIKCTFLGYDESYKGYICLSPQGRRYISRHVLFNEVEFPYIS--QPTTQPISSPNNSMSYCLVPPPTNHFNPVPTISTSNVQAAPPPSLVDTPPAS

Query:  NIV-----------VSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIPTHISTHPMQTRAKCGIFKPKI-FQAQISPPTDLCEPTSVKQA
        + +           V  S   V+      + + +   V       ++ +  +Q  +   + +TH M+TR+K GI KPKI +       ++  EP SVK+A
Subjt:  NIV-----------VSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIPTHISTHPMQTRAKCGIFKPKI-FQAQISPPTDLCEPTSVKQA

Query:  LLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWIVRQ
        L    WK AMD EY+AL+ N TW+LVP     N++ +KW+F+ K+  DGSI+R KARLVAKGF QT  +D+ ETFSPV+K  TV++ILT+AV  NW VRQ
Subjt:  LLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWIVRQ

Query:  LDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGHQIFLLVYVDDILITGTSPTL
        LDINNAFLNG L E V+M QPEGY+D  KP ++CKL KA+YGLKQAPRAWYD L+  L  W FQN+K D+SLF+     H  FLL+YVDDI++TG++   
Subjt:  LDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGHQIFLLVYVDDILITGTSPTL

Query:  IDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTIGALQYLSHTRPDIA
        ++ F  +L   ++LKDLG LHYFLG+EV R+ SGM L Q +YI D+L +  +E+   CPTP V G    A +G  +++PT YR  IGALQYL++TRPDIA
Subjt:  IDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTIGALQYLSHTRPDIA

Query:  FIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVVARSSAESEYRAMAQ
        F VNKLSQ++  PT +HWQ +KR+LRYLQGT ++ LHI P +NL + G+ DADWA   DDR+S G  C+F G +LVSW+S+KQ VV+RSS ESEYR++A 
Subjt:  FIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVVARSSAESEYRAMAQ

Query:  ATAEI-------------------TWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTK
          AE+                   +    LL EL L      VLW DN+SA +LASNPV H+R+KHIE+D+H+IRD+VL   + + YVP+ DQ+AD LTK
Subjt:  ATAEI-------------------TWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTK

Query:  PLSSAVFQVFRSKLGV-FSPS
        PL    F + R KLGV  SPS
Subjt:  PLSSAVFQVFRSKLGV-FSPS

A0A438FWJ3 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0043.17Show/hide
Query:  SVTSVPAIQTG----NNGSPSLGNPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFS
        S T    +Q G    + G+       N  L +KLD  NYILW++ +  ++     E  I G    PP       TS+  TNP+   W   D+ +L W++S
Subjt:  SVTSVPAIQTG----NNGSPSLGNPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFS

Query:  TMSPSVASEVINLQTSREVWKALEELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINS-
        +++P +  +++  Q+S   W ALE ++ A++++RV QLR   QTTRK ++ M EY+ ++K  ++NLA  GEP+++   I  +L GL +++  I+  + + 
Subjt:  TMSPSVASEVINLQTSREVWKALEELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINS-

Query:  QGPMSWQEFHATLVTFENTLLRLKVINLDGDASANFVVANQNNNIQSSNSSQERNSEETIPLKT------VSEDMDEIE-GGVMGHSALVCYYRYD----
        +  MS    H+ L+T E  L     +  D   SAN       +     +S Q R S       T       S+   + +  G  GH+ + CY+R+D    
Subjt:  QGPMSWQEFHATLVTFENTLLRLKVINLDGDASANFVVANQNNNIQSSNSSQERNSEETIPLKT------VSEDMDEIE-GGVMGHSALVCYYRYD----

Query:  ---PPTVGQQHNSP----RPAANIATAEVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVP
           P     Q N P    +  A +A+   I+D++W+ D+GAT+H +  +  L+    Y G   ++VGNG  L+I H G +   ++  SK   L+++LHVP
Subjt:  ---PPTVGQQHNSP----RPAANIATAEVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVP

Query:  SIQKNLISISKLTADNNLFVEFHSHMCFVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPS
         I  NLIS+S+  ADNN F EFH    FVKD+ T K++LQG L+ GLY+      P +      F  +    V S+  + S  S + + +LWH RLGHP+
Subjt:  SIQKNLISISKLTADNNLFVEFHSHMCFVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPS

Query:  NKILHSVVKSCNL--QCPINEKPPFCDACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLH
        N IL  ++ SCN+  QC    K   C AC + K+H LPF+ + +  + PL L+++DLWGP+ + S  G R++I F+DDF+R ++I+PL +K +A S F+ 
Subjt:  NKILHSVVKSCNL--QCPINEKPPFCDACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLH

Query:  FKKLIENKFDRKIKTLQTDWGGEYRPLTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLS
        FK L+EN+F+ +I+ L++D GGE++  +S+L   GI  +  CP+T EQNGR ERK RHI+E GL LLA ASLP KFW  AF +A +LINRLPT  L+  S
Subjt:  FKKLIENKFDRKIKTLQTDWGGEYRPLTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLS

Query:  PFEKAFGRKPDYKFLKVFGCSCFPCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLSP-QGRRYISRHVLFNEVEFPYISQPTTQPISSPNNSMSYCLVP
        PF+  FG+ P+Y   K+FGC C+P +RPY K+K  + S +C FLGY  ++KGY+CL+P  GR Y++RHV+F+E  FP+         S+P+ S S   +P
Subjt:  PFEKAFGRKPDYKFLKVFGCSCFPCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLSP-QGRRYISRHVLFNEVEFPYISQPTTQPISSPNNSMSYCLVP

Query:  PPTNHFNPVPTISTSNVQAAPPPSLVDTPPASNIVVSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIPTHISTHPMQTRAKCGIFKPKI
         P   F P  +   S++++   PS   +PP +N+   PSS           ++S   ++ +    +++  PH      PT  + HPM TRAK GI K K+
Subjt:  PPTNHFNPVPTISTSNVQAAPPPSLVDTPPASNIVVSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIPTHISTHPMQTRAKCGIFKPKI

Query:  FQAQISPPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIK
        + +     + + EPT+  QA+    W  AM+ E+ AL  N TW LVPP  + N++G KWV++LK+ PDG++ RYKARLVA+GF QT  +DY ETFSPV+K
Subjt:  FQAQISPPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIK

Query:  PPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGH
          T+++IL +A+S NW V QLD+ NAFL+G L E V+M QP G+++   P++VCKL+KALYGLKQAPRAWY++L   L  W FQ S+ADSS+F + +   
Subjt:  PPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGH

Query:  QIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPT
         + LL+YVDDIL+TG+S   +  FIT+L + FAL+DLG+++YFLGIEV R+ +   L+Q +Y  DLL R  +   KP  TP ++G  LS   G P +  T
Subjt:  QIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPT

Query:  QYRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSS
         YRST+GALQYL+ TRPDI+F VNK  QF+  PTT HW AVKR+LRYL+GT+ YG+ +   ++L + GY+DADWA CPDDRRS G Y IF G +LVSWSS
Subjt:  QYRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSS

Query:  KKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLT
         KQ VV+RSSAESEYRA+A AT+E+ W+  +L ELCL   +P +LW DN SA  LA+NPVFH+RTKHIE+D+HFIRD VL K L ++Y+PS +Q+AD  T
Subjt:  KKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLT

Query:  KPLSSAVFQVFRSKLGVF-SPSRLRGD
        K +SS+ F  FR+KL V  SP  LRGD
Subjt:  KPLSSAVFQVFRSKLGVF-SPSRLRGD

A0A803PM38 Uncharacterized protein0.0e+0043.78Show/hide
Query:  PSLGNPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQE-----ATTSTATTNPEREKWTTTDQTLLGWLFSTMSPSVASEVINLQ
        P  G+  N   A+KLD  N+ LW+++V AI+RG +L+G++ GT   P           + +S    NP  E+W   DQ LLGWL+ +M+  +A EV+   
Subjt:  PSLGNPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQE-----ATTSTATTNPEREKWTTTDQTLLGWLFSTMSPSVASEVINLQ

Query:  TSREVWKALEELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQGPMSWQEFHATLVT
        +S  +W ALEEL+GA +K+++++ RT +QT RK  + M +YL Q +++++ LALAGEP   + L+S+VL+GLD E+LP++  I ++G  +WQ+    L++
Subjt:  TSREVWKALEELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQGPMSWQEFHATLVT

Query:  FENTLLRL-------KVINLDGDASANFV---------VANQNNNIQ--------SSNSSQERNSEETIPLKTVSEDMDEIEGGVMGHSALVCYYRYDPP
         ++ + RL       K+  +  + SA+             N NNN +        S+N S+ R    + P  T          G  GHSA  CY R    
Subjt:  FENTLLRL-------KVINLDGDASANFV---------VANQNNNIQ--------SSNSSQERNSEETIPLKTVSEDMDEIEGGVMGHSALVCYYRYDPP

Query:  TVGQQHNSPRPAANIATAEVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVPSIQKNLISI
                                      GA+NH T+E+  +N++ EY GK  + V NG++L I H+G   +Q    S P+ LK+ILHVPSI KNL+SI
Subjt:  TVGQQHNSPRPAANIATAEVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVPSIQKNLISI

Query:  SKLTADNNLFVEFHSHMCFVKDKQTGKVVLQGVLKDGLYQL-------NLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSF-SLWHKRLGHPSN
        SKLT+DNN+ VEF S +CFVKDK+TG+VVL+G LKDGLYQ        +++SN   S P   FS      V S   KP       S    WH+RLGHPS 
Subjt:  SKLTADNNLFVEFHSHMCFVKDKQTGKVVLQGVLKDGLYQL-------NLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSF-SLWHKRLGHPSN

Query:  KILHSVVKSCNLQCPINEKPPFCDACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKK
        ++L +V+   N++  IN    FCDAC  GK+H LPF   P   TAPLELV++D+WGPSP+ S   +R+YI F+DDF+R T+I+PLKAKSEA + F+ FK 
Subjt:  KILHSVVKSCNLQCPINEKPPFCDACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKK

Query:  LIENKFDRKIKTLQTDWGGEYRPLTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLSPFE
        L+EN+F+ ++K +QTDWGGEY+         GI F+HPCPHTS QNGR ERKHRHIVEMGLTLLAQA +P K+WW+AF +A YLINRLPTP L   +PFE
Subjt:  LIENKFDRKIKTLQTDWGGEYRPLTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLSPFE

Query:  KAFGRKPDYKFLKVFGCSCFPCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLSPQGRRYISRHVLFNEVEFPYISQPTTQPISSPNNSMSYCLVPPPTN
          F ++PDYKFLKVFG SCFPCLR YQ HKF FHS KC  LGY + +KGY CLS  GR YISR V+FNE EFP+ S       + P   +S  +     +
Subjt:  KAFGRKPDYKFLKVFGCSCFPCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLSPQGRRYISRHVLFNEVEFPYISQPTTQPISSPNNSMSYCLVPPPTN

Query:  HFNPVPTISTSNVQAAPPPSLVD-----TPPASNIVVSPSSAAVSPPTHML---------------RSVSSSPIVPILTDVVTSPSPHLQPPSIPTHIST
         F    + S ++  ++   +  D     TP  S +V   S+   +   H++               +   ++ +      + TS S H    ++   +ST
Subjt:  HFNPVPTISTSNVQAAPPPSLVD-----TPPASNIVVSPSSAAVSPPTHML---------------RSVSSSPIVPILTDVVTSPSPHLQPPSIPTHIST

Query:  HPMQTRAKCGIFKPKIFQAQISPPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFH
        HPM TRAK GIFKPK +  Q     +  EP S+++AL    W +AM SE  AL  NGTW LVP  PH +++ NKWV++ K + DGS QR KARLVAKGF 
Subjt:  HPMQTRAKCGIFKPKIFQAQISPPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFH

Query:  QTPAIDYHETFSPVIKPPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQ
        Q P +D+ ETFSPVIK  TV+++L++AV+K W VRQLDINNAFLNGH+ E +YM QP G+ D  KP +VCKL K++YGL+QAPRAW+D+LK  L  WKF+
Subjt:  QTPAIDYHETFSPVIKPPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQ

Query:  NSKADSSLFYYFANGHQIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVV
        NSKADSSLF+   + + I +L+YVDDI+ITG +  ++  FI KL   FALKDLG LHYFLGIEV R+++GM L+Q +YI +LL ++ + ++K CPTP   
Subjt:  NSKADSSLFYYFANGHQIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVV

Query:  GSHLSATQGTPLASPTQYRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSI
        G  LS   G  L +PT+YR                                                                           +DRRS+
Subjt:  GSHLSATQGTPLASPTQYRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSI

Query:  GAYCIFYGTSLVSWSSKKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTL
           C++ G +L+SWSS+KQ VV+RSS ESEYRA+AQ  AE+TW+ +LL EL    PA  ++W DNM A +LASNPV+H+RTKHIE+D+HF+RDK++ K L
Subjt:  GAYCIFYGTSLVSWSSKKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTL

Query:  EVRYVPSHDQLADFLTKPLSSAVFQVFRSKLGVFS-PSRLRGDVKD
        EVRY+PS +Q+AD LTK L+        SKLGV   P  LRG+V++
Subjt:  EVRYVPSHDQLADFLTKPLSSAVFQVFRSKLGVFS-PSRLRGDVKD

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.8e-11627.61Show/hide
Query:  DEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWKALEELYGAANKSR
        D + Y +WK  + A+L  Q +   + G                   N   + W   ++     +   +S S  +   +  T+R++ + L+ +Y   + + 
Subjt:  DEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWKALEELYGAANKSR

Query:  VNQLRTGLQTTR-KNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQGPMSWQEFHATLVTFENTLL--RLKVINLDGDA
           LR  L + +  + M +  +     E    L  AG  I     IS +L  L S +  I+  I      +  E + TL   +N LL   +K+ N   D 
Subjt:  VNQLRTGLQTTR-KNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQGPMSWQEFHATLVTFENTLL--RLKVINLDGDA

Query:  SANFV--VANQNNNIQSSNSSQERNSEETIPLKTVSEDMDEIEG-GVMGHSALVCYY-----------RYDPPTVGQQHNSPRPAANIATAEVIADQSWY
        S   +  + + NNN   +N  + R ++     K  S+   +    G  GH    C++                     H        +    V+ +  + 
Subjt:  SANFV--VANQNNNIQSSNSSQERNSEETIPLKTVSEDMDEIEG-GVMGHSALVCYY-----------RYDPPTVGQQHNSPRPAANIATAEVIADQSWY

Query:  ADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESL------IQNNCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVEFHSHMCFVK
         DSGA++H       +N +S Y    S+ V    K+ +A  GE +      I        I L+ +L       NL+S+ +L  +  + +EF        
Subjt:  ADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESL------IQNNCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVEFHSHMCFVK

Query:  DKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPSN-KIL----------HSVVKSCNLQCPINE
        DK    +   G++      +  NS    + P + F    Q + ++A  K        +F LWH+R GH S+ K+L           S++ +  L C I  
Subjt:  DKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPSN-KIL----------HSVVKSCNLQCPINE

Query:  KPPFCDACVYGKAHLLPFHNAPNIT--TAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTD
            C+ C+ GK   LPF    + T    PL +V+SD+ GP    +L+   +++ F+D FT     + +K KS+  S F  F    E  F+ K+  L  D
Subjt:  KPPFCDACVYGKAHLLPFHNAPNIT--TAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTD

Query:  WGGEYRPLTSHLKAL----GIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTL--HGLSPFEKAFGRKPDYK
         G EY  L++ ++      GI++    PHT + NG  ER  R I E   T+++ A L   FW EA  +ATYLINR+P+  L     +P+E    +KP  K
Subjt:  WGGEYRPLTSHLKAL----GIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTL--HGLSPFEKAFGRKPDYK

Query:  FLKVFGCSCFPCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLSPQGRRYI--------------SRHVLFNEV-----------EFPYISQPTTQ---P
         L+VFG + +  ++  Q  KFD  S K  F+GY+ +  G+        ++I              SR V F  V            FP  S+   Q   P
Subjt:  FLKVFGCSCFPCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLSPQGRRYI--------------SRHVLFNEV-----------EFPYISQPTTQ---P

Query:  ISSPNNSMSYCLVPPPTNHFNPVPTISTSNVQAAPPPS---------LVDTPPASNIVVSPS-----------SAAVSPPTHMLRSVSSSPIVPILTDVV
          S        L     +     P  S   +Q   P           L D+  ++   ++ S           S     P     S ++  +  I  D  
Subjt:  ISSPNNSMSYCLVPPPTNHFNPVPTISTSNVQAAPPPS---------LVDTPPASNIVVSPS-----------SAAVSPPTHMLRSVSSSPIVPILTDVV

Query:  TSPSPHLQPPSIPTHISTHPMQTRAKCGIFKPKIFQAQISPPTDLCEPTSVKQALLSHE---WKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRL
        T              + T P  +  +      K+     +   D+  P S  +     +   W+ A+++E  A   N TW++     + N+V ++WVF +
Subjt:  TSPSPHLQPPSIPTHISTHPMQTRAKCGIFKPKIFQAQISPPTDLCEPTSVKQALLSHE---WKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRL

Query:  KFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGL
        K++  G+  RYKARLVA+GF Q   IDY ETF+PV +  + + IL+L +  N  V Q+D+  AFLNG L E +YM  P+G         VCKL+KA+YGL
Subjt:  KFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGL

Query:  KQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGH---QIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQR
        KQA R W++  +Q L   +F NS  D  + Y    G+    I++L+YVDD++I     T +++F   L   F + DL  + +F+GI +      + L+Q 
Subjt:  KQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGH---QIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQR

Query:  RYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTIGALQYLS-HTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHIS
         Y+  +L +  +E+     TP     +          + T  RS IG L Y+   TRPD+   VN LS++     ++ WQ +KRVLRYL+GT+D  L   
Subjt:  RYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTIGALQYLS-HTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHIS

Query:  PCSNLT----LTGYSDADWACCPDDRRSIGAYCI-FYGTSLVSWSSKKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAIS
           NL     + GY D+DWA    DR+S   Y    +  +L+ W++K+Q  VA SS E+EY A+ +A  E  WL  LL  + +    P  ++ DN   IS
Subjt:  PCSNLT----LTGYSDADWACCPDDRRSIGAYCI-FYGTSLVSWSSKKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAIS

Query:  LASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTKPLSSAVFQVFRSKLGV
        +A+NP  H R KHI++  HF R++V    + + Y+P+ +QLAD  TKPL +A F   R KLG+
Subjt:  LASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTKPLSSAVFQVFRSKLGV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.4e-14230.09Show/hide
Query:  EKWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWKALEELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFS---ENLALAGEPISNSYLISS
        E W   D+     +   +S  V + +I+  T+R +W  LE LY   +K+  N+L    Q    +      +L+ +  F+     LA  G  I        
Subjt:  EKWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWKALEELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFS---ENLALAGEPISNSYLISS

Query:  VLAGLDSEFLPIMCQINSQGPMSWQEFHATLVTFENTLLRLKVINLDGDASANFVVANQNNNIQSSNSSQERNSEETIPLKTVSEDMDEIEGGVMGHSAL
        +L  L S +  +   I   G  + +    T     N  +R K  N      A        +  +SSN+     +      ++ S   +       GH   
Subjt:  VLAGLDSEFLPIMCQINSQGPMSWQEFHATLVTFENTLLRLKVINLDGDASANFVVANQNNNIQSSNSSQERNSEETIPLKTVSEDMDEIEGGVMGHSAL

Query:  VC-YYRYDPPTVGQQHNSPRPAANIATAEVIA---------------DQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQN
         C   R        Q N    AA +   + +                +  W  D+ A++H T  + +L  +   G   ++ +GN S  KIA +G+  I+ 
Subjt:  VC-YYRYDPPTVGQQHNSPRPAANIATAEVIA---------------DQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQN

Query:  NCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVEFHSHMCFVKDKQT--GKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSF
        N +   + LK + HVP ++ NLIS   L  D      + S+    K + T    V+ +GV +  LY+ N              ++  Q  + +A      
Subjt:  NCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVEFHSHMCFVKDKQT--GKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSF

Query:  PSQSLSFSLWHKRLGHPSNKILHSVVKSCNLQCPINEKPPFCDACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTY
            +S  LWHKR+GH S K L  + K   +          CD C++GK H + F  +       L+LVYSD+ GP  + S+ G +++++F+DD +R  +
Subjt:  PSQSLSFSLWHKRLGHPSNKILHSVVKSCNLQCPINEKPPFCDACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTY

Query:  IFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGEY--RPLTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFS
        ++ LK K +    F  F  L+E +  RK+K L++D GGEY  R    +  + GI      P T + NG  ER +R IVE   ++L  A LP  FW EA  
Subjt:  IFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGEY--RPLTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFS

Query:  SATYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCFPCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLSPQGRRYI-SRHVLFNEVEFPYISQP
        +A YLINR P+  L    P      ++  Y  LKVFGC  F  +   Q+ K D  SI C F+GY +   GY    P  ++ I SR V+F E E     + 
Subjt:  SATYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCFPCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLSPQGRRYI-SRHVLFNEVEFPYISQP

Query:  TTQPISSPNNSMSYCLVPPPTNHFNPVPTISTS---NVQAAPPPSLVDTPPASNIVVSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIP
                 N +    V  P+   NP    ST+   + Q   P  +++        +      V  PT                        H       
Subjt:  TTQPISSPNNSMSYCLVPPPTNHFNPVPTISTS---NVQAAPPPSLVDTPPASNIVVSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIP

Query:  THISTHPMQTRAKCGIFKPKIFQAQISPPTDLCEPTSVKQALLSHEWKS----AMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYK
              P++   +  +   +    +    +D  EP S+K+ +LSH  K+    AM  E ++L  NGT+ LV        +  KWVF+LK   D  + RYK
Subjt:  THISTHPMQTRAKCGIFKPKIFQAQISPPTDLCEPTSVKQALLSHEWKS----AMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYK

Query:  ARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLK
        ARLV KGF Q   ID+ E FSPV+K  +++ IL+LA S +  V QLD+  AFL+G L E +YM QPEG+    K   VCKL+K+LYGLKQAPR WY +  
Subjt:  ARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLK

Query:  QCLHRWKFQNSKADSSLFY-YFANGHQIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVT--RNSSGMVLTQRRYIHDLLHRVGL
          +    +  + +D  +++  F+  + I LL+YVDD+LI G    LI      L   F +KDLG     LG+++   R S  + L+Q +YI  +L R  +
Subjt:  QCLHRWKFQNSKADSSLFY-YFANGHQIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVT--RNSSGMVLTQRRYIHDLLHRVGL

Query:  EHIKPCPTPAVVGSHL--------SATQGTPLASPTQYRSTIGALQY-LSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSN
        ++ KP  TP  +  HL        +  +     +   Y S +G+L Y +  TRPDIA  V  +S+FL+ P  +HW+AVK +LRYL+GT    L     S+
Subjt:  EHIKPCPTPAVVGSHL--------SATQGTPLASPTQYRSTIGALQY-LSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSN

Query:  LTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHS
          L GY+DAD A   D+R+S   Y   +    +SW SK Q  VA S+ E+EY A  +   E+ WL   L EL  L     V++ D+ SAI L+ N ++H+
Subjt:  LTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHS

Query:  RTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTKPLSSAVFQVFRSKLGVFS
        RTKHI++  H+IR+ V  ++L+V  + +++  AD LTK +    F++ +  +G+ S
Subjt:  RTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTKPLSSAVFQVFRSKLGVFS

P92519 Uncharacterized mitochondrial protein AtMg008109.3e-5244.25Show/hide
Query:  IFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQ
        ++LL+YVDDIL+TG+S TL++  I +L + F++KDLG +HYFLGI++  + SG+ L+Q +Y   +L+  G+   KP  TP  +  + S+        P+ 
Subjt:  IFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQ

Query:  YRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSK
        +RS +GALQYL+ TRPDI++ VN + Q +  PT   +  +KRVLRY++GT+ +GL+I   S L +  + D+DWA C   RRS   +C F G +++SWS+K
Subjt:  YRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSK

Query:  KQTVVARSSAESEYRAMAQATAEITW
        +Q  V+RSS E+EYRA+A   AE+TW
Subjt:  KQTVVARSSAESEYRAMAQATAEITW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.6e-28340.42Show/hide
Query:  KLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWKALEELYGAANK
        KL   NY++W   V A+  G +L GF+ G+T  PPA     T +    NP+  +W   D+ +   +   +S SV   V    T+ ++W+ L ++Y   + 
Subjt:  KLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWKALEELYGAANK

Query:  SRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQG-PMSWQEFHATLVTFENTLLRLK---VINLD
          V QLRT L+   K    + +Y+  +    + LAL G+P+ +   +  VL  L  E+ P++ QI ++  P +  E H  L+  E+ +L +    VI + 
Subjt:  SRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQG-PMSWQEFHATLVTFENTLLRLK---VINLD

Query:  GDASA--NFVVANQNNNIQSSNSSQER-NSEETIPLKTVSEDMDEIEG------------GVMGHSALVC--YYRYDPPTVGQQHNSP----RPAANIAT
         +A +  N    N NNN   +N    R N+  + P +  S +                  GV GHSA  C     +      QQ  SP    +P AN+A 
Subjt:  GDASA--NFVVANQNNNIQSSNSSQER-NSEETIPLKTVSEDMDEIEG------------GVMGHSALVC--YYRYDPPTVGQQHNSP----RPAANIAT

Query:  AEVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVEFHSHM
            +  +W  DSGAT+H T++  NL++   Y G   ++V +GS + I+H G + +     S+P+ L  IL+VP+I KNLIS+ +L   N + VEF    
Subjt:  AEVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVEFHSHM

Query:  CFVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPSNKILHSVVKSCNLQCPINEKPPF--C
          VKD  TG  +LQG  KD LY+  + S              SQP  L AS     PS   + S WH RLGHP+  IL+SV+ + +L   +N    F  C
Subjt:  CFVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPSNKILHSVVKSCNLQCPINEKPPF--C

Query:  DACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGEYRP
          C+  K++ +PF  +   +T PLE +YSD+W  SP+ S + YR+Y+ F+D FTR T+++PLK KS+   TF+ FK L+EN+F  +I T  +D GGE+  
Subjt:  DACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGEYRP

Query:  LTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCFPCL
        L  +    GI+     PHT E NG  ERKHRHIVE GLTLL+ AS+P  +W  AF+ A YLINRLPTP L   SPF+K FG  P+Y  L+VFGC+C+P L
Subjt:  LTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCFPCL

Query:  RPYQKHKFDFHSIKCTFLGYDESYKGYICLSPQ-GRRYISRHVLFNEVEFPYISQ-PTTQPISSPNNSMSYCLVP----PPTNHFNPVPTISTSNVQAAP
        RPY +HK D  S +C FLGY  +   Y+CL  Q  R YISRHV F+E  FP+ +   T  P+       S    P    P      P P+ S  +  AA 
Subjt:  RPYQKHKFDFHSIKCTFLGYDESYKGYICLSPQ-GRRYISRHVLFNEVEFPYISQ-PTTQPISSPNNSMSYCLVP----PPTNHFNPVPTISTSNVQAAP

Query:  PPSLVDTP------PASNIVVSPSSAAVSPP----------------------THMLRSVS---------------------SSPIVPILTDVVTSPSPH
        PPS    P       +SN+  S SS+  S P                      TH  ++ S                     SS   P  T   +S S  
Subjt:  PPSLVDTP------PASNIVVSPSSAAVSPP----------------------THMLRSVS---------------------SSPIVPILTDVVTSPSPH

Query:  LQPPSIPTH----------------ISTHPMQTRAKCGIFKPKIFQAQISPPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSP-HYNVV
          PPSI  H                ++TH M TRAK GI KP    +         EP +  QAL    W++AM SE  A +GN TW LVPP P H  +V
Subjt:  LQPPSIPTH----------------ISTHPMQTRAKCGIFKPKIFQAQISPPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSP-HYNVV

Query:  GNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCK
        G +W+F  K++ DGS+ RYKARLVAKG++Q P +DY ETFSPVIK  +++++L +AV ++W +RQLD+NNAFL G L + VYM QP G++D  +P YVCK
Subjt:  GNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCK

Query:  LHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGHQIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGM
        L KALYGLKQAPRAWY  L+  L    F NS +D+SLF        +++LVYVDDILITG  PTL+ + +  L   F++KD   LHYFLGIE  R  +G+
Subjt:  LHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGHQIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGM

Query:  VLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYG
         L+QRRYI DLL R  +   KP  TP      LS   GT L  PT+YR  +G+LQYL+ TRPDI++ VN+LSQF+  PT +H QA+KR+LRYL GT ++G
Subjt:  VLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYG

Query:  LHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISL
        + +   + L+L  YSDADWA   DD  S   Y ++ G   +SWSSKKQ  V RSS E+EYR++A  ++E+ W+ +LL EL +    P V++ DN+ A  L
Subjt:  LHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISL

Query:  ASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTKPLSSAVFQVFRSKLGV
         +NPVFHSR KHI +D HFIR++V    L V +V +HDQLAD LTKPLS   FQ F SK+GV
Subjt:  ASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTKPLSSAVFQVFRSKLGV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.5e-27539.5Show/hide
Query:  KLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWKALEELYGAANK
        KL   NY++W   V A+  G +L GF+ G+T  PPA     T +    NP+  +W   D+ +   +   +S SV   V    T+ ++W+ L ++Y   + 
Subjt:  KLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFSTMSPSVASEVINLQTSREVWKALEELYGAANK

Query:  SRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQG-PMSWQEFHATLVTFENTLLRLKVINLDGDA
          V QLR               ++ +     + LAL G+P+ +   +  VL  L  ++ P++ QI ++  P S  E H  L+  E+ LL L    +    
Subjt:  SRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQG-PMSWQEFHATLVTFENTLLRLKVINLDGDA

Query:  SANFVV----------------ANQNNNIQSSNSSQERNSEETIPLKTVSEDMDEIE-GGVMGHSALVC--YYRYDPPTVGQQHNSP----RPAANIATA
        +AN V                  N NNN   SNS Q  +S      +     +   +   V GHSA  C   +++   T  QQ  SP    +P AN+A  
Subjt:  SANFVV----------------ANQNNNIQSSNSSQERNSEETIPLKTVSEDMDEIE-GGVMGHSALVC--YYRYDPPTVGQQHNSP----RPAANIATA

Query:  EVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVEFHSHMC
              +W  DSGAT+H T++  NL+    Y G   +++ +GS + I H G + +  +  S+ + L K+L+VP+I KNLIS+ +L   N + VEF     
Subjt:  EVIADQSWYADSGATNHTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVEFHSHMC

Query:  FVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPSNKILHSVVKSCNLQCPI---NEKPPFC
         VKD  TG  +LQG  KD LY+  + S              SQ   + AS     P    + S WH RLGHPS  IL+SV+ + +L  P+   + K   C
Subjt:  FVKDKQTGKVVLQGVLKDGLYQLNLNSNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPSNKILHSVVKSCNLQCPI---NEKPPFC

Query:  DACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGEYRP
          C   K+H +PF N+   ++ PLE +YSD+W  SP+ S++ YR+Y+ F+D FTR T+++PLK KS+   TF+ FK L+EN+F  +I TL +D GGE+  
Subjt:  DACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCSLNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGEYRP

Query:  LTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCFPCL
        L  +L   GI+     PHT E NG  ERKHRHIVEMGLTLL+ AS+P  +W  AFS A YLINRLPTP L   SPF+K FG+ P+Y+ LKVFGC+C+P L
Subjt:  LTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLKFWWEAFSSATYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCFPCL

Query:  RPYQKHKFDFHSIKCTFLGYDESYKGYICLS-PQGRRYISRHVLFNEVEFPY------ISQPTTQPISSPNNSMSYCLVPPPTNHFNPVPTISTSNVQAA
        RPY +HK +  S +C F+GY  +   Y+CL  P GR Y SRHV F+E  FP+      +S    Q   S  N  S+  + P T    P P     ++  +
Subjt:  RPYQKHKFDFHSIKCTFLGYDESYKGYICLS-PQGRRYISRHVLFNEVEFPY------ISQPTTQPISSPNNSMSYCLVPPPTNHFNPVPTISTSNVQAA

Query:  PPPSLVDTP------PASNI----VVSPSSAAVSPPTH-----------MLRSVSSSPIV----------------------PILTDVVTSPSPHLQPPS
        P P    +P       +SN+    + SPSS+  + P+H              S S+SPI+                      PI +  + +PS  +  P+
Subjt:  PPPSLVDTP------PASNI----VVSPSSAAVSPPTH-----------MLRSVSSSPIV----------------------PILTDVVTSPSPHLQPPS

Query:  IPTH-------------------------ISTHPMQTRAKCGIFKPKIFQAQISPPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLV-PPSPH
         P+                          ++TH M TRAK GI KP    +  +      EP +  QA+    W+ AM SE  A +GN TW LV PP P 
Subjt:  IPTH-------------------------ISTHPMQTRAKCGIFKPKIFQAQISPPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLV-PPSPH

Query:  YNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPT
          +VG +W+F  KF+ DGS+ RYKARLVAKG++Q P +DY ETFSPVIK  +++++L +AV ++W +RQLD+NNAFL G L + VYM QP G+VD  +P 
Subjt:  YNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPT

Query:  YVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGHQIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRN
        YVC+L KA+YGLKQAPRAWY  L+  L    F NS +D+SLF        I++LVYVDDILITG    L+   +  L   F++K+   LHYFLGIE  R 
Subjt:  YVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGHQIFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRN

Query:  SSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGT
          G+ L+QRRY  DLL R  +   KP  TP      L+   GT L  PT+YR  +G+LQYL+ TRPD+++ VN+LSQ++  PT DHW A+KRVLRYL GT
Subjt:  SSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGT

Query:  VDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMS
         D+G+ +   + L+L  YSDADWA   DD  S   Y ++ G   +SWSSKKQ  V RSS E+EYR++A  ++E+ W+ +LL EL +    P V++ DN+ 
Subjt:  VDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVVARSSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMS

Query:  AISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTKPLSSAVFQVFRSKLGV
        A  L +NPVFHSR KHI LD HFIR++V    L V +V +HDQLAD LTKPLS   FQ F  K+GV
Subjt:  AISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTKPLSSAVFQVFRSKLGV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.3e-11143.37Show/hide
Query:  EPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAV
        EP++  +A     W  AMD E  A+    TW +    P+   +G KWV+++K++ DG+I+RYKARLVAKG+ Q   ID+ ETFSPV K  +VKLIL ++ 
Subjt:  EPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKLILTLAV

Query:  SKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYV----DPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGHQIFLLVYV
          N+ + QLDI+NAFLNG L E +YM  P GY     D   P  VC L K++YGLKQA R W+ +    L  + F  S +D + F        + +LVYV
Subjt:  SKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYV----DPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGHQIFLLVYV

Query:  DDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTIGA
        DDI+I   +   +D+  ++L + F L+DLG L YFLG+E+ R+++G+ + QR+Y  DLL   GL   KP   P       SA  G        YR  IG 
Subjt:  DDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTIGA

Query:  LQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVVAR
        L YL  TR DI+F VNKLSQF +AP   H QAV ++L Y++GTV  GL  S  + + L  +SDA +  C D RRS   YC+F GTSL+SW SKKQ VV++
Subjt:  LQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVVAR

Query:  SSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDK-VLGKTLEVRY--VPSHDQLADFLTKPLSS
        SSAE+EYRA++ AT E+ WL     EL L    PT+L+ DN +AI +A+N VFH RTKHIE D H +R++ V   TL   +      D   ++L+  L  
Subjt:  SSAESEYRAMAQATAEITWLHNLLHELCLLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDK-VLGKTLEVRY--VPSHDQLADFLTKPLSS

Query:  AVFQV
         +  +
Subjt:  AVFQV

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.4e-1023.96Show/hide
Query:  LAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFSTMSPSVASEVINLQ-TSREVWKALEELYG
        + + L++ NY +W+ +   +     + G I G++ P P  +              ++W   D  +  W++ T++ S+   +I +  T+R++W +LE L+ 
Subjt:  LAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFSTMSPSVASEVINLQ-TSREVWKALEELYG

Query:  AANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQGPM-SWQEFHATLVTFENTL
           ++R  Q    L+TT  +++ + EY  ++K  S+ L     PIS+  L+  +L GL  ++  I+  I  + P  S+ E  + L+  E+ L
Subjt:  AANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQGPM-SWQEFHATLVTFENTL

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.6e-1752.56Show/hide
Query:  YLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYC
        YL+ TRPD+ F VN+LSQF  A  T   QAV +VL Y++GTV  GL  S  S+L L  ++D+DWA CPD RRS+  +C
Subjt:  YLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYC

ATMG00810.1 DNA/RNA polymerases superfamily protein6.6e-5344.25Show/hide
Query:  IFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQ
        ++LL+YVDDIL+TG+S TL++  I +L + F++KDLG +HYFLGI++  + SG+ L+Q +Y   +L+  G+   KP  TP  +  + S+        P+ 
Subjt:  IFLLVYVDDILITGTSPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQ

Query:  YRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSK
        +RS +GALQYL+ TRPDI++ VN + Q +  PT   +  +KRVLRY++GT+ +GL+I   S L +  + D+DWA C   RRS   +C F G +++SWS+K
Subjt:  YRSTIGALQYLSHTRPDIAFIVNKLSQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSK

Query:  KQTVVARSSAESEYRAMAQATAEITW
        +Q  V+RSS E+EYRA+A   AE+TW
Subjt:  KQTVVARSSAESEYRAMAQATAEITW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)8.4e-2444.6Show/hide
Query:  MQTRAKCGIFKPKIFQAQISPPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQT
        M TR+K GI K     +     T   EP SV  AL    W  AM  E  AL  N TW LVPP  + N++G KWVF+ K H DG++ R KARLVAKGFHQ 
Subjt:  MQTRAKCGIFKPKIFQAQISPPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQT

Query:  PAIDYHETFSPVIKPPTVKLILTLA------VSKNWIVR
          I + ET+SPV++  T++ IL +A       S NW+ +
Subjt:  PAIDYHETFSPVIKPPTVKLILTLA------VSKNWIVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCCCAAGGATCCTCGATGAATGATAGTGTGACAAGTGTTCCTGCCATTCAAACTGGAAACAATGGTTCTCCATCATTGGGCAATCCGTTCAACTCAATACTTGC
AGTCAAGCTTGACGAAAAGAACTATATTTTGTGGAAATCAATTGTCATTGCCATACTTCGAGGACAGAAGCTTGAAGGTTTTATCCTTGGCACTACAAACCCTCCCCCAG
CAATCCAACAAGAAGCAACCACATCTACTGCAACAACAAATCCAGAGAGGGAAAAGTGGACAACAACTGACCAAACCTTACTTGGATGGCTTTTCAGCACCATGAGTCCT
AGTGTTGCGTCTGAGGTAATCAATCTACAAACTTCAAGAGAAGTATGGAAGGCTCTAGAAGAGTTATATGGAGCTGCAAATAAATCAAGAGTCAATCAATTGAGAACCGG
TCTTCAAACCACCAGAAAAAACAACATGAGGATGGGAGAGTATTTAGCCCAAATGAAGGAGTTCTCTGAAAATCTTGCACTGGCAGGTGAACCCATTTCAAATAGTTATC
TAATATCAAGTGTTCTTGCAGGTTTAGACAGTGAATTCTTGCCTATCATGTGTCAAATCAACTCTCAAGGACCCATGTCTTGGCAAGAATTTCATGCTACCTTAGTCACC
TTTGAAAACACATTGTTGAGGCTGAAAGTTATCAATCTGGATGGTGATGCCTCAGCCAATTTTGTTGTTGCGAATCAAAACAATAATATTCAATCATCCAACTCATCTCA
AGAGAGAAACTCAGAAGAAACAATACCTCTCAAAACAGTTTCAGAGGACATGGACGAAATAGAGGGAGGGGTGATGGGACATTCTGCACTTGTCTGCTACTATAGGTATG
ATCCTCCAACTGTAGGACAACAACACAATAGCCCACGTCCAGCTGCAAATATTGCCACTGCAGAAGTCATTGCAGACCAAAGCTGGTATGCTGACAGTGGCGCCACAAAT
CACACTACTGCTGAGCTTGCGAATCTGAACATTCAATCGGAATATGGTGGTAAGACTAGTCTTGTTGTTGGAAATGGCTCTAAGTTAAAAATAGCTCATGTGGGAGAATC
CTTGATACAAAATAATTGTCTTTCCAAACCAATTGCCTTAAAAAAGATTCTTCATGTGCCTTCTATTCAAAAGAACTTGATTAGCATCTCTAAACTTACTGCTGATAATA
ATCTCTTTGTTGAATTTCACTCACATATGTGTTTTGTGAAGGACAAGCAAACCGGGAAGGTGGTTCTTCAAGGGGTGCTTAAGGATGGATTGTATCAACTCAATCTCAAC
TCAAATCCTCCAAAATCTAGACCACAGTTGCTGTTTTCCCAAGCTTCTCAACCTCATGTTTTATCTGCTTCAGTCAAGCCCTCTTTTCCAAGTCAGTCCTTGTCCTTTAG
TTTGTGGCATAAGCGTTTAGGCCACCCTTCCAATAAAATCTTACATTCTGTTGTCAAGTCTTGTAACCTTCAATGTCCAATAAATGAGAAGCCACCATTTTGTGATGCTT
GTGTTTATGGAAAAGCTCATCTTCTGCCCTTTCATAATGCTCCCAATATCACCACTGCTCCTTTGGAGCTTGTTTACAGTGACCTATGGGGTCCTTCCCCTGTGTGTTCT
TTGAACGGCTACAGATTTTATATTAGCTTCCTTGATGACTTTACAAGACTAACCTACATATTCCCTCTCAAAGCTAAATCTGAGGCTACCTCCACATTCCTTCATTTCAA
GAAACTTATTGAGAACAAATTTGACAGAAAAATAAAAACTCTCCAAACTGATTGGGGTGGTGAATATCGCCCTCTTACATCCCATCTCAAAGCATTAGGAATTGCTTTTC
GCCACCCTTGCCCTCATACTAGTGAACAAAATGGTAGAGTTGAGCGCAAGCACCGCCACATCGTTGAAATGGGTCTTACGCTTCTTGCCCAAGCCTCTTTACCTCTCAAA
TTTTGGTGGGAAGCTTTCTCCTCGGCCACATATCTAATTAACCGCCTCCCCACTCCCACTCTCCATGGCCTATCTCCTTTTGAAAAAGCCTTTGGTCGTAAACCAGACTA
TAAGTTTCTCAAAGTCTTTGGGTGTTCATGTTTTCCTTGCTTGCGTCCCTATCAAAAACACAAGTTTGACTTCCACTCCATCAAATGCACATTTCTTGGCTATGATGAGT
CTTATAAAGGCTACATTTGCCTTAGTCCTCAAGGTCGTCGCTACATTTCCCGTCATGTACTCTTTAATGAAGTAGAATTTCCTTACATCTCTCAGCCAACGACCCAACCC
ATTTCTTCACCTAATAACTCCATGTCCTATTGTCTTGTTCCACCTCCTACCAATCACTTTAACCCAGTACCTACTATTTCCACATCTAATGTCCAAGCTGCACCACCACC
ATCCTTGGTTGATACACCTCCTGCCAGCAATATTGTGGTTTCACCTTCTTCGGCTGCTGTGTCACCACCAACGCATATGCTAAGGTCAGTTTCTTCTTCCCCTATTGTAC
CCATTTTAACTGATGTTGTCACCTCTCCTTCCCCTCATCTGCAACCACCATCTATTCCTACTCACATCTCTACACACCCCATGCAAACCCGAGCCAAATGTGGCATTTTT
AAACCAAAAATATTTCAAGCCCAAATCAGCCCTCCCACTGACCTTTGTGAGCCCACATCAGTAAAGCAAGCTCTTCTCTCTCATGAGTGGAAATCAGCAATGGATAGTGA
ATATCAGGCCCTCCTTGGCAATGGTACTTGGTCCCTTGTTCCTCCATCTCCTCATTATAACGTTGTTGGCAACAAGTGGGTGTTTCGTCTTAAGTTTCATCCTGATGGTA
GCATCCAACGCTACAAAGCTAGATTGGTTGCCAAGGGATTTCATCAAACACCCGCTATTGACTATCATGAGACCTTCAGCCCTGTTATAAAGCCTCCTACAGTCAAACTC
ATCCTCACATTAGCAGTCTCAAAGAATTGGATTGTTCGTCAACTTGACATTAACAACGCCTTCCTTAATGGACATCTTGCTGAGGCTGTATATATGGTCCAACCAGAAGG
CTATGTTGACCCAACCAAACCTACCTATGTATGCAAGCTCCATAAGGCACTATATGGTCTCAAACAAGCCCCACGAGCTTGGTATGATCGGTTGAAACAATGTTTACACC
GCTGGAAATTTCAGAACTCCAAGGCTGACAGTTCTCTTTTTTATTACTTTGCTAACGGTCACCAAATATTTCTTCTTGTCTATGTTGATGACATATTAATTACAGGAACA
AGCCCTACTTTGATTGACGACTTCATCACAAAGTTGGGCAACCTTTTCGCTTTAAAGGATCTTGGTCATCTTCATTACTTCCTTGGCATCGAGGTTACTCGTAACTCTAG
TGGTATGGTCCTTACACAACGACGCTATATTCATGATCTCTTACACAGAGTTGGACTTGAGCACATCAAACCTTGCCCCACACCAGCTGTTGTGGGAAGCCATCTCTCAG
CCACGCAAGGCACTCCACTTGCCAGTCCCACACAATATCGCAGTACTATCGGTGCTCTTCAATATCTCTCTCATACACGACCTGATATTGCCTTTATTGTCAATAAACTC
AGTCAGTTTCTTCAAGCCCCTACAACTGACCATTGGCAAGCTGTGAAACGGGTACTTCGCTATCTCCAAGGGACTGTTGATTACGGCCTACATATCTCCCCTTGCTCTAA
CCTCACTCTTACTGGATACTCAGATGCTGATTGGGCTTGTTGTCCAGATGATCGACGGTCCATTGGTGCTTATTGCATCTTTTATGGCACTTCCCTGGTTTCATGGTCCT
CTAAGAAGCAAACTGTTGTTGCTCGCTCCAGTGCTGAAAGTGAATATCGTGCCATGGCTCAAGCTACAGCTGAAATTACATGGCTCCACAATCTTCTTCATGAACTTTGC
TTGCTGCCGCCAGCTCCTACTGTACTTTGGAGTGACAATATGAGTGCTATTTCCTTAGCCTCCAACCCAGTTTTTCACTCACGCACAAAACATATTGAACTTGACGTACA
CTTCATTCGTGACAAGGTTCTGGGAAAAACGCTTGAGGTTCGTTATGTGCCATCTCATGATCAGCTAGCCGACTTCCTCACAAAGCCACTGTCTTCTGCTGTCTTTCAAG
TTTTTCGCTCCAAACTCGGTGTATTTTCACCCTCTCGTTTGAGGGGGGATGTCAAGGATTGCAATGAAACATTACCCATCCATCTGTCACAACCATCAGCAACCACATCT
ACAGAAGATTCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCCCAAGGATCCTCGATGAATGATAGTGTGACAAGTGTTCCTGCCATTCAAACTGGAAACAATGGTTCTCCATCATTGGGCAATCCGTTCAACTCAATACTTGC
AGTCAAGCTTGACGAAAAGAACTATATTTTGTGGAAATCAATTGTCATTGCCATACTTCGAGGACAGAAGCTTGAAGGTTTTATCCTTGGCACTACAAACCCTCCCCCAG
CAATCCAACAAGAAGCAACCACATCTACTGCAACAACAAATCCAGAGAGGGAAAAGTGGACAACAACTGACCAAACCTTACTTGGATGGCTTTTCAGCACCATGAGTCCT
AGTGTTGCGTCTGAGGTAATCAATCTACAAACTTCAAGAGAAGTATGGAAGGCTCTAGAAGAGTTATATGGAGCTGCAAATAAATCAAGAGTCAATCAATTGAGAACCGG
TCTTCAAACCACCAGAAAAAACAACATGAGGATGGGAGAGTATTTAGCCCAAATGAAGGAGTTCTCTGAAAATCTTGCACTGGCAGGTGAACCCATTTCAAATAGTTATC
TAATATCAAGTGTTCTTGCAGGTTTAGACAGTGAATTCTTGCCTATCATGTGTCAAATCAACTCTCAAGGACCCATGTCTTGGCAAGAATTTCATGCTACCTTAGTCACC
TTTGAAAACACATTGTTGAGGCTGAAAGTTATCAATCTGGATGGTGATGCCTCAGCCAATTTTGTTGTTGCGAATCAAAACAATAATATTCAATCATCCAACTCATCTCA
AGAGAGAAACTCAGAAGAAACAATACCTCTCAAAACAGTTTCAGAGGACATGGACGAAATAGAGGGAGGGGTGATGGGACATTCTGCACTTGTCTGCTACTATAGGTATG
ATCCTCCAACTGTAGGACAACAACACAATAGCCCACGTCCAGCTGCAAATATTGCCACTGCAGAAGTCATTGCAGACCAAAGCTGGTATGCTGACAGTGGCGCCACAAAT
CACACTACTGCTGAGCTTGCGAATCTGAACATTCAATCGGAATATGGTGGTAAGACTAGTCTTGTTGTTGGAAATGGCTCTAAGTTAAAAATAGCTCATGTGGGAGAATC
CTTGATACAAAATAATTGTCTTTCCAAACCAATTGCCTTAAAAAAGATTCTTCATGTGCCTTCTATTCAAAAGAACTTGATTAGCATCTCTAAACTTACTGCTGATAATA
ATCTCTTTGTTGAATTTCACTCACATATGTGTTTTGTGAAGGACAAGCAAACCGGGAAGGTGGTTCTTCAAGGGGTGCTTAAGGATGGATTGTATCAACTCAATCTCAAC
TCAAATCCTCCAAAATCTAGACCACAGTTGCTGTTTTCCCAAGCTTCTCAACCTCATGTTTTATCTGCTTCAGTCAAGCCCTCTTTTCCAAGTCAGTCCTTGTCCTTTAG
TTTGTGGCATAAGCGTTTAGGCCACCCTTCCAATAAAATCTTACATTCTGTTGTCAAGTCTTGTAACCTTCAATGTCCAATAAATGAGAAGCCACCATTTTGTGATGCTT
GTGTTTATGGAAAAGCTCATCTTCTGCCCTTTCATAATGCTCCCAATATCACCACTGCTCCTTTGGAGCTTGTTTACAGTGACCTATGGGGTCCTTCCCCTGTGTGTTCT
TTGAACGGCTACAGATTTTATATTAGCTTCCTTGATGACTTTACAAGACTAACCTACATATTCCCTCTCAAAGCTAAATCTGAGGCTACCTCCACATTCCTTCATTTCAA
GAAACTTATTGAGAACAAATTTGACAGAAAAATAAAAACTCTCCAAACTGATTGGGGTGGTGAATATCGCCCTCTTACATCCCATCTCAAAGCATTAGGAATTGCTTTTC
GCCACCCTTGCCCTCATACTAGTGAACAAAATGGTAGAGTTGAGCGCAAGCACCGCCACATCGTTGAAATGGGTCTTACGCTTCTTGCCCAAGCCTCTTTACCTCTCAAA
TTTTGGTGGGAAGCTTTCTCCTCGGCCACATATCTAATTAACCGCCTCCCCACTCCCACTCTCCATGGCCTATCTCCTTTTGAAAAAGCCTTTGGTCGTAAACCAGACTA
TAAGTTTCTCAAAGTCTTTGGGTGTTCATGTTTTCCTTGCTTGCGTCCCTATCAAAAACACAAGTTTGACTTCCACTCCATCAAATGCACATTTCTTGGCTATGATGAGT
CTTATAAAGGCTACATTTGCCTTAGTCCTCAAGGTCGTCGCTACATTTCCCGTCATGTACTCTTTAATGAAGTAGAATTTCCTTACATCTCTCAGCCAACGACCCAACCC
ATTTCTTCACCTAATAACTCCATGTCCTATTGTCTTGTTCCACCTCCTACCAATCACTTTAACCCAGTACCTACTATTTCCACATCTAATGTCCAAGCTGCACCACCACC
ATCCTTGGTTGATACACCTCCTGCCAGCAATATTGTGGTTTCACCTTCTTCGGCTGCTGTGTCACCACCAACGCATATGCTAAGGTCAGTTTCTTCTTCCCCTATTGTAC
CCATTTTAACTGATGTTGTCACCTCTCCTTCCCCTCATCTGCAACCACCATCTATTCCTACTCACATCTCTACACACCCCATGCAAACCCGAGCCAAATGTGGCATTTTT
AAACCAAAAATATTTCAAGCCCAAATCAGCCCTCCCACTGACCTTTGTGAGCCCACATCAGTAAAGCAAGCTCTTCTCTCTCATGAGTGGAAATCAGCAATGGATAGTGA
ATATCAGGCCCTCCTTGGCAATGGTACTTGGTCCCTTGTTCCTCCATCTCCTCATTATAACGTTGTTGGCAACAAGTGGGTGTTTCGTCTTAAGTTTCATCCTGATGGTA
GCATCCAACGCTACAAAGCTAGATTGGTTGCCAAGGGATTTCATCAAACACCCGCTATTGACTATCATGAGACCTTCAGCCCTGTTATAAAGCCTCCTACAGTCAAACTC
ATCCTCACATTAGCAGTCTCAAAGAATTGGATTGTTCGTCAACTTGACATTAACAACGCCTTCCTTAATGGACATCTTGCTGAGGCTGTATATATGGTCCAACCAGAAGG
CTATGTTGACCCAACCAAACCTACCTATGTATGCAAGCTCCATAAGGCACTATATGGTCTCAAACAAGCCCCACGAGCTTGGTATGATCGGTTGAAACAATGTTTACACC
GCTGGAAATTTCAGAACTCCAAGGCTGACAGTTCTCTTTTTTATTACTTTGCTAACGGTCACCAAATATTTCTTCTTGTCTATGTTGATGACATATTAATTACAGGAACA
AGCCCTACTTTGATTGACGACTTCATCACAAAGTTGGGCAACCTTTTCGCTTTAAAGGATCTTGGTCATCTTCATTACTTCCTTGGCATCGAGGTTACTCGTAACTCTAG
TGGTATGGTCCTTACACAACGACGCTATATTCATGATCTCTTACACAGAGTTGGACTTGAGCACATCAAACCTTGCCCCACACCAGCTGTTGTGGGAAGCCATCTCTCAG
CCACGCAAGGCACTCCACTTGCCAGTCCCACACAATATCGCAGTACTATCGGTGCTCTTCAATATCTCTCTCATACACGACCTGATATTGCCTTTATTGTCAATAAACTC
AGTCAGTTTCTTCAAGCCCCTACAACTGACCATTGGCAAGCTGTGAAACGGGTACTTCGCTATCTCCAAGGGACTGTTGATTACGGCCTACATATCTCCCCTTGCTCTAA
CCTCACTCTTACTGGATACTCAGATGCTGATTGGGCTTGTTGTCCAGATGATCGACGGTCCATTGGTGCTTATTGCATCTTTTATGGCACTTCCCTGGTTTCATGGTCCT
CTAAGAAGCAAACTGTTGTTGCTCGCTCCAGTGCTGAAAGTGAATATCGTGCCATGGCTCAAGCTACAGCTGAAATTACATGGCTCCACAATCTTCTTCATGAACTTTGC
TTGCTGCCGCCAGCTCCTACTGTACTTTGGAGTGACAATATGAGTGCTATTTCCTTAGCCTCCAACCCAGTTTTTCACTCACGCACAAAACATATTGAACTTGACGTACA
CTTCATTCGTGACAAGGTTCTGGGAAAAACGCTTGAGGTTCGTTATGTGCCATCTCATGATCAGCTAGCCGACTTCCTCACAAAGCCACTGTCTTCTGCTGTCTTTCAAG
TTTTTCGCTCCAAACTCGGTGTATTTTCACCCTCTCGTTTGAGGGGGGATGTCAAGGATTGCAATGAAACATTACCCATCCATCTGTCACAACCATCAGCAACCACATCT
ACAGAAGATTCATAA
Protein sequenceShow/hide protein sequence
MASQGSSMNDSVTSVPAIQTGNNGSPSLGNPFNSILAVKLDEKNYILWKSIVIAILRGQKLEGFILGTTNPPPAIQQEATTSTATTNPEREKWTTTDQTLLGWLFSTMSP
SVASEVINLQTSREVWKALEELYGAANKSRVNQLRTGLQTTRKNNMRMGEYLAQMKEFSENLALAGEPISNSYLISSVLAGLDSEFLPIMCQINSQGPMSWQEFHATLVT
FENTLLRLKVINLDGDASANFVVANQNNNIQSSNSSQERNSEETIPLKTVSEDMDEIEGGVMGHSALVCYYRYDPPTVGQQHNSPRPAANIATAEVIADQSWYADSGATN
HTTAELANLNIQSEYGGKTSLVVGNGSKLKIAHVGESLIQNNCLSKPIALKKILHVPSIQKNLISISKLTADNNLFVEFHSHMCFVKDKQTGKVVLQGVLKDGLYQLNLN
SNPPKSRPQLLFSQASQPHVLSASVKPSFPSQSLSFSLWHKRLGHPSNKILHSVVKSCNLQCPINEKPPFCDACVYGKAHLLPFHNAPNITTAPLELVYSDLWGPSPVCS
LNGYRFYISFLDDFTRLTYIFPLKAKSEATSTFLHFKKLIENKFDRKIKTLQTDWGGEYRPLTSHLKALGIAFRHPCPHTSEQNGRVERKHRHIVEMGLTLLAQASLPLK
FWWEAFSSATYLINRLPTPTLHGLSPFEKAFGRKPDYKFLKVFGCSCFPCLRPYQKHKFDFHSIKCTFLGYDESYKGYICLSPQGRRYISRHVLFNEVEFPYISQPTTQP
ISSPNNSMSYCLVPPPTNHFNPVPTISTSNVQAAPPPSLVDTPPASNIVVSPSSAAVSPPTHMLRSVSSSPIVPILTDVVTSPSPHLQPPSIPTHISTHPMQTRAKCGIF
KPKIFQAQISPPTDLCEPTSVKQALLSHEWKSAMDSEYQALLGNGTWSLVPPSPHYNVVGNKWVFRLKFHPDGSIQRYKARLVAKGFHQTPAIDYHETFSPVIKPPTVKL
ILTLAVSKNWIVRQLDINNAFLNGHLAEAVYMVQPEGYVDPTKPTYVCKLHKALYGLKQAPRAWYDRLKQCLHRWKFQNSKADSSLFYYFANGHQIFLLVYVDDILITGT
SPTLIDDFITKLGNLFALKDLGHLHYFLGIEVTRNSSGMVLTQRRYIHDLLHRVGLEHIKPCPTPAVVGSHLSATQGTPLASPTQYRSTIGALQYLSHTRPDIAFIVNKL
SQFLQAPTTDHWQAVKRVLRYLQGTVDYGLHISPCSNLTLTGYSDADWACCPDDRRSIGAYCIFYGTSLVSWSSKKQTVVARSSAESEYRAMAQATAEITWLHNLLHELC
LLPPAPTVLWSDNMSAISLASNPVFHSRTKHIELDVHFIRDKVLGKTLEVRYVPSHDQLADFLTKPLSSAVFQVFRSKLGVFSPSRLRGDVKDCNETLPIHLSQPSATTS
TEDS