; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018548 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018548
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr5:29337573..29341016
RNA-Seq ExpressionLag0018548
SyntenyLag0018548
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]3.9e-17035.28Show/hide
Query:  MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKR----KN----------------LMIETNLSRSSSSGR
        +SW + Q+QLL FE R+EQL  + +  ++     AN +    +  N     S        R R    KN                   +   SRS+ S  
Subjt:  MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKR----KN----------------LMIETNLSRSSSSGR

Query:  NQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISISRLTM
        +  +G + +A + +   + D  WY +SGASNH+T         +++ G   +  G+G +L I   GSS +KS    L L  +++VP I+KNL+S+S+L  
Subjt:  NQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISISRLTM

Query:  DNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRI-----------------------------------------PSIKSASCPGLQ-----------
        DN+++VEF ++CC VKDK T KV+L+G LKDGLY++                                         PS   + C   Q           
Subjt:  DNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRI-----------------------------------------PSIKSASCPGLQ-----------

Query:  -----------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQLA
                                           +D  +++WI+PLK KS+ V  F QFK+L EN ++ +IKVI+CD GGEYKP+  +A + GIQ +++
Subjt:  -----------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQLA

Query:  CPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKC
        CPYTS QNGR ERKHRH+ E  L LLA A+MPL YWWEAF T+VYLIN +P++    +  ++++ +++PDY  L+ FG ACYPCL+ Y  HK  +HTT+C
Subjt:  CPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKC

Query:  VFIGYSSNHK-----------------------------------VAASSINSPTSTTTSSSDGRSHESSLATPRLLDYPSPTNV----SINEETMDAIN
        VF+GYS++HK                                      ++IN P+++    + G   + +       + P+ TN      +N +T    N
Subjt:  VFIGYSSNHK-----------------------------------VAASSINSPTSTTTSSSDGRSHESSLATPRLLDYPSPTNV----SINEETMDAIN

Query:  SSA------------TRPSVINSSTTRPSTSIHPMVTRDKSGVTKPK-KFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLV
          +            T+   +  ++   +TS H + TR KSG+ KPK  + G     + +M     EP +  +AL  P W+ AM++EF  L  N+TW LV
Subjt:  SSA------------TRPSVINSSTTRPSTSIHPMVTRDKSGVTKPK-KFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLV

Query:  PPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFED
        P  +  ++V +KW+FK K    G++   KARLVA+GF QT GID+ ET++PVIK  T +IIL++A   +W VRQLD++N FLNG L E +FM QP+ F D
Subjt:  PPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFED

Query:  PKKPHYICRLRKALYGLKQVPRA-------------CQRQTPPCSFFARTKK---YFFLCMSMTCFLRVPPHTYLISLFKSFTSRL-LFEISEQSTILGI
          KP++IC+L KA+YGLKQ PRA              Q      S F    K    F L       +      +L +  K       L ++      LGI
Subjt:  PKKPHYICRLRKALYGLKQVPRA-------------CQRQTPPCSFFARTKK---YFFLCMSMTCFLRVPPHTYLISLFKSFTSRL-LFEISEQSTILGI

Query:  QITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILR
        ++ R + G +L Q+ Y+ DLL +  +++    PTPM      T   + L  DP+++R  I  L Y+  T PDIAF+VN LSQ+  +P   HW  +KRILR
Subjt:  QITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILR

Query:  YLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDLA
        YL GT N  L ++P+  + +TG+SD DWA    DRKS+ G CVFLG +LISWSS+KQ+VV+R STESEYR LA +A ++A
Subjt:  YLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDLA

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]1.7e-17335.4Show/hide
Query:  MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKR----KNLMIETNLSRSSSSGR----------------
        +SW + Q+QLL FE RL+Q        ++    SAN A+    + N F  +      +++  R    K  M  T     + +G                 
Subjt:  MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKR----KNLMIETNLSRSSSSGR----------------

Query:  ----NQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISIS
              +K  + SA I +P H +D  WY +SGA+NH+T         +++ G   +  G+G +L I   GS    + + NL L  +++VPQI+KNL+S+S
Subjt:  ----NQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISIS

Query:  RLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP--------SIKS-------------------------------ASCPGLQ---------
        +LT DN+++VEF  +CC+VKDK T + LL+G LKDGLY++         S+K                                + C   Q         
Subjt:  RLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP--------SIKS-------------------------------ASCPGLQ---------

Query:  -------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQ
                                             +D  +++WIFPLK KSD +  F QFK+L EN ++ KIK+I+CD GGEYK +  ++ + GIQ +
Subjt:  -------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQ

Query:  LACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTT
        ++CPYTS QNGR ERKHRHV E  L LLA AKMPL+YWWEAF T+VYLIN +P+     +  ++++ K +PDYN+L+ FG ACYPCL+ Y  HK  FHTT
Subjt:  LACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTT

Query:  KCVFIGYSSNHKVAASSINS-------------------------------------------------------PTSTTTS--------SSDGRSHESS
        +CVF+GYS++HK     INS                                                       P + TTS        SSD   +E  
Subjt:  KCVFIGYSSNHKVAASSINS-------------------------------------------------------PTSTTTS--------SSDGRSHESS

Query:  LATPRLL---DYPSPTNVSINEETMDAINSSATRPSVINSSTTRPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAM
        + +       +  S  ++  +        +++T    I     + +++ H M TR K G+ KPK        A+T  D    EP S  +AL  P W+ AM
Subjt:  LATPRLL---DYPSPTNVSINEETMDAINSSATRPSVINSSTTRPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAM

Query:  KEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNG
         +E+  L  N TW LVP     +++ +KWIFK K  + G+I   KARLVA+GF QT G+DF ET++PV+K  T +IILT+A  ++W VRQLD++N FLNG
Subjt:  KEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNG

Query:  PLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA---CQRQTPPCSFFARTKK----YFFLCMSMTCFLRVPPHTYL-----ISLFKSFTSRL--
         L E +FM QP+ + D  KP++IC+L KA+YGLKQ PRA     R T     F   K     +F      T FL +     +     I   ++FT++L  
Subjt:  PLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA---CQRQTPPCSFFARTKK----YFFLCMSMTCFLRVPPHTYL-----ISLFKSFTSRL--

Query:  ---LFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFA
           L ++      LG+++ R   G +L Q  Y++D+L + ++E+  + PTPM       A  +++ N P+LYR  I +L Y+  TRPDIAFAVN LSQ+ 
Subjt:  ---LFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFA

Query:  QAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDLA
          P   HW  +KRILRYL GT N  L ++P+ ++ + G+ D DWA    DRKS GG CVFLG +L+SW+S+KQ+VV+R STESEYR+LA +  +++
Subjt:  QAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDLA

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]1.8e-17536.8Show/hide
Query:  MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQ--------------QYNTFQHQSRHMNQS--------------------YQRKRKNL
        ++W E Q+QLL +E RLEQ+    +  ++   PS+N+++                  Q N      R   ++                    Y R  KN 
Subjt:  MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQ--------------QYNTFQHQSRHMNQS--------------------YQRKRKNL

Query:  MIETNLSRSSSSGRNQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVP
        + + +  + S   + QN   N +A + +P  + D  WY +SGASNH+T D + +   ++  G   +T G+G  L I   G S + +  K+L LK +++VP
Subjt:  MIETNLSRSSSSGRNQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVP

Query:  QISKNLISISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP---------------------------------------SIKSASCPGL
        +I+KNL+SIS+LT DN + VEF D  C VKDK T ++LLEG +KDGLY++P                                       +I+++ C   
Subjt:  QISKNLISISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP---------------------------------------SIKSASCPGL

Query:  Q----------------------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGE
        +                                                    +D  +++WI+PLK KSD    F QF++LVEN ++ +IK ++CD GGE
Subjt:  Q----------------------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGE

Query:  YKPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACY
        +K +  +  + GIQ++ +CPYTSAQNGR ERKHRHVVE+ L LLA AKMPL YWWEAF T+V+LIN +PT+ I  K  +  L  + PDY +++ FG ACY
Subjt:  YKPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACY

Query:  PCLRQYQHHKFDFHTTKCVFIGYSSNHKVAASSINSPTSTTTSS-----------SDG----RSHESSLATPRLLDYP-SPT--NVSINEETMDAINSSA
        PCL+ Y  HK  FHTTKCVF+GYS +HK     +NS      S             DG    R     +  P  L +P SPT  NV+  E+ +   N+S+
Subjt:  PCLRQYQHHKFDFHTTKCVFIGYSSNHKVAASSINSPTSTTTSS-----------SDG----RSHESSLATPRLLDYP-SPT--NVSINEETMDAINSSA

Query:  ----------------TRPSVINSST--------TRPSTSIHPMVTRDKSGVTKPKK-FFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTL
                        T  + I+ +T           S + H M TR K G+ KPKK + G   +          EP +  +AL+ P W++AM  EF  L
Subjt:  ----------------TRPSVINSST--------TRPSTSIHPMVTRDKSGVTKPKK-FFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTL

Query:  TKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIF
          N+TW LVP     +++  KW+FK K  A G I   KARLVA+GF QTLG+D+ ET++PVIK +T +IIL++A  ++W +RQ+D++N FLNG L E +F
Subjt:  TKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIF

Query:  MPQPKDFEDPKKPHYICRLRKALYGLKQVPRACQRQTPPCSF-----FARTKKYFFLCMSMT--CFLRVPPHTYLI-----SLFKSFTSRL--LFEISEQ
        M QP+ F D  +P +IC+L KA+YGLKQ PR+   +             R+    F+ MS     FL +     +I     S   SF  +L  +F + + 
Subjt:  MPQPKDFEDPKKPHYICRLRKALYGLKQVPRACQRQTPPCSF-----FARTKKYFFLCMSMT--CFLRVPPHTYLI-----SLFKSFTSRL--LFEISEQ

Query:  STI---LGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAH
         ++   LG++  R + G +L Q  YV DLL + +LEH+ S PTPM    S++  ++++ N P+LYR  I  L Y+  TRPDIA++VN LSQ+ QAP   H
Subjt:  STI---LGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAH

Query:  WLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTES
        W S+KR+ RYL GT N  L ++P+  + +TG+SD DWA    DRKSV GYCVFLG SLI+WSSKKQ+VV+R STES
Subjt:  WLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTES

PNY02796.1 copia protein (gag-int-pol protein), partial [Trifolium pratense]3.0e-17036.5Show/hide
Query:  SSSGRNQ-NKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLIS
        S SG NQ  +  + SA I +P + +D  WY +SGASNH+T         + + G   +  G+G +L I   GS    + +KNL L  +++VP+I+KNL+S
Subjt:  SSSGRNQ-NKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLIS

Query:  ISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIPSIKSAS---------------------------------------------CPGLQ-
        +S+LT DN++IVEF   CC+VKDK T K LL+G LK+GLY++ ++ S S                                             C   Q 
Subjt:  ISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIPSIKSAS---------------------------------------------CPGLQ-

Query:  ---------------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIA
                                                     +D  +++WI+PLK KS+ +  F QFK+LVEN ++ +IK+++CD GGEYK +  +A
Subjt:  ---------------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIA

Query:  SQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQH
         + GIQ +++CPYTS QNGR ERKHRHV E  L +LA A+MPL YWWEAF TSVYLIN +P+        +T++ K++PDY+ L+ FG ACYPCL+ Y  
Subjt:  SQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQH

Query:  HKFDFHTTKCVFIGYSSNHK------------------------------------VAASSINSPTSTTTSSSDGRS-----------------------
        HK  FHTT+CVF+GYS++HK                                    +   + N P     + +DG +                       
Subjt:  HKFDFHTTKCVFIGYSSNHK------------------------------------VAASSINSPTSTTTSSSDGRS-----------------------

Query:  ---HESS--LATPRLLDYPSPTNVSINEETMDAINSSATRPSVINSSTTRPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVP
           HESS  L         S      N ++ + + +S      I +++   +T+ H M TR K G+ KPK+ +    +A T       EP +  +AL  P
Subjt:  ---HESS--LATPRLLDYPSPTNVSINEETMDAINSSATRPSVINSSTTRPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVP

Query:  SWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVS
         W+ AM  EF  L  N TW LVP     +++ +KW+FK K  A G+I   KARLVA+GF QT G+D+ ET++PV+K  T +IIL++A  ++W VRQLD++
Subjt:  SWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVS

Query:  NTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA-------------CQRQTPPCSFFAR--TKKYFFLCMSMTCFLRVPPHTYLISLF
        N FLNG L E +FM QP+ + D  KP++IC+L KA+YGLKQ PRA              Q      S F    T  +  L + +   +    +T  +  F
Subjt:  NTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA-------------CQRQTPPCSFFAR--TKKYFFLCMSMTCFLRVPPHTYLISLF

Query:  --KSFTSRLLFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVN
          +  T   L ++      LGI++ R++ G +L Q+ Y+ DLL +  +E   + PTPM      T   + +  +P++YR  I +L Y+  TRPDIAFAVN
Subjt:  --KSFTSRLLFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVN

Query:  YLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACD
         LSQ+  +P   HW  +KRILRYL GT N  L ++P+  + + G+SD DWA    DRKS+ G CVFLG SLISWSS+KQ+VV+R STESEYR LA +A +
Subjt:  YLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACD

RVW64314.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.6e-16635.07Show/hide
Query:  MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKRKNLMIETNLSRSSSSGRNQN-----------------
        MS +   S LL  E+RL    + ++SV      SANLA+   Q +N  +   ++    +  +R      TN  RS SS                      
Subjt:  MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKRKNLMIETNLSRSSSSGRNQN-----------------

Query:  -----KGDNPS----------------AMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGL
             +G NP+                AM+ +P  + D AW+ ++GA++HL+  +  LS    Y GN+ +  G+G  L I + G+++  SS K   L+ +
Subjt:  -----KGDNPS----------------AMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGL

Query:  MHVPQISKNLISISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP---------------------------------------------
        +HVP I+ NLIS+S+   DN+   EF      VKD+ T+K+LL+G+L+ GLYR P                                             
Subjt:  MHVPQISKNLISISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP---------------------------------------------

Query:  -------------------------------SIKSASCP------------------GLQ------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYS
                                       S+  AS P                  G +      +D  ++SWI+PL  K  A++VF +FKSLVEN ++
Subjt:  -------------------------------SIKSASCP------------------GLQ------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYS

Query:  SKIKVIRCDEGGEYKPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKP
        S+I+ +R D GGE+K      +  GI+ Q +CPYT  QNGR ERK RH++ET LALLA A +P ++W  AFHT+++LIN +PT+ +  +  F +L  + P
Subjt:  SKIKVIRCDEGGEYKPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKP

Query:  DYNSLRIFGVACYPCLRQYQHHKFDFHTTKCVFIGYSSNHK--VAASSINSPTSTT------------TSSSDGRSHESSLATPRLLDYPSPTNVSINEE
        +Y+  +IFG  CYP +R Y  +K  + +++CVF+GYSSNHK  +  + +      T             S+ D  S   ++ TP  L   SP   S+   
Subjt:  DYNSLRIFGVACYPCLRQYQHHKFDFHTTKCVFIGYSSNHK--VAASSINSPTSTT------------TSSSDGRSHESSLATPRLLDYPSPTNVSINEE

Query:  TMDAI---------NSSATRPSVI-----NSSTTRP-STSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLT
        T  +          +S+ + P +I     + ST+ P  T+ HPMVTR K+G++K K +F  +           +EPT++  A+K  +W  AM++EF+ L 
Subjt:  TMDAI---------NSSATRPSVI-----NSSTTRP-STSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLT

Query:  KNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFM
        +N TW LVPP SN +++G KW++KLK    G +  YKARLVAQGF+QTLG+D+ ET++PV+K  T +IIL +A +++WSV QLDV N FL+G L E +FM
Subjt:  KNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFM

Query:  PQPKDFEDPKKPHYICRLRKALYGLKQVPRACQRQ--TPPCSFFARTKKYFFLCMSMTCFLRVPPHTYLISLF---------------KSFTSRL-----
         QP  F + + P ++C+L KALYGLKQ PRA   +  T    +  +  +        + F+    H  LI L                 SF +RL     
Subjt:  PQPKDFEDPKKPHYICRLRKALYGLKQVPRACQRQ--TPPCSFFARTKKYFFLCMSMTCFLRVPPHTYLISLF---------------KSFTSRL-----

Query:  LFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAP
        L ++   +  LGI++ RS   FHLSQ  Y QDLLSR  +   K A TP     +++      F+D +LYRST+ +L Y+ LTRPDI+FAVN   QF   P
Subjt:  LFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAP

Query:  QQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
           HWL++KRILRYL GT + G+ +Q + S+ + GY+D DWA CP DR+S GGY +FLG +L+SWSS KQ+VV+R S ESEYR LA    ++
Subjt:  QQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL

TrEMBL top hitse value%identityAlignment
A0A151S6M8 Retrovirus-related Pol polyprotein from transposon TNT 1-948.8e-17636.8Show/hide
Query:  MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQ--------------QYNTFQHQSRHMNQS--------------------YQRKRKNL
        ++W E Q+QLL +E RLEQ+    +  ++   PS+N+++                  Q N      R   ++                    Y R  KN 
Subjt:  MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQ--------------QYNTFQHQSRHMNQS--------------------YQRKRKNL

Query:  MIETNLSRSSSSGRNQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVP
        + + +  + S   + QN   N +A + +P  + D  WY +SGASNH+T D + +   ++  G   +T G+G  L I   G S + +  K+L LK +++VP
Subjt:  MIETNLSRSSSSGRNQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVP

Query:  QISKNLISISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP---------------------------------------SIKSASCPGL
        +I+KNL+SIS+LT DN + VEF D  C VKDK T ++LLEG +KDGLY++P                                       +I+++ C   
Subjt:  QISKNLISISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP---------------------------------------SIKSASCPGL

Query:  Q----------------------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGE
        +                                                    +D  +++WI+PLK KSD    F QF++LVEN ++ +IK ++CD GGE
Subjt:  Q----------------------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGE

Query:  YKPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACY
        +K +  +  + GIQ++ +CPYTSAQNGR ERKHRHVVE+ L LLA AKMPL YWWEAF T+V+LIN +PT+ I  K  +  L  + PDY +++ FG ACY
Subjt:  YKPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACY

Query:  PCLRQYQHHKFDFHTTKCVFIGYSSNHKVAASSINSPTSTTTSS-----------SDG----RSHESSLATPRLLDYP-SPT--NVSINEETMDAINSSA
        PCL+ Y  HK  FHTTKCVF+GYS +HK     +NS      S             DG    R     +  P  L +P SPT  NV+  E+ +   N+S+
Subjt:  PCLRQYQHHKFDFHTTKCVFIGYSSNHKVAASSINSPTSTTTSS-----------SDG----RSHESSLATPRLLDYP-SPT--NVSINEETMDAINSSA

Query:  ----------------TRPSVINSST--------TRPSTSIHPMVTRDKSGVTKPKK-FFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTL
                        T  + I+ +T           S + H M TR K G+ KPKK + G   +          EP +  +AL+ P W++AM  EF  L
Subjt:  ----------------TRPSVINSST--------TRPSTSIHPMVTRDKSGVTKPKK-FFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTL

Query:  TKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIF
          N+TW LVP     +++  KW+FK K  A G I   KARLVA+GF QTLG+D+ ET++PVIK +T +IIL++A  ++W +RQ+D++N FLNG L E +F
Subjt:  TKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIF

Query:  MPQPKDFEDPKKPHYICRLRKALYGLKQVPRACQRQTPPCSF-----FARTKKYFFLCMSMT--CFLRVPPHTYLI-----SLFKSFTSRL--LFEISEQ
        M QP+ F D  +P +IC+L KA+YGLKQ PR+   +             R+    F+ MS     FL +     +I     S   SF  +L  +F + + 
Subjt:  MPQPKDFEDPKKPHYICRLRKALYGLKQVPRACQRQTPPCSF-----FARTKKYFFLCMSMT--CFLRVPPHTYLI-----SLFKSFTSRL--LFEISEQ

Query:  STI---LGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAH
         ++   LG++  R + G +L Q  YV DLL + +LEH+ S PTPM    S++  ++++ N P+LYR  I  L Y+  TRPDIA++VN LSQ+ QAP   H
Subjt:  STI---LGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAH

Query:  WLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTES
        W S+KR+ RYL GT N  L ++P+  + +TG+SD DWA    DRKSV GYCVFLG SLI+WSSKKQ+VV+R STES
Subjt:  WLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTES

A0A2K3NIC3 Copia protein (Gag-int-pol protein) (Fragment)1.5e-17036.5Show/hide
Query:  SSSGRNQ-NKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLIS
        S SG NQ  +  + SA I +P + +D  WY +SGASNH+T         + + G   +  G+G +L I   GS    + +KNL L  +++VP+I+KNL+S
Subjt:  SSSGRNQ-NKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLIS

Query:  ISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIPSIKSAS---------------------------------------------CPGLQ-
        +S+LT DN++IVEF   CC+VKDK T K LL+G LK+GLY++ ++ S S                                             C   Q 
Subjt:  ISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIPSIKSAS---------------------------------------------CPGLQ-

Query:  ---------------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIA
                                                     +D  +++WI+PLK KS+ +  F QFK+LVEN ++ +IK+++CD GGEYK +  +A
Subjt:  ---------------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIA

Query:  SQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQH
         + GIQ +++CPYTS QNGR ERKHRHV E  L +LA A+MPL YWWEAF TSVYLIN +P+        +T++ K++PDY+ L+ FG ACYPCL+ Y  
Subjt:  SQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQH

Query:  HKFDFHTTKCVFIGYSSNHK------------------------------------VAASSINSPTSTTTSSSDGRS-----------------------
        HK  FHTT+CVF+GYS++HK                                    +   + N P     + +DG +                       
Subjt:  HKFDFHTTKCVFIGYSSNHK------------------------------------VAASSINSPTSTTTSSSDGRS-----------------------

Query:  ---HESS--LATPRLLDYPSPTNVSINEETMDAINSSATRPSVINSSTTRPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVP
           HESS  L         S      N ++ + + +S      I +++   +T+ H M TR K G+ KPK+ +    +A T       EP +  +AL  P
Subjt:  ---HESS--LATPRLLDYPSPTNVSINEETMDAINSSATRPSVINSSTTRPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVP

Query:  SWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVS
         W+ AM  EF  L  N TW LVP     +++ +KW+FK K  A G+I   KARLVA+GF QT G+D+ ET++PV+K  T +IIL++A  ++W VRQLD++
Subjt:  SWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVS

Query:  NTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA-------------CQRQTPPCSFFAR--TKKYFFLCMSMTCFLRVPPHTYLISLF
        N FLNG L E +FM QP+ + D  KP++IC+L KA+YGLKQ PRA              Q      S F    T  +  L + +   +    +T  +  F
Subjt:  NTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA-------------CQRQTPPCSFFAR--TKKYFFLCMSMTCFLRVPPHTYLISLF

Query:  --KSFTSRLLFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVN
          +  T   L ++      LGI++ R++ G +L Q+ Y+ DLL +  +E   + PTPM      T   + +  +P++YR  I +L Y+  TRPDIAFAVN
Subjt:  --KSFTSRLLFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVN

Query:  YLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACD
         LSQ+  +P   HW  +KRILRYL GT N  L ++P+  + + G+SD DWA    DRKS+ G CVFLG SLISWSS+KQ+VV+R STESEYR LA +A +
Subjt:  YLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACD

A0A2Z6MBG6 Integrase catalytic domain-containing protein1.9e-17035.28Show/hide
Query:  MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKR----KN----------------LMIETNLSRSSSSGR
        +SW + Q+QLL FE R+EQL  + +  ++     AN +    +  N     S        R R    KN                   +   SRS+ S  
Subjt:  MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKR----KN----------------LMIETNLSRSSSSGR

Query:  NQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISISRLTM
        +  +G + +A + +   + D  WY +SGASNH+T         +++ G   +  G+G +L I   GSS +KS    L L  +++VP I+KNL+S+S+L  
Subjt:  NQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISISRLTM

Query:  DNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRI-----------------------------------------PSIKSASCPGLQ-----------
        DN+++VEF ++CC VKDK T KV+L+G LKDGLY++                                         PS   + C   Q           
Subjt:  DNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRI-----------------------------------------PSIKSASCPGLQ-----------

Query:  -----------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQLA
                                           +D  +++WI+PLK KS+ V  F QFK+L EN ++ +IKVI+CD GGEYKP+  +A + GIQ +++
Subjt:  -----------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQLA

Query:  CPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKC
        CPYTS QNGR ERKHRH+ E  L LLA A+MPL YWWEAF T+VYLIN +P++    +  ++++ +++PDY  L+ FG ACYPCL+ Y  HK  +HTT+C
Subjt:  CPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKC

Query:  VFIGYSSNHK-----------------------------------VAASSINSPTSTTTSSSDGRSHESSLATPRLLDYPSPTNV----SINEETMDAIN
        VF+GYS++HK                                      ++IN P+++    + G   + +       + P+ TN      +N +T    N
Subjt:  VFIGYSSNHK-----------------------------------VAASSINSPTSTTTSSSDGRSHESSLATPRLLDYPSPTNV----SINEETMDAIN

Query:  SSA------------TRPSVINSSTTRPSTSIHPMVTRDKSGVTKPK-KFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLV
          +            T+   +  ++   +TS H + TR KSG+ KPK  + G     + +M     EP +  +AL  P W+ AM++EF  L  N+TW LV
Subjt:  SSA------------TRPSVINSSTTRPSTSIHPMVTRDKSGVTKPK-KFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLV

Query:  PPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFED
        P  +  ++V +KW+FK K    G++   KARLVA+GF QT GID+ ET++PVIK  T +IIL++A   +W VRQLD++N FLNG L E +FM QP+ F D
Subjt:  PPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFED

Query:  PKKPHYICRLRKALYGLKQVPRA-------------CQRQTPPCSFFARTKK---YFFLCMSMTCFLRVPPHTYLISLFKSFTSRL-LFEISEQSTILGI
          KP++IC+L KA+YGLKQ PRA              Q      S F    K    F L       +      +L +  K       L ++      LGI
Subjt:  PKKPHYICRLRKALYGLKQVPRA-------------CQRQTPPCSFFARTKK---YFFLCMSMTCFLRVPPHTYLISLFKSFTSRL-LFEISEQSTILGI

Query:  QITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILR
        ++ R + G +L Q+ Y+ DLL +  +++    PTPM      T   + L  DP+++R  I  L Y+  T PDIAF+VN LSQ+  +P   HW  +KRILR
Subjt:  QITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILR

Query:  YLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDLA
        YL GT N  L ++P+  + +TG+SD DWA    DRKS+ G CVFLG +LISWSS+KQ+VV+R STESEYR LA +A ++A
Subjt:  YLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDLA

A0A2Z6P4D5 Integrase catalytic domain-containing protein8.2e-17435.4Show/hide
Query:  MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKR----KNLMIETNLSRSSSSGR----------------
        +SW + Q+QLL FE RL+Q        ++    SAN A+    + N F  +      +++  R    K  M  T     + +G                 
Subjt:  MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKR----KNLMIETNLSRSSSSGR----------------

Query:  ----NQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISIS
              +K  + SA I +P H +D  WY +SGA+NH+T         +++ G   +  G+G +L I   GS    + + NL L  +++VPQI+KNL+S+S
Subjt:  ----NQNKGDNPSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISIS

Query:  RLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP--------SIKS-------------------------------ASCPGLQ---------
        +LT DN+++VEF  +CC+VKDK T + LL+G LKDGLY++         S+K                                + C   Q         
Subjt:  RLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP--------SIKS-------------------------------ASCPGLQ---------

Query:  -------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQ
                                             +D  +++WIFPLK KSD +  F QFK+L EN ++ KIK+I+CD GGEYK +  ++ + GIQ +
Subjt:  -------------------------------------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQ

Query:  LACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTT
        ++CPYTS QNGR ERKHRHV E  L LLA AKMPL+YWWEAF T+VYLIN +P+     +  ++++ K +PDYN+L+ FG ACYPCL+ Y  HK  FHTT
Subjt:  LACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTT

Query:  KCVFIGYSSNHKVAASSINS-------------------------------------------------------PTSTTTS--------SSDGRSHESS
        +CVF+GYS++HK     INS                                                       P + TTS        SSD   +E  
Subjt:  KCVFIGYSSNHKVAASSINS-------------------------------------------------------PTSTTTS--------SSDGRSHESS

Query:  LATPRLL---DYPSPTNVSINEETMDAINSSATRPSVINSSTTRPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAM
        + +       +  S  ++  +        +++T    I     + +++ H M TR K G+ KPK        A+T  D    EP S  +AL  P W+ AM
Subjt:  LATPRLL---DYPSPTNVSINEETMDAINSSATRPSVINSSTTRPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAM

Query:  KEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNG
         +E+  L  N TW LVP     +++ +KWIFK K  + G+I   KARLVA+GF QT G+DF ET++PV+K  T +IILT+A  ++W VRQLD++N FLNG
Subjt:  KEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNG

Query:  PLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA---CQRQTPPCSFFARTKK----YFFLCMSMTCFLRVPPHTYL-----ISLFKSFTSRL--
         L E +FM QP+ + D  KP++IC+L KA+YGLKQ PRA     R T     F   K     +F      T FL +     +     I   ++FT++L  
Subjt:  PLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA---CQRQTPPCSFFARTKK----YFFLCMSMTCFLRVPPHTYL-----ISLFKSFTSRL--

Query:  ---LFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFA
           L ++      LG+++ R   G +L Q  Y++D+L + ++E+  + PTPM       A  +++ N P+LYR  I +L Y+  TRPDIAFAVN LSQ+ 
Subjt:  ---LFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFA

Query:  QAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDLA
          P   HW  +KRILRYL GT N  L ++P+ ++ + G+ D DWA    DRKS GG CVFLG +L+SW+S+KQ+VV+R STESEYR+LA +  +++
Subjt:  QAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDLA

A0A438FWJ3 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-16635.07Show/hide
Query:  MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKRKNLMIETNLSRSSSSGRNQN-----------------
        MS +   S LL  E+RL    + ++SV      SANLA+   Q +N  +   ++    +  +R      TN  RS SS                      
Subjt:  MSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKRKNLMIETNLSRSSSSGRNQN-----------------

Query:  -----KGDNPS----------------AMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGL
             +G NP+                AM+ +P  + D AW+ ++GA++HL+  +  LS    Y GN+ +  G+G  L I + G+++  SS K   L+ +
Subjt:  -----KGDNPS----------------AMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGL

Query:  MHVPQISKNLISISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP---------------------------------------------
        +HVP I+ NLIS+S+   DN+   EF      VKD+ T+K+LL+G+L+ GLYR P                                             
Subjt:  MHVPQISKNLISISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIP---------------------------------------------

Query:  -------------------------------SIKSASCP------------------GLQ------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYS
                                       S+  AS P                  G +      +D  ++SWI+PL  K  A++VF +FKSLVEN ++
Subjt:  -------------------------------SIKSASCP------------------GLQ------EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYS

Query:  SKIKVIRCDEGGEYKPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKP
        S+I+ +R D GGE+K      +  GI+ Q +CPYT  QNGR ERK RH++ET LALLA A +P ++W  AFHT+++LIN +PT+ +  +  F +L  + P
Subjt:  SKIKVIRCDEGGEYKPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKP

Query:  DYNSLRIFGVACYPCLRQYQHHKFDFHTTKCVFIGYSSNHK--VAASSINSPTSTT------------TSSSDGRSHESSLATPRLLDYPSPTNVSINEE
        +Y+  +IFG  CYP +R Y  +K  + +++CVF+GYSSNHK  +  + +      T             S+ D  S   ++ TP  L   SP   S+   
Subjt:  DYNSLRIFGVACYPCLRQYQHHKFDFHTTKCVFIGYSSNHK--VAASSINSPTSTT------------TSSSDGRSHESSLATPRLLDYPSPTNVSINEE

Query:  TMDAI---------NSSATRPSVI-----NSSTTRP-STSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLT
        T  +          +S+ + P +I     + ST+ P  T+ HPMVTR K+G++K K +F  +           +EPT++  A+K  +W  AM++EF+ L 
Subjt:  TMDAI---------NSSATRPSVI-----NSSTTRP-STSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLT

Query:  KNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFM
        +N TW LVPP SN +++G KW++KLK    G +  YKARLVAQGF+QTLG+D+ ET++PV+K  T +IIL +A +++WSV QLDV N FL+G L E +FM
Subjt:  KNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFM

Query:  PQPKDFEDPKKPHYICRLRKALYGLKQVPRACQRQ--TPPCSFFARTKKYFFLCMSMTCFLRVPPHTYLISLF---------------KSFTSRL-----
         QP  F + + P ++C+L KALYGLKQ PRA   +  T    +  +  +        + F+    H  LI L                 SF +RL     
Subjt:  PQPKDFEDPKKPHYICRLRKALYGLKQVPRACQRQ--TPPCSFFARTKKYFFLCMSMTCFLRVPPHTYLISLF---------------KSFTSRL-----

Query:  LFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAP
        L ++   +  LGI++ RS   FHLSQ  Y QDLLSR  +   K A TP     +++      F+D +LYRST+ +L Y+ LTRPDI+FAVN   QF   P
Subjt:  LFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAP

Query:  QQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
           HWL++KRILRYL GT + G+ +Q + S+ + GY+D DWA CP DR+S GGY +FLG +L+SWSS KQ+VV+R S ESEYR LA    ++
Subjt:  QQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.2e-5032.42Show/hide
Query:  SWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVS
        SW+ A+  E      N TW +     N ++V ++W+F +K +  G  + YKARLVA+GF+Q   ID+ ET+ PV +  +F+ IL+L   ++  V Q+DV 
Subjt:  SWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVS

Query:  NTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPR----ACQRQTPPCSFF-------------ARTKKYFFLCMSMTCFLRVPPHTYLIS
          FLNG L E I+M  P+          +C+L KA+YGLKQ  R      ++    C F                  +  ++ + +   +        ++
Subjt:  NTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPR----ACQRQTPPCSFF-------------ARTKKYFFLCMSMTCFLRVPPHTYLIS

Query:  LFKSFTSR--LLFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSI-TASSKVLFNDPSLYRSTIRSLHYVLL-TRPDIA
         FK +      + +++E    +GI+I       +LSQ+ YV+ +LS+ ++E+  +  TP+    +    +S    N P   RS I  L Y++L TRPD+ 
Subjt:  LFKSFTSR--LLFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSI-TASSKVLFNDPSLYRSTIRSLHYVLL-TRPDIA

Query:  FAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASI--SLTGYSDVDWAGCPIDRKSVGGYCV-FLGSSLISWSSKKQQVVARFSTESEYRT
         AVN LS+++       W +LKR+LRYL GT ++ L  +   +    + GY D DWAG  IDRKS  GY       +LI W++K+Q  VA  STE+EY  
Subjt:  FAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASI--SLTGYSDVDWAGCPIDRKSVGGYCV-FLGSSLISWSSKKQQVVARFSTESEYRT

Query:  L
        L
Subjt:  L

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.5e-8131.25Show/hide
Query:  EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEY--KPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPL
        +D  +  W++ LK K     VF +F +LVE     K+K +R D GGEY  +      S  GI+ +   P T   NG  ER +R +VE   ++L  AK+P 
Subjt:  EDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEY--KPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPL

Query:  QYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKCVFIGYSSNH------------KVAASSINSPTS
         +W EA  T+ YLIN  P+  +  ++   V   ++  Y+ L++FG   +  + + Q  K D  +  C+FIGY                 + +  +    S
Subjt:  QYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKCVFIGYSSNH------------KVAASSINSPTS

Query:  TTTSSSDGRSHESSLATPRLLDYPSPTNVSIN-EETMDAINSSATRPSVINSSTTRPSTSI----HPMVTRDK----SGVTKPKKFFGCYSQAQTSMDWS
           +++D      +   P  +  PS +N   + E T D ++    +P  +     +    +    HP    ++        +P+     Y   +  +   
Subjt:  TTTSSSDGRSHESSLATPRLLDYPSPTNVSIN-EETMDAINSSATRPSVINSSTTRPSTSI----HPMVTRDK----SGVTKPKKFFGCYSQAQTSMDWS

Query:  CNEPTSYIDALKVP---SWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQII
          EP S  + L  P      +AM+EE  +L KN T+ LV        +  KW+FKLK+D    +V YKARLV +GF Q  GIDF E ++PV+K  + + I
Subjt:  CNEPTSYIDALKVP---SWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQII

Query:  LTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA--------------CQRQTPPCSFFAR--TKKYFFLCM
        L+LA + D  V QLDV   FL+G L E I+M QP+ FE   K H +C+L K+LYGLKQ PR                +  + PC +F R     +  L +
Subjt:  LTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQVPRA--------------CQRQTPPCSFFAR--TKKYFFLCM

Query:  SMTCFLRVPPHTYLISLFKSFTSRL--LFEISEQSTILGIQIT--RSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPS----
         +   L V     LI+  K   S+   + ++     ILG++I   R+S+   LSQ  Y++ +L R ++++ K   TP+  +  +  S K+          
Subjt:  SMTCFLRVPPHTYLISLFKSFTSRL--LFEISEQSTILGIQIT--RSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPS----

Query:  ----LYRSTIRSLHYVLL-TRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSL
             Y S + SL Y ++ TRPDIA AV  +S+F + P + HW ++K ILRYL GT    L    +  I L GY+D D AG   +RKS  GY        
Subjt:  ----LYRSTIRSLHYVLL-TRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSL

Query:  ISWSSKKQQVVARFSTESEY
        ISW SK Q+ VA  +TE+EY
Subjt:  ISWSSKKQQVVARFSTESEY

P92519 Uncharacterized mitochondrial protein AtMg008106.1e-3341.85Show/hide
Query:  LGIQITRSSKGFHLSQATYVQDLLSRLHLEHLK--SAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSL
        LGIQI     G  LSQ  Y + +L+   +   K  S P P+  +SS++ +    + DPS +RS + +L Y+ LTRPDI++AVN + Q    P  A +  L
Subjt:  LGIQITRSSKGFHLSQATYVQDLLSRLHLEHLK--SAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSL

Query:  KRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
        KR+LRY+ GT   GL +   + +++  + D DWAGC   R+S  G+C FLG ++ISWS+K+Q  V+R STE+EYR LA  A +L
Subjt:  KRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.8e-13233.24Show/hide
Query:  PSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISISRLTMDNSVIVE
        P A +          W L+SGA++H+TSD +NLSL   YTG + +   DG+ +PI + GS+ + +  + L L  +++VP I KNLIS+ RL   N V VE
Subjt:  PSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISISRLTMDNSVIVE

Query:  FCDSCCAVKDKETRKVLLEGALKDGLYR------------------------------------------------IPSIKSASC---------------
        F  +   VKD  T   LL+G  KD LY                                                  PS K  SC               
Subjt:  FCDSCCAVKDKETRKVLLEGALKDGLYR------------------------------------------------IPSIKSASC---------------

Query:  -------------------PGLQED-----------IKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQLAC
                           P L  D             +Y+W++PLK KS     F  FK+L+EN + ++I     D GGE+  +    SQ GI    + 
Subjt:  -------------------PGLQED-----------IKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQLAC

Query:  PYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKCV
        P+T   NG  ERKHRH+VET L LL+HA +P  YW  AF  +VYLIN +PT  +  +  F  L    P+Y+ LR+FG ACYP LR Y  HK D  + +CV
Subjt:  PYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKCV

Query:  FIGYS-------------------------------SNHKVAASSINS---------------PTST---------------TTSSSDGRSHESSLATPR
        F+GYS                               SN+    S +                 PT T               T  SS      +S  +  
Subjt:  FIGYS-------------------------------SNHKVAASSINS---------------PTST---------------TTSSSDGRSHESSLATPR

Query:  LLDY-------------------PSPT--------------NVSINEETMDA-----------INSSATRPSVINS------STTRPSTSIHP-------
         LD                    P PT              N S N  T ++             SS++ PS   S      S T PS  IHP       
Subjt:  LLDY-------------------PSPT--------------NVSINEETMDA-----------INSSATRPSVINS------STTRPSTSIHP-------

Query:  -------------MVTRDKSGVTKPK-KFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLV-PPSSNCHLVGNKWIFKLKRD
                     M TR K+G+ KP  K+    S A  S      EP + I ALK   W+ AM  E      N TWDLV PP S+  +VG +WIF  K +
Subjt:  -------------MVTRDKSGVTKPK-KFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLV-PPSSNCHLVGNKWIFKLKRD

Query:  AHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQV
        + G++  YKARLVA+G++Q  G+D+ ET++PVIK  + +I+L +A    W +RQLDV+N FL G L + ++M QP  F D  +P+Y+C+LRKALYGLKQ 
Subjt:  AHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGLKQV

Query:  PRACQ-------------RQTPPCSFFA--RTKKYFFLCMSMTCFLRVPP-----HTYLISLFKSFTSRLLFEISEQSTILGIQITRSSKGFHLSQATYV
        PRA                     S F   R K   ++ + +   L         H  L +L + F+ +   +  E    LGI+  R   G HLSQ  Y+
Subjt:  PRACQ-------------RQTPPCSFFA--RTKKYFFLCMSMTCFLRVPP-----HTYLISLFKSFTSRLLFEISEQSTILGIQITRSSKGFHLSQATYV

Query:  QDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTAS
         DLL+R ++   K   TPM  S  ++  S     DP+ YR  + SL Y+  TRPDI++AVN LSQF   P + H  +LKRILRYL GT N G+ L+   +
Subjt:  QDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTAS

Query:  ISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
        +SL  YSD DWAG   D  S  GY V+LG   ISWSSKKQ+ V R STE+EYR++A+ + ++
Subjt:  ISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.0e-13031.89Show/hide
Query:  PSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISISRLTMDNSVIVE
        P A +          W L+SGA++H+TSD +NLS    YTG + +   DG+ +PI + GS+ + +S ++L L  +++VP I KNLIS+ RL   N V VE
Subjt:  PSAMITTPEHLRDTAWYLESGASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISISRLTMDNSVIVE

Query:  FCDSCCAVKDKETRKVLLEGALKDGLYR------------------------------------------------IPSIKSASC---------------
        F  +   VKD  T   LL+G  KD LY                                                  PS K  SC               
Subjt:  FCDSCCAVKDKETRKVLLEGALKDGLYR------------------------------------------------IPSIKSASC---------------

Query:  -------------------PGLQED-----------IKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQLAC
                           P L  D             +Y+W++PLK KS     F  FKSLVEN + ++I  +  D GGE+  +    SQ GI    + 
Subjt:  -------------------PGLQED-----------IKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQLAC

Query:  PYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKCV
        P+T   NG  ERKHRH+VE  L LL+HA +P  YW  AF  +VYLIN +PT  +  +  F  L  + P+Y  L++FG ACYP LR Y  HK +  + +C 
Subjt:  PYTSAQNGRVERKHRHVVETRLALLAHAKMPLQYWWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKCV

Query:  FIGYS---------------------------------------------------------------------------------------------SN
        F+GYS                                                                                             S+
Subjt:  FIGYS---------------------------------------------------------------------------------------------SN

Query:  HKVAASSINSPTST--TTSSSDG-------RSHESSLATPRLLDYPSPTNVSINEETMDA--------------INSSATRPSVINSSTT----------
          + +SSI+SP+S+  T  S +G          ++S +   +L+ P+P + S N    ++               ++S + P+  +SS+T          
Subjt:  HKVAASSINSPTST--TTSSSDG-------RSHESSLATPRLLDYPSPTNVSINEETMDA--------------INSSATRPSVINSSTT----------

Query:  ---------RPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLV-PPSSNCHLVGNKWIFKL
                 +   + H M TR K G+ KP +    YS A TS+  + +EP + I A+K   W++AM  E      N TWDLV PP  +  +VG +WIF  
Subjt:  ---------RPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLV-PPSSNCHLVGNKWIFKL

Query:  KRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGL
        K ++ G++  YKARLVA+G++Q  G+D+ ET++PVIK  + +I+L +A    W +RQLDV+N FL G L + ++M QP  F D  +P Y+CRLRKA+YGL
Subjt:  KRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGL

Query:  KQVPRACQ-------------RQTPPCSFFA--RTKKYFFLCMSMTCFLRVPPHTYLISLFKSFTSRLLFEISEQSTI---LGIQITRSSKGFHLSQATY
        KQ PRA                     S F   R +   ++ + +   L     T L+       S+  F + E   +   LGI+  R  +G HLSQ  Y
Subjt:  KQVPRACQ-------------RQTPPCSFFA--RTKKYFFLCMSMTCFLRVPPHTYLISLFKSFTSRLLFEISEQSTI---LGIQITRSSKGFHLSQATY

Query:  VQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTA
          DLL+R ++   K   TPM  S  +T  S     DP+ YR  + SL Y+  TRPD+++AVN LSQ+   P   HW +LKR+LRYL GT + G+ L+   
Subjt:  VQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTA

Query:  SISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
        ++SL  YSD DWAG   D  S  GY V+LG   ISWSSKKQ+ V R STE+EYR++A+ + +L
Subjt:  SISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.7e-6736.8Show/hide
Query:  EPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAP
        EP++Y +A +   W  AM +E   +    TW++     N   +G KW++K+K ++ G I  YKARLVA+G++Q  GIDF ET++PV K  + ++IL ++ 
Subjt:  EPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAP

Query:  TWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFE----DPKKPHYICRLRKALYGLKQVPR-------------ACQRQTPPCSFFARTKKYFFLCM----
         +++++ QLD+SN FLNG L+E I+M  P  +     D   P+ +C L+K++YGLKQ  R                +     ++F +     FLC+    
Subjt:  TWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFE----DPKKPHYICRLRKALYGLKQVPR-------------ACQRQTPPCSFFARTKKYFFLCM----

Query:  --SMTCFLRVPPHTYLISLFKSFTSRLLFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTI
           + C         L S  KS     L ++      LG++I RS+ G ++ Q  Y  DLL    L   K +  PM  S + +A S   F D   YR  I
Subjt:  --SMTCFLRVPPHTYLISLFKSFTSRLLFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITASSKVLFNDPSLYRSTI

Query:  RSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVV
          L Y+ +TR DI+FAVN LSQF++AP+ AH  ++ +IL Y+ GT   GL     A + L  +SD  +  C   R+S  GYC+FLG+SLISW SKKQQVV
Subjt:  RSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVV

Query:  ARFSTESEYRTLA
        ++ S E+EYR L+
Subjt:  ARFSTESEYRTLA

ATMG00240.1 Gag-Pol-related retrotransposon family protein5.0e-1443.59Show/hide
Query:  YVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYC
        Y+ +TRPD+ FAVN LSQF+ A + A   ++ ++L Y+ GT   GL    T+ + L  ++D DWA CP  R+SV G+C
Subjt:  YVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYC

ATMG00810.1 DNA/RNA polymerases superfamily protein4.4e-3441.85Show/hide
Query:  LGIQITRSSKGFHLSQATYVQDLLSRLHLEHLK--SAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSL
        LGIQI     G  LSQ  Y + +L+   +   K  S P P+  +SS++ +    + DPS +RS + +L Y+ LTRPDI++AVN + Q    P  A +  L
Subjt:  LGIQITRSSKGFHLSQATYVQDLLSRLHLEHLK--SAPTPMTFSSSITASSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSL

Query:  KRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL
        KR+LRY+ GT   GL +   + +++  + D DWAGC   R+S  G+C FLG ++ISWS+K+Q  V+R STE+EYR LA  A +L
Subjt:  KRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSSKKQQVVARFSTESEYRTLAHVACDL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)7.7e-2346.21Show/hide
Query:  MVTRDKSGVTK--PKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLV
        M+TR K+G+ K  PK     YS   T+      EP S I ALK P W +AM+EE   L++N+TW LVPP  N +++G KW+FK K  + G +   KARLV
Subjt:  MVTRDKSGVTK--PKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLVPPSSNCHLVGNKWIFKLKRDAHGAIVSYKARLV

Query:  AQGFSQTLGIDFHETYTPVIKPMTFQIILTLA
        A+GF Q  GI F ETY+PV++  T + IL +A
Subjt:  AQGFSQTLGIDFHETYTPVIKPMTFQIILTLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATGTCATGGAATGAAGCTCAATCTCAGCTGCTCATGTTTGAAAAGAGACTTGAACAACTACAAGCCATGAAGAGTTCTGTTGTTTCTATTGTCCAGCCTTCAGC
AAATTTAGCCTCCACAAATGTTCAACAATACAACACATTTCAACATCAGTCAAGGCACATGAACCAAAGTTATCAGAGGAAGAGGAAGAACCTCATGATCGAAACCAATC
TGTCAAGATCTTCTTCAAGTGGCCGTAATCAAAACAAGGGAGATAATCCATCAGCAATGATAACAACCCCAGAACACCTTCGTGACACAGCTTGGTATTTGGAAAGTGGT
GCTAGCAACCATTTAACATCAGATATGTCCAACTTGTCTCTCAAATCTGACTACACAGGTAATGAAATGATAACTGCGGGAGATGGTAATCAATTGCCTATTCATTATAT
TGGAAGTTCCTATATTAAATCTAGTGTTAAGAATCTTATTTTGAAAGGCCTTATGCATGTTCCTCAAATAAGCAAGAATTTGATTAGTATATCAAGGCTAACTATGGATA
ACTCTGTCATTGTTGAGTTTTGTGATTCTTGTTGTGCTGTTAAGGACAAGGAAACAAGGAAGGTTCTTCTGGAAGGGGCGCTTAAGGATGGTCTTTATCGAATACCGAGT
ATCAAGTCTGCCTCATGCCCAGGATTACAAGAAGACATCAAGAAATACTCATGGATTTTTCCTCTGAAACTGAAAAGTGATGCGGTCACAGTATTTGGACAATTTAAAAG
TCTGGTAGAGAACATTTACAGCAGCAAAATTAAGGTTATTAGGTGTGATGAAGGGGGAGAATATAAACCTATAATTCACATAGCTTCTCAATGTGGAATTCAAGTACAAC
TAGCTTGCCCCTACACATCTGCACAAAATGGGAGAGTAGAAAGAAAACACCGACATGTGGTAGAAACCAGACTTGCATTACTTGCTCATGCTAAAATGCCATTACAATAT
TGGTGGGAGGCTTTTCATACTTCAGTGTACTTAATAAACATAATGCCAACTGAAACAATTGGAGGAAAAGTGCAATTTACCGTGTTGAATAAAGAAAAACCAGATTATAA
TAGCTTAAGAATATTTGGTGTAGCTTGCTACCCCTGTCTCAGACAGTACCAGCATCATAAGTTCGATTTTCACACTACAAAGTGTGTGTTCATAGGGTATAGCAGCAACC
ACAAAGTGGCTGCCTCCTCAATTAATTCCCCAACAAGTACCACTACTTCCTCCTCTGATGGAAGATCACATGAATCTTCATTGGCAACACCAAGGTTACTTGACTATCCA
TCACCTACCAATGTGTCCATTAATGAAGAAACCATGGATGCCATCAATTCTTCAGCCACTAGACCATCTGTTATTAATTCTTCAACCACCAGACCATCTACCAGCATTCA
TCCTATGGTTACTAGGGACAAAAGTGGTGTTACTAAACCCAAGAAATTTTTTGGTTGCTACTCTCAAGCTCAAACTTCTATGGATTGGTCGTGTAATGAACCAACTTCCT
ATATAGATGCACTTAAAGTTCCATCTTGGCAGAGAGCAATGAAGGAAGAATTTACAACCTTGACAAAGAATCAAACATGGGATTTGGTCCCTCCGTCTTCTAACTGTCAT
TTGGTAGGTAACAAATGGATATTTAAACTCAAGCGGGATGCTCATGGAGCGATTGTGAGCTATAAAGCGAGGTTAGTTGCTCAAGGTTTCTCTCAAACGCTAGGGATTGA
TTTTCATGAGACATATACCCCGGTGATCAAACCAATGACCTTCCAAATTATTCTCACTCTTGCTCCTACTTGGGATTGGTCAGTACGTCAGTTAGATGTAAGCAACACTT
TCCTTAATGGCCCTCTTAATGAAATAATATTTATGCCACAACCTAAAGATTTTGAAGACCCTAAAAAACCACATTATATATGTCGGCTTCGTAAAGCACTTTATGGGCTA
AAGCAGGTTCCTCGTGCCTGTCAAAGGCAGACTCCTCCTTGTTCATTTTTTGCAAGAACAAAGAAATACTTCTTCTTGTGTATGTCGATGACATGCTTCTTACGGGTTCC
TCCACACACTTACTTGATCAGTTTGTTCAAAAGCTTCACAAGCAGATTGCTCTTTGAGATCTCGGAGCAGTCCACTATTTTAGGCATTCAGATTACACGGTCGAGTAAAG
GTTTTCATTTGTCTCAAGCCACGTATGTTCAAGACTTATTATCCCGCCTCCACCTAGAGCATCTCAAGTCTGCTCCCACCCCCATGACATTTTCATCCTCTATCACAGCT
TCATCCAAGGTGTTATTTAATGATCCTTCCTTGTATCGCAGCACTATTAGGTCTTTGCATTATGTTCTTCTTACTCGACCAGATATTGCTTTTGCTGTGAACTATCTTAG
TCAATTTGCTCAAGCACCTCAGCAAGCCCATTGGCTCTCTCTCAAACGTATTTTGAGGTATCTTCATGGCACTTTTAACCTTGGTCTCTCTCTCCAACCCACTGCCTCCA
TCTCGCTCACAGGGTACTCTGATGTTGATTGGGCTGGCTGCCCTATTGATCGCAAATCTGTGGGAGGATATTGTGTTTTTCTTGGATCCTCGCTTATCTCTTGGTCTTCT
AAAAAGCAACAAGTTGTTGCTCGCTTTAGTACTGAGTCTGAGTACCGCACACTTGCTCATGTTGCTTGTGATCTTGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAATGTCATGGAATGAAGCTCAATCTCAGCTGCTCATGTTTGAAAAGAGACTTGAACAACTACAAGCCATGAAGAGTTCTGTTGTTTCTATTGTCCAGCCTTCAGC
AAATTTAGCCTCCACAAATGTTCAACAATACAACACATTTCAACATCAGTCAAGGCACATGAACCAAAGTTATCAGAGGAAGAGGAAGAACCTCATGATCGAAACCAATC
TGTCAAGATCTTCTTCAAGTGGCCGTAATCAAAACAAGGGAGATAATCCATCAGCAATGATAACAACCCCAGAACACCTTCGTGACACAGCTTGGTATTTGGAAAGTGGT
GCTAGCAACCATTTAACATCAGATATGTCCAACTTGTCTCTCAAATCTGACTACACAGGTAATGAAATGATAACTGCGGGAGATGGTAATCAATTGCCTATTCATTATAT
TGGAAGTTCCTATATTAAATCTAGTGTTAAGAATCTTATTTTGAAAGGCCTTATGCATGTTCCTCAAATAAGCAAGAATTTGATTAGTATATCAAGGCTAACTATGGATA
ACTCTGTCATTGTTGAGTTTTGTGATTCTTGTTGTGCTGTTAAGGACAAGGAAACAAGGAAGGTTCTTCTGGAAGGGGCGCTTAAGGATGGTCTTTATCGAATACCGAGT
ATCAAGTCTGCCTCATGCCCAGGATTACAAGAAGACATCAAGAAATACTCATGGATTTTTCCTCTGAAACTGAAAAGTGATGCGGTCACAGTATTTGGACAATTTAAAAG
TCTGGTAGAGAACATTTACAGCAGCAAAATTAAGGTTATTAGGTGTGATGAAGGGGGAGAATATAAACCTATAATTCACATAGCTTCTCAATGTGGAATTCAAGTACAAC
TAGCTTGCCCCTACACATCTGCACAAAATGGGAGAGTAGAAAGAAAACACCGACATGTGGTAGAAACCAGACTTGCATTACTTGCTCATGCTAAAATGCCATTACAATAT
TGGTGGGAGGCTTTTCATACTTCAGTGTACTTAATAAACATAATGCCAACTGAAACAATTGGAGGAAAAGTGCAATTTACCGTGTTGAATAAAGAAAAACCAGATTATAA
TAGCTTAAGAATATTTGGTGTAGCTTGCTACCCCTGTCTCAGACAGTACCAGCATCATAAGTTCGATTTTCACACTACAAAGTGTGTGTTCATAGGGTATAGCAGCAACC
ACAAAGTGGCTGCCTCCTCAATTAATTCCCCAACAAGTACCACTACTTCCTCCTCTGATGGAAGATCACATGAATCTTCATTGGCAACACCAAGGTTACTTGACTATCCA
TCACCTACCAATGTGTCCATTAATGAAGAAACCATGGATGCCATCAATTCTTCAGCCACTAGACCATCTGTTATTAATTCTTCAACCACCAGACCATCTACCAGCATTCA
TCCTATGGTTACTAGGGACAAAAGTGGTGTTACTAAACCCAAGAAATTTTTTGGTTGCTACTCTCAAGCTCAAACTTCTATGGATTGGTCGTGTAATGAACCAACTTCCT
ATATAGATGCACTTAAAGTTCCATCTTGGCAGAGAGCAATGAAGGAAGAATTTACAACCTTGACAAAGAATCAAACATGGGATTTGGTCCCTCCGTCTTCTAACTGTCAT
TTGGTAGGTAACAAATGGATATTTAAACTCAAGCGGGATGCTCATGGAGCGATTGTGAGCTATAAAGCGAGGTTAGTTGCTCAAGGTTTCTCTCAAACGCTAGGGATTGA
TTTTCATGAGACATATACCCCGGTGATCAAACCAATGACCTTCCAAATTATTCTCACTCTTGCTCCTACTTGGGATTGGTCAGTACGTCAGTTAGATGTAAGCAACACTT
TCCTTAATGGCCCTCTTAATGAAATAATATTTATGCCACAACCTAAAGATTTTGAAGACCCTAAAAAACCACATTATATATGTCGGCTTCGTAAAGCACTTTATGGGCTA
AAGCAGGTTCCTCGTGCCTGTCAAAGGCAGACTCCTCCTTGTTCATTTTTTGCAAGAACAAAGAAATACTTCTTCTTGTGTATGTCGATGACATGCTTCTTACGGGTTCC
TCCACACACTTACTTGATCAGTTTGTTCAAAAGCTTCACAAGCAGATTGCTCTTTGAGATCTCGGAGCAGTCCACTATTTTAGGCATTCAGATTACACGGTCGAGTAAAG
GTTTTCATTTGTCTCAAGCCACGTATGTTCAAGACTTATTATCCCGCCTCCACCTAGAGCATCTCAAGTCTGCTCCCACCCCCATGACATTTTCATCCTCTATCACAGCT
TCATCCAAGGTGTTATTTAATGATCCTTCCTTGTATCGCAGCACTATTAGGTCTTTGCATTATGTTCTTCTTACTCGACCAGATATTGCTTTTGCTGTGAACTATCTTAG
TCAATTTGCTCAAGCACCTCAGCAAGCCCATTGGCTCTCTCTCAAACGTATTTTGAGGTATCTTCATGGCACTTTTAACCTTGGTCTCTCTCTCCAACCCACTGCCTCCA
TCTCGCTCACAGGGTACTCTGATGTTGATTGGGCTGGCTGCCCTATTGATCGCAAATCTGTGGGAGGATATTGTGTTTTTCTTGGATCCTCGCTTATCTCTTGGTCTTCT
AAAAAGCAACAAGTTGTTGCTCGCTTTAGTACTGAGTCTGAGTACCGCACACTTGCTCATGTTGCTTGTGATCTTGCTTAG
Protein sequenceShow/hide protein sequence
MAMSWNEAQSQLLMFEKRLEQLQAMKSSVVSIVQPSANLASTNVQQYNTFQHQSRHMNQSYQRKRKNLMIETNLSRSSSSGRNQNKGDNPSAMITTPEHLRDTAWYLESG
ASNHLTSDMSNLSLKSDYTGNEMITAGDGNQLPIHYIGSSYIKSSVKNLILKGLMHVPQISKNLISISRLTMDNSVIVEFCDSCCAVKDKETRKVLLEGALKDGLYRIPS
IKSASCPGLQEDIKKYSWIFPLKLKSDAVTVFGQFKSLVENIYSSKIKVIRCDEGGEYKPIIHIASQCGIQVQLACPYTSAQNGRVERKHRHVVETRLALLAHAKMPLQY
WWEAFHTSVYLINIMPTETIGGKVQFTVLNKEKPDYNSLRIFGVACYPCLRQYQHHKFDFHTTKCVFIGYSSNHKVAASSINSPTSTTTSSSDGRSHESSLATPRLLDYP
SPTNVSINEETMDAINSSATRPSVINSSTTRPSTSIHPMVTRDKSGVTKPKKFFGCYSQAQTSMDWSCNEPTSYIDALKVPSWQRAMKEEFTTLTKNQTWDLVPPSSNCH
LVGNKWIFKLKRDAHGAIVSYKARLVAQGFSQTLGIDFHETYTPVIKPMTFQIILTLAPTWDWSVRQLDVSNTFLNGPLNEIIFMPQPKDFEDPKKPHYICRLRKALYGL
KQVPRACQRQTPPCSFFARTKKYFFLCMSMTCFLRVPPHTYLISLFKSFTSRLLFEISEQSTILGIQITRSSKGFHLSQATYVQDLLSRLHLEHLKSAPTPMTFSSSITA
SSKVLFNDPSLYRSTIRSLHYVLLTRPDIAFAVNYLSQFAQAPQQAHWLSLKRILRYLHGTFNLGLSLQPTASISLTGYSDVDWAGCPIDRKSVGGYCVFLGSSLISWSS
KKQQVVARFSTESEYRTLAHVACDLA