; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001124 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001124
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:24825151..24828444
RNA-Seq ExpressionLag0001124
SyntenyLag0001124
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN74695.1 hypothetical protein VITISV_024648 [Vitis vinifera]3.5e-19639.69Show/hide
Query:  KLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIWKCLQI-FNLQN
        KL+N NFL  + QI   L GH+L+ F+ E    P + +    D     +NP +  W++Q +LI S LL S++  +L ++++C TS  +WK L++ F  Q 
Subjt:  KLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIWKCLQI-FNLQN

Query:  RAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAK------
        RA++ + K++L   +KG+LS+++Y  +I++ VD LA +G ++  +DHI  I  GL  +YE+ I  + ++ +   V+E   LLL  E++IE   K      
Subjt:  RAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAK------

Query:  -------TVNNDGSVPSANLMTQSNHTPDPDPQKQNN--QYTSSNYNNRGRGGGRSNRGGRSWNNNRNKPQCQYCFKFEHTAIKCYALAGRFPQQQSGNS
               T N +GS P  N    + ++    P    N  Q+   N+  +GR  GR  RG  SW  N NKPQCQ C +  H  ++CY    RF Q  +G S
Subjt:  -------TVNNDGSVPSANLMTQSNHTPDPDPQKQNN--QYTSSNYNNRGRGGGRSNRGGRSWNNNRNKPQCQYCFKFEHTAIKCYALAGRFPQQQSGNS

Query:  TF---HPQFGAGAFLPNQQMQEML---------TASDMNQDNGWYPDS-------------------------------GLSISHSGNSSFNSS---NRL
              PQ G  A L +QQ+ E           T +++ QDN WYPDS                               GL I H G++SF+SS   ++ 
Subjt:  TF---HPQFGAGAFLPNQQMQEML---------TASDMNQDNGWYPDS-------------------------------GLSISHSGNSSFNSS---NRL

Query:  FHLKNLLHVPQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNLSILPSTSTRNIPAIN-KCLSSQNHSSVLTVNTIEKPSV
          LK LLHVP+ITKNL+SVS+FA DN VFFEFHP   +VKD  T   L+ G L  GLY F+ + L       +P  N  C +S    S          S 
Subjt:  FHLKNLLHVPQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNLSILPSTSTRNIPAIN-KCLSSQNHSSVLTVNTIEKPSV

Query:  VDIWHRRLGHPASFVVQTIVKLCNAVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWDRA-----------------------------
          +WH RLGHP+S +V  ++  CN      I    C+ C +GK H   F  S S YTKPL+LI  D+W  A                             
Subjt:  VDIWHRRLGHPASFVVQTIVKLCNAVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWDRA-----------------------------

Query:  ---------------------------------------YITSRNGF--KVTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISN
                                               Y+TS NG   +++CPYT +QNG+AERKHR IV+ G+ LL+QASLPF +WD+AF T+ Y+ N
Subjt:  ---------------------------------------YITSRNGF--KVTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISN

Query:  RLPSTVLNGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLSSDGRLYVLHNVTFDELSFPYKQ---QSTPLPS
        RLP+ VL    PL+ LFH  P+Y+ LK+FGC CYP LRP+N HK+ FRSIPCTFLGYS  HKGYKCLS +G + +  +V FDE +FP+ Q   Q     S
Subjt:  RLPSTVLNGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLSSDGRLYVLHNVTFDELSFPYKQ---QSTPLPS

Query:  ISPSVTQNVPILSSTPMDTQTSSHESSTDVPLSPAPAPTIASPAEQPISTASVDSSSHDTSSNASQSMVPTVVNNHPMVTRGKHGIFKPKALMVSYDNTE
         S S T ++P  +S P+    SS   ST  P +P+  P                +S+H+ +S    S  P   ++H M+TR K GIFKPKA ++S   T 
Subjt:  ISPSVTQNVPILSSTPMDTQTSSHESSTDVPLSPAPAPTIASPAEQPISTASVDSSSHDTSSNASQSMVPTVVNNHPMVTRGKHGIFKPKALMVSYDNTE

Query:  PPSAKEAFKHPAWRKAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALA
        P S  EA +   W++AM  EY AL+ N TW LVP P ++K++GCKWVFK+K N DG+I +YKARLVAKGFHQ    D++ET+SP+VKP TIR++LT+AL 
Subjt:  PPSAKEAFKHPAWRKAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALA

Query:  NKWSLRQVDINNAFLHGLLSENVYMEQPGGFECVSGFTP-LVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDI
         +W +RQ+D+NNAFL+G L E+++M QP GF  +    P  VCKL K+LYGLKQA RAWFE+L   L  +GF ++K D SL    + T   YILVYVDDI
Subjt:  NKWSLRQVDINNAFLHGLLSENVYMEQPGGFECVSGFTP-LVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDI

Query:  V
        +
Subjt:  V

CAN81099.1 hypothetical protein VITISV_017741 [Vitis vinifera]2.1e-19337.57Show/hide
Query:  NKISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIWKCLQI
        N   +VKL+N NFL WK QI  A+ G+ L+ F+  D E      P      +  L          +  I+   LG  S+  L Q          +   Q 
Subjt:  NKISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIWKCLQI

Query:  FNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAKT
        F  Q RA+  + K++LQ  +KG  +++EY ++IK CVD+LA++G  +  +DH+  IL GL ++YES ++ +  + +D  V+E   LL+ HE+++E    +
Subjt:  FNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAKT

Query:  VNNDGS--VPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNR-GRGG-------------------GRSNRGG-------------RSWN--NNRNKPQCQ
        +++  S  V S+N + + N     D    N+Q + S YN   GRGG                   GRSNRGG               WN  N   KP CQ
Subjt:  VNNDGS--VPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNR-GRGG-------------------GRSNRGG-------------RSWN--NNRNKPQCQ

Query:  YCFKFEHTAIKC-YALAGRFPQQQSGNSTFHPQFGAGAFLPNQQMQEMLTASDMNQDNGWYPDS-------------------------------GLSIS
         C K  H   +C Y     F   Q+ +S         +F P  Q+  ++  S++  D+ WYPDS                               GLSI 
Subjt:  YCFKFEHTAIKC-YALAGRFPQQQSGNSTFHPQFGAGAFLPNQQMQEMLTASDMNQDNGWYPDS-------------------------------GLSIS

Query:  HSGNSSFNS--SNRLFHLKNLLHVPQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNLSILPSTSTRNIPAINKCLSSQNH
        H G S F S  S++   L +LLHVP ITKNL+SVS+FA+DN VFFEFH +  +VKD++T   L+ G + +GLY F+ S L    T+++      ++S   
Subjt:  HSGNSSFNS--SNRLFHLKNLLHVPQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNLSILPSTSTRNIPAINKCLSSQNH

Query:  SSVLTVNTIEKPSVVDIWHRRLGHPASFVVQTIVKLCNAVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWDRAYITSRNGF-------
        S V T +     S  D+WH+RLGHP++  ++ ++  CN      +  +FC++C +GK H  PF  S++ YTKPL+LI  D+W    + S +G+       
Subjt:  SSVLTVNTIEKPSVVDIWHRRLGHPASFVVQTIVKLCNAVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWDRAYITSRNGF-------

Query:  --------------------------------------------------------------KVTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIF
                                                                      +V+CP+T QQNG+AERKHR IV+ GL LL  ASLP  F
Subjt:  --------------------------------------------------------------KVTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIF

Query:  WDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLSSDGRLYVLHNVTFDELSFP
        WD++F T  Y+SNRLP+ +L+  CP++ LF   P+Y+FLK+FGC C+P LRPYN+HK+ +RS  CTFLGYS +HKGYKC+SS+GR+Y+ H+V F+E SFP
Subjt:  WDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLSSDGRLYVLHNVTFDELSFP

Query:  YKQ----QSTPLPSISPSVTQ-----NVPILSSTPMDTQTSSHESSTDV----------PLSPAPAPTIASPAEQPISTASVDSSSHDTSSNASQSMVPT
        Y +     S  L ++SPS +      + P+LS T + T TS   S+  +          P +P  A T  +PA Q +S        H  SS A  S+  T
Subjt:  YKQ----QSTPLPSISPSVTQ-----NVPILSSTPMDTQTSSHESSTDV----------PLSPAPAPTIASPAEQPISTASVDSSSHDTSSNASQSMVPT

Query:  VV----NNHPMVTRGKHGIFKPKALMVSYDNTEPPSAKEAFKHPAWRKAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVA
        +     N HPM+TR K GI KPK  + +    EP S   A +   W+KAM +EYDAL  N TW+LVPLP  ++ +GCKWV+K K N DG++ +YKARLVA
Subjt:  VV----NNHPMVTRGKHGIFKPKALMVSYDNTEPPSAKEAFKHPAWRKAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVA

Query:  KGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGLLSENVYMEQPGGFECVSGFTP-LVCKLRKALYGLKQALRAWFERLSAFL
        KGFHQ    D++ET+SP+VKP T+RV+ T+AL+  W+++Q+D+NNAFL+G L E V+M+QP GF  +    P LVC+L KALYGLKQA RAWFE+L   L
Subjt:  KGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGLLSENVYMEQPGGFECVSGFTP-LVCKLRKALYGLKQALRAWFERLSAFL

Query:  HSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV
         S GF ++K D SL  + +   + Y+LVYVDDI+
Subjt:  HSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV

KYP46257.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]2.2e-19038.22Show/hide
Query:  KLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIW-KCLQIFNLQN
        KL+  N+L W  Q+E  ++GH L  +++      P+      D +   ++ +Y  W++QD+L+ SWL  SMS D+L +VI CK+S  +W K    F+   
Subjt:  KLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIW-KCLQIFNLQN

Query:  RAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAKTVNNDG
         A+  +L+++L++    NLS++EY  +I+  VDAL AIG  V  ++H+  IL GL  EYES +S+I+++ + + + E  TLLL HE++++   K      
Subjt:  RAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAKTVNNDG

Query:  SVPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNRG-----RGGGRSNRGGR--SW-NNNRNK---PQCQYCFKFEHTAIKCY------------ALAGRF
        +  S N+ T      +PDP   N Q   ++ NN+      RGG  ++RGGR  +W    R +    QCQ C ++ H A  CY              A  +
Subjt:  SVPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNRG-----RGGGRSNRGGR--SW-NNNRNK---PQCQYCFKFEHTAIKCY------------ALAGRF

Query:  PQQQ------SGNSTFHPQFGAGAFLPN-----QQMQEMLTASDMNQDNGWYPDSGLSISHSGNSSFNS---SNRLFHLKNLLHVPQITKNLISVSQFAR
        P         + N+ ++P  GA   + N      Q         ++  NG     GL I+ +G ++F+S       F L NLL VP ITKNLISVSQF++
Subjt:  PQQQ------SGNSTFHPQFGAGAFLPN-----QQMQEMLTASDMNQDNGWYPDSGLSISHSGNSSFNS---SNRLFHLKNLLHVPQITKNLISVSQFAR

Query:  DNSVFFEFHPEFFYVKDRLTSQTLLQGPL-HEGLYRFNLSILPSTSTRNIPAINKCLSSQNHSSVLTVNT-IEKPSVVDIWHRRLGHPASFVVQTIVKLC
        DNSV+FEFHP    VK + T++ LLQG +  +GLY+F  ++LP+ ST+++         Q  + + +VNT    P     WH RLGHP   V++ + +  
Subjt:  DNSVFFEFHPEFFYVKDRLTSQTLLQGPL-HEGLYRFNLSILPSTSTRNIPAINKCLSSQNHSSVLTVNT-IEKPSVVDIWHRRLGHPASFVVQTIVKLC

Query:  NAVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWDRAYITSRNGF--------------------------------------------
        N       K +FC  C +GKSH LP   S ++Y  P  L+ +D+W  A +TS  G+                                            
Subjt:  NAVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWDRAYITSRNGF--------------------------------------------

Query:  -------------------------KVTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPNYT
                                 ++ CP+T  QNG+AERKHR IV++GL L++QA LP  FWD +F TA Y+ NRLPS+ +  + P  +LFH  P+Y 
Subjt:  -------------------------KVTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPNYT

Query:  FLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLSSDGRLYVLHNVTFDELSFPYKQQSTPLPSISPSVTQNVPILSSTPMDTQTSSHESS
         L+IFGC C+P LRPYN HK+ FRS  C FLGYS  HKGYKCL++DGRLY+  +V F+E  FPYK   +   +    +  +VP+   T   T T +H + 
Subjt:  FLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLSSDGRLYVLHNVTFDELSFPYKQQSTPLPSISPSVTQNVPILSSTPMDTQTSSHESS

Query:  TDVPLSPAPAPTIASPAEQPISTASVDSS--SHDTSSNASQSMVPTV--VNNHPMVTRGKHGIFKPKAL-MVSYDNTEPPSAKEAFKHPAWRKAMESEYD
           P  P  +P++ S +  P + A    S  S  TSSN   S  P +   N HPM TR K GI KP+    +     EP + K+A  +P W  AM++EY+
Subjt:  TDVPLSPAPAPTIASPAEQPISTASVDSS--SHDTSSNASQSMVPTV--VNNHPMVTRGKHGIFKPKAL-MVSYDNTEPPSAKEAFKHPAWRKAMESEYD

Query:  ALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGLLSEN
        AL+ N TW+LVPLP ++  +GCKWVF++K N +GS+ +YKARLVAKGF+Q    DY ET+SP++KPVT+R++LTLAL ++W ++Q+D+NNAFL+G L E 
Subjt:  ALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGLLSEN

Query:  VYMEQPGGFECVSGFTPLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV
        VYM+QP GFE  +    LVCKL KA+YGLKQA RAWF++L + L  + F  SK DPSL        V YILVYVDDI+
Subjt:  VYMEQPGGFECVSGFTPLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV

RVW44519.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.8e-19337.56Show/hide
Query:  DASTHTEADSYEGNENSSNSIVNPGNKISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDP-NVTKLNPAYAKWKKQDRLISSWLL
        D S      +    + S   +++P +++ T++L +DNFL WK QIE A+ G+ LE F+    + PPK +   TD   V   NP +  +++QD L+ SWLL
Subjt:  DASTHTEADSYEGNENSSNSIVNPGNKISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDP-NVTKLNPAYAKWKKQDRLISSWLL

Query:  GSMSYDILEQVIHCKTSRAIWKCLQIFNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQ
         S+    L QV+ C ++                             +  L++ +Y +++K+  D LA  G ++ + DHI  I+ GLG EYES+I+VI+++
Subjt:  GSMSYDILEQVIHCKTSRAIWKCLQIFNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQ

Query:  TEDIGVQEAMTLLLTHENQIETKAKTVNNDGSVPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNRGRGGGRS--------NRG-GRSWNNNRNKPQCQYC
             +Q   + L+ HE +I  K    +ND SV   N  +Q ++   P     +N Y SS + NR + GG          NRG GR       KPQCQ C
Subjt:  TEDIGVQEAMTLLLTHENQIETKAKTVNNDGSVPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNRGRGGGRS--------NRG-GRSWNNNRNKPQCQYC

Query:  FKFEHTAIKC-YALAGRFPQQQSGNSTFHPQFGAGA--------------------FLPNQ---QMQEMLTASDMNQDNGWYPDS---------------
         KF HT  +C Y     F      N       G+GA                       NQ   +M+ M+   +  Q+  W+PDS               
Subjt:  FKFEHTAIKC-YALAGRFPQQQSGNSTFHPQFGAGA--------------------FLPNQ---QMQEMLTASDMNQDNGWYPDS---------------

Query:  ----------------GLSISHSGNSSFNSS---NRLFHLKNLLHVPQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNLS
                        GL ISH G S F SS   N++  LKN+L VP I KNL+SVSQFARDN+V+FEFHP+  +VKD+     LLQG LH+GLY+FNLS
Subjt:  ----------------GLSISHSGNSSFNSS---NRLFHLKNLLHVPQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNLS

Query:  ---------ILPSTSTRNIPAINKCLSSQNHSSVLTVNTIEKPSVVDIWHRRLGHPASFVVQTIVKLCNAVKSPYIKHSFCNACAVGKSHALPFQTSNSV
                 +  S     +   N  L   N +S     T     V D+WH+RLGHPAS +V  ++       S     S C+AC +GKSH LPF  S +V
Subjt:  ---------ILPSTSTRNIPAINKCLSSQNHSSVLTVNTIEKPSVVDIWHRRLGHPASFVVQTIVKLCNAVKSPYIKHSFCNACAVGKSHALPFQTSNSV

Query:  YTKPLQLIVADVWDRAYITSRNGF---------------------------------------------------------------------KVTCPYT
        YTKPLQL+V+D+W  A I S  GF                                                                     +++CP+T
Subjt:  YTKPLQLIVADVWDRAYITSRNGF---------------------------------------------------------------------KVTCPYT

Query:  SQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLG
        S+QNGI ERKHR IV++GL LL+QASLP  +W DAF+TA ++ NRLP+ VL   CP + LF+  PNY+ LK+FGC C+P LRPYN HK+ FRS PCTFLG
Subjt:  SQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLG

Query:  YSNQHKGYKCLSSDGRLYVLHNVTFDELSFPYKQQ-STPLPSISPS--------VTQNVPILS-----STPMDTQTSSH--------------------E
        YS++HKGYKCL+  GR+++  +V FDE  FP+  +   P+  +S S        + +N+  LS     S P  +  SSH                    +
Subjt:  YSNQHKGYKCLSSDGRLYVLHNVTFDELSFPYKQQ-STPLPSISPS--------VTQNVPILS-----STPMDTQTSSH--------------------E

Query:  SSTDVPL------SPAPAPTIASPAEQPISTASVDSSSHDTSSNASQSMVPTVVNNHPMVTRGKHGIFKPKALMVSYDNTEPPSAKEAFKHPAWRKAMES
        SS+ VP+       P+ +   A P   P+ST     +S + + + +   V      H MVTR K+GIFKPK   V  +  EP + +EA  HP W++AM+ 
Subjt:  SSTDVPL------SPAPAPTIASPAEQPISTASVDSSSHDTSSNASQSMVPTVVNNHPMVTRGKHGIFKPKALMVSYDNTEPPSAKEAFKHPAWRKAMES

Query:  EYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGLL
        E+ ALM N+TW+LV LP N+  VGC+WVFK+KRN DGS++RYKARLVAKG+ Q P  D+ ET+SP+VKP TIRV+L +A++  W +RQ+D+NNAFL+G L
Subjt:  EYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGLL

Query:  SENVYMEQPGGFE-CVSGFTPLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV
         E VYM+QP GF+   +    LVCKL KALYGLKQA RAWF++L   L   GF ++K D SL  + +     ++LVYVDDIV
Subjt:  SENVYMEQPGGFE-CVSGFTPLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.3e-20338.46Show/hide
Query:  DASTHTEADSYEGNENSSNSIVNPGNKISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDP-NVTKLNPAYAKWKKQDRLISSWLL
        D S      +    + S   +++P +++ T++L +DNFL WK QIE A+ G+ LE F+    + PPK +   TD   V   NP +  +++QD L+ SWLL
Subjt:  DASTHTEADSYEGNENSSNSIVNPGNKISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDP-NVTKLNPAYAKWKKQDRLISSWLL

Query:  GSMSYDILEQVIHCKTSRAIWKCL-QIFNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITA
         S+    L QV+ C ++  +W  + Q FN Q+ A++M  KS++Q ++K  L++ +Y +++K+  D LA  G ++ + DHI  I+ GLG EYES+I+VI++
Subjt:  GSMSYDILEQVIHCKTSRAIWKCL-QIFNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITA

Query:  QTEDIGVQEAMTLLLTHENQIETKAKTVNNDGSVPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNRGRGGGRS--------NRG-GRSWNNNRNKPQCQY
        +     +Q   + L+ HE +I  K    +ND SV   N  +Q ++   P     +N Y SS + NR + GG          NRG GR       KPQCQ 
Subjt:  QTEDIGVQEAMTLLLTHENQIETKAKTVNNDGSVPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNRGRGGGRS--------NRG-GRSWNNNRNKPQCQY

Query:  CFKFEHTAIKC-YALAGRFPQQQSGNSTFHPQFGAGA--------------------FLPNQ---QMQEMLTASDMNQDNGWYPDS--------------
        C KF HT  +C Y     F      N       G+GA                       NQ   +M+ M+   +  Q+  W+PDS              
Subjt:  CFKFEHTAIKC-YALAGRFPQQQSGNSTFHPQFGAGA--------------------FLPNQ---QMQEMLTASDMNQDNGWYPDS--------------

Query:  -----------------GLSISHSGNSSFNSS---NRLFHLKNLLHVPQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNL
                         GL ISH G S F SS   N++  LKN+L VP I KNL+SVSQFARDN+V+FEFHP+  +VKD+     LLQG LH+GLY+FNL
Subjt:  -----------------GLSISHSGNSSFNSS---NRLFHLKNLLHVPQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNL

Query:  S---------ILPSTSTRNIPAINKCLSSQNHSSVLTVNTIEKPSVVDIWHRRLGHPASFVVQTIVKLCNAVKSPYIKHSFCNACAVGKSHALPFQTSNS
        S         +  S     +   N  L   N +S     T     V D+WH+RLGHPAS +V  ++       S     S C+AC +GKSH LPF  S +
Subjt:  S---------ILPSTSTRNIPAINKCLSSQNHSSVLTVNTIEKPSVVDIWHRRLGHPASFVVQTIVKLCNAVKSPYIKHSFCNACAVGKSHALPFQTSNS

Query:  VYTKPLQLIVADVWDRAYITSRNGF---------------------------------------------------------------------KVTCPY
        VYTKPLQL+V+D+W  A I S  GF                                                                     +++CP+
Subjt:  VYTKPLQLIVADVWDRAYITSRNGF---------------------------------------------------------------------KVTCPY

Query:  TSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFL
        TS+QNGI ERKHR IV++GL LL+QASLP  +W DAF+TA ++ NRLP+ VL   CP + LF+  PNY+ LK+FGC C+P LRPYN HK+ FRS PCTFL
Subjt:  TSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFL

Query:  GYSNQHKGYKCLSSDGRLYVLHNVTFDELSFPYKQQ-STPLPSISPS--------VTQNVPILS-----STPMDTQTSSH--------------------
        GYS++HKGYKCL+  GR+++  +V FDE  FP+  +   P+  +S S        + +N+  LS     S P  +  SSH                    
Subjt:  GYSNQHKGYKCLSSDGRLYVLHNVTFDELSFPYKQQ-STPLPSISPS--------VTQNVPILS-----STPMDTQTSSH--------------------

Query:  ESSTDVPL------SPAPAPTIASPAEQPISTASVDSSSHDTSSNASQSMVPTVVNNHPMVTRGKHGIFKPKALMVSYDNTEPPSAKEAFKHPAWRKAME
        +SS+ VP+       P+ +   A P   P+ST     +S + + + +   V      H MVTR K+GIFKPK   V  +  EP + +EA  HP W++AM+
Subjt:  ESSTDVPL------SPAPAPTIASPAEQPISTASVDSSSHDTSSNASQSMVPTVVNNHPMVTRGKHGIFKPKALMVSYDNTEPPSAKEAFKHPAWRKAME

Query:  SEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGL
         E+ ALM N+TW+LV LP N+  VGC+WVFK+KRN DGS++RYKARLVAKG+ Q P  D+ ET+SP+VKP TIRV+L +A++  W +RQ+D+NNAFL+G 
Subjt:  SEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGL

Query:  LSENVYMEQPGGFE-CVSGFTPLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV
        L E VYM+QP GF+   +    LVCKL KALYGLKQA RAWF++L   L   GF ++K D SL  + +     ++LVYVDDIV
Subjt:  LSENVYMEQPGGFE-CVSGFTPLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV

TrEMBL top hitse value%identityAlignment
A0A151RUP0 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-19038.22Show/hide
Query:  KLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIW-KCLQIFNLQN
        KL+  N+L W  Q+E  ++GH L  +++      P+      D +   ++ +Y  W++QD+L+ SWL  SMS D+L +VI CK+S  +W K    F+   
Subjt:  KLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIW-KCLQIFNLQN

Query:  RAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAKTVNNDG
         A+  +L+++L++    NLS++EY  +I+  VDAL AIG  V  ++H+  IL GL  EYES +S+I+++ + + + E  TLLL HE++++   K      
Subjt:  RAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAKTVNNDG

Query:  SVPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNRG-----RGGGRSNRGGR--SW-NNNRNK---PQCQYCFKFEHTAIKCY------------ALAGRF
        +  S N+ T      +PDP   N Q   ++ NN+      RGG  ++RGGR  +W    R +    QCQ C ++ H A  CY              A  +
Subjt:  SVPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNRG-----RGGGRSNRGGR--SW-NNNRNK---PQCQYCFKFEHTAIKCY------------ALAGRF

Query:  PQQQ------SGNSTFHPQFGAGAFLPN-----QQMQEMLTASDMNQDNGWYPDSGLSISHSGNSSFNS---SNRLFHLKNLLHVPQITKNLISVSQFAR
        P         + N+ ++P  GA   + N      Q         ++  NG     GL I+ +G ++F+S       F L NLL VP ITKNLISVSQF++
Subjt:  PQQQ------SGNSTFHPQFGAGAFLPN-----QQMQEMLTASDMNQDNGWYPDSGLSISHSGNSSFNS---SNRLFHLKNLLHVPQITKNLISVSQFAR

Query:  DNSVFFEFHPEFFYVKDRLTSQTLLQGPL-HEGLYRFNLSILPSTSTRNIPAINKCLSSQNHSSVLTVNT-IEKPSVVDIWHRRLGHPASFVVQTIVKLC
        DNSV+FEFHP    VK + T++ LLQG +  +GLY+F  ++LP+ ST+++         Q  + + +VNT    P     WH RLGHP   V++ + +  
Subjt:  DNSVFFEFHPEFFYVKDRLTSQTLLQGPL-HEGLYRFNLSILPSTSTRNIPAINKCLSSQNHSSVLTVNT-IEKPSVVDIWHRRLGHPASFVVQTIVKLC

Query:  NAVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWDRAYITSRNGF--------------------------------------------
        N       K +FC  C +GKSH LP   S ++Y  P  L+ +D+W  A +TS  G+                                            
Subjt:  NAVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWDRAYITSRNGF--------------------------------------------

Query:  -------------------------KVTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPNYT
                                 ++ CP+T  QNG+AERKHR IV++GL L++QA LP  FWD +F TA Y+ NRLPS+ +  + P  +LFH  P+Y 
Subjt:  -------------------------KVTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPNYT

Query:  FLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLSSDGRLYVLHNVTFDELSFPYKQQSTPLPSISPSVTQNVPILSSTPMDTQTSSHESS
         L+IFGC C+P LRPYN HK+ FRS  C FLGYS  HKGYKCL++DGRLY+  +V F+E  FPYK   +   +    +  +VP+   T   T T +H + 
Subjt:  FLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLSSDGRLYVLHNVTFDELSFPYKQQSTPLPSISPSVTQNVPILSSTPMDTQTSSHESS

Query:  TDVPLSPAPAPTIASPAEQPISTASVDSS--SHDTSSNASQSMVPTV--VNNHPMVTRGKHGIFKPKAL-MVSYDNTEPPSAKEAFKHPAWRKAMESEYD
           P  P  +P++ S +  P + A    S  S  TSSN   S  P +   N HPM TR K GI KP+    +     EP + K+A  +P W  AM++EY+
Subjt:  TDVPLSPAPAPTIASPAEQPISTASVDSS--SHDTSSNASQSMVPTV--VNNHPMVTRGKHGIFKPKAL-MVSYDNTEPPSAKEAFKHPAWRKAMESEYD

Query:  ALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGLLSEN
        AL+ N TW+LVPLP ++  +GCKWVF++K N +GS+ +YKARLVAKGF+Q    DY ET+SP++KPVT+R++LTLAL ++W ++Q+D+NNAFL+G L E 
Subjt:  ALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGLLSEN

Query:  VYMEQPGGFECVSGFTPLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV
        VYM+QP GFE  +    LVCKL KA+YGLKQA RAWF++L + L  + F  SK DPSL        V YILVYVDDI+
Subjt:  VYMEQPGGFECVSGFTPLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV

A0A438EA49 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-19337.56Show/hide
Query:  DASTHTEADSYEGNENSSNSIVNPGNKISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDP-NVTKLNPAYAKWKKQDRLISSWLL
        D S      +    + S   +++P +++ T++L +DNFL WK QIE A+ G+ LE F+    + PPK +   TD   V   NP +  +++QD L+ SWLL
Subjt:  DASTHTEADSYEGNENSSNSIVNPGNKISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDP-NVTKLNPAYAKWKKQDRLISSWLL

Query:  GSMSYDILEQVIHCKTSRAIWKCLQIFNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQ
         S+    L QV+ C ++                             +  L++ +Y +++K+  D LA  G ++ + DHI  I+ GLG EYES+I+VI+++
Subjt:  GSMSYDILEQVIHCKTSRAIWKCLQIFNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQ

Query:  TEDIGVQEAMTLLLTHENQIETKAKTVNNDGSVPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNRGRGGGRS--------NRG-GRSWNNNRNKPQCQYC
             +Q   + L+ HE +I  K    +ND SV   N  +Q ++   P     +N Y SS + NR + GG          NRG GR       KPQCQ C
Subjt:  TEDIGVQEAMTLLLTHENQIETKAKTVNNDGSVPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNRGRGGGRS--------NRG-GRSWNNNRNKPQCQYC

Query:  FKFEHTAIKC-YALAGRFPQQQSGNSTFHPQFGAGA--------------------FLPNQ---QMQEMLTASDMNQDNGWYPDS---------------
         KF HT  +C Y     F      N       G+GA                       NQ   +M+ M+   +  Q+  W+PDS               
Subjt:  FKFEHTAIKC-YALAGRFPQQQSGNSTFHPQFGAGA--------------------FLPNQ---QMQEMLTASDMNQDNGWYPDS---------------

Query:  ----------------GLSISHSGNSSFNSS---NRLFHLKNLLHVPQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNLS
                        GL ISH G S F SS   N++  LKN+L VP I KNL+SVSQFARDN+V+FEFHP+  +VKD+     LLQG LH+GLY+FNLS
Subjt:  ----------------GLSISHSGNSSFNSS---NRLFHLKNLLHVPQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNLS

Query:  ---------ILPSTSTRNIPAINKCLSSQNHSSVLTVNTIEKPSVVDIWHRRLGHPASFVVQTIVKLCNAVKSPYIKHSFCNACAVGKSHALPFQTSNSV
                 +  S     +   N  L   N +S     T     V D+WH+RLGHPAS +V  ++       S     S C+AC +GKSH LPF  S +V
Subjt:  ---------ILPSTSTRNIPAINKCLSSQNHSSVLTVNTIEKPSVVDIWHRRLGHPASFVVQTIVKLCNAVKSPYIKHSFCNACAVGKSHALPFQTSNSV

Query:  YTKPLQLIVADVWDRAYITSRNGF---------------------------------------------------------------------KVTCPYT
        YTKPLQL+V+D+W  A I S  GF                                                                     +++CP+T
Subjt:  YTKPLQLIVADVWDRAYITSRNGF---------------------------------------------------------------------KVTCPYT

Query:  SQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLG
        S+QNGI ERKHR IV++GL LL+QASLP  +W DAF+TA ++ NRLP+ VL   CP + LF+  PNY+ LK+FGC C+P LRPYN HK+ FRS PCTFLG
Subjt:  SQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLG

Query:  YSNQHKGYKCLSSDGRLYVLHNVTFDELSFPYKQQ-STPLPSISPS--------VTQNVPILS-----STPMDTQTSSH--------------------E
        YS++HKGYKCL+  GR+++  +V FDE  FP+  +   P+  +S S        + +N+  LS     S P  +  SSH                    +
Subjt:  YSNQHKGYKCLSSDGRLYVLHNVTFDELSFPYKQQ-STPLPSISPS--------VTQNVPILS-----STPMDTQTSSH--------------------E

Query:  SSTDVPL------SPAPAPTIASPAEQPISTASVDSSSHDTSSNASQSMVPTVVNNHPMVTRGKHGIFKPKALMVSYDNTEPPSAKEAFKHPAWRKAMES
        SS+ VP+       P+ +   A P   P+ST     +S + + + +   V      H MVTR K+GIFKPK   V  +  EP + +EA  HP W++AM+ 
Subjt:  SSTDVPL------SPAPAPTIASPAEQPISTASVDSSSHDTSSNASQSMVPTVVNNHPMVTRGKHGIFKPKALMVSYDNTEPPSAKEAFKHPAWRKAMES

Query:  EYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGLL
        E+ ALM N+TW+LV LP N+  VGC+WVFK+KRN DGS++RYKARLVAKG+ Q P  D+ ET+SP+VKP TIRV+L +A++  W +RQ+D+NNAFL+G L
Subjt:  EYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGLL

Query:  SENVYMEQPGGFE-CVSGFTPLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV
         E VYM+QP GF+   +    LVCKL KALYGLKQA RAWF++L   L   GF ++K D SL  + +     ++LVYVDDIV
Subjt:  SENVYMEQPGGFE-CVSGFTPLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV

A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-20338.46Show/hide
Query:  DASTHTEADSYEGNENSSNSIVNPGNKISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDP-NVTKLNPAYAKWKKQDRLISSWLL
        D S      +    + S   +++P +++ T++L +DNFL WK QIE A+ G+ LE F+    + PPK +   TD   V   NP +  +++QD L+ SWLL
Subjt:  DASTHTEADSYEGNENSSNSIVNPGNKISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDP-NVTKLNPAYAKWKKQDRLISSWLL

Query:  GSMSYDILEQVIHCKTSRAIWKCL-QIFNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITA
         S+    L QV+ C ++  +W  + Q FN Q+ A++M  KS++Q ++K  L++ +Y +++K+  D LA  G ++ + DHI  I+ GLG EYES+I+VI++
Subjt:  GSMSYDILEQVIHCKTSRAIWKCL-QIFNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITA

Query:  QTEDIGVQEAMTLLLTHENQIETKAKTVNNDGSVPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNRGRGGGRS--------NRG-GRSWNNNRNKPQCQY
        +     +Q   + L+ HE +I  K    +ND SV   N  +Q ++   P     +N Y SS + NR + GG          NRG GR       KPQCQ 
Subjt:  QTEDIGVQEAMTLLLTHENQIETKAKTVNNDGSVPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNRGRGGGRS--------NRG-GRSWNNNRNKPQCQY

Query:  CFKFEHTAIKC-YALAGRFPQQQSGNSTFHPQFGAGA--------------------FLPNQ---QMQEMLTASDMNQDNGWYPDS--------------
        C KF HT  +C Y     F      N       G+GA                       NQ   +M+ M+   +  Q+  W+PDS              
Subjt:  CFKFEHTAIKC-YALAGRFPQQQSGNSTFHPQFGAGA--------------------FLPNQ---QMQEMLTASDMNQDNGWYPDS--------------

Query:  -----------------GLSISHSGNSSFNSS---NRLFHLKNLLHVPQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNL
                         GL ISH G S F SS   N++  LKN+L VP I KNL+SVSQFARDN+V+FEFHP+  +VKD+     LLQG LH+GLY+FNL
Subjt:  -----------------GLSISHSGNSSFNSS---NRLFHLKNLLHVPQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNL

Query:  S---------ILPSTSTRNIPAINKCLSSQNHSSVLTVNTIEKPSVVDIWHRRLGHPASFVVQTIVKLCNAVKSPYIKHSFCNACAVGKSHALPFQTSNS
        S         +  S     +   N  L   N +S     T     V D+WH+RLGHPAS +V  ++       S     S C+AC +GKSH LPF  S +
Subjt:  S---------ILPSTSTRNIPAINKCLSSQNHSSVLTVNTIEKPSVVDIWHRRLGHPASFVVQTIVKLCNAVKSPYIKHSFCNACAVGKSHALPFQTSNS

Query:  VYTKPLQLIVADVWDRAYITSRNGF---------------------------------------------------------------------KVTCPY
        VYTKPLQL+V+D+W  A I S  GF                                                                     +++CP+
Subjt:  VYTKPLQLIVADVWDRAYITSRNGF---------------------------------------------------------------------KVTCPY

Query:  TSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFL
        TS+QNGI ERKHR IV++GL LL+QASLP  +W DAF+TA ++ NRLP+ VL   CP + LF+  PNY+ LK+FGC C+P LRPYN HK+ FRS PCTFL
Subjt:  TSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFL

Query:  GYSNQHKGYKCLSSDGRLYVLHNVTFDELSFPYKQQ-STPLPSISPS--------VTQNVPILS-----STPMDTQTSSH--------------------
        GYS++HKGYKCL+  GR+++  +V FDE  FP+  +   P+  +S S        + +N+  LS     S P  +  SSH                    
Subjt:  GYSNQHKGYKCLSSDGRLYVLHNVTFDELSFPYKQQ-STPLPSISPS--------VTQNVPILS-----STPMDTQTSSH--------------------

Query:  ESSTDVPL------SPAPAPTIASPAEQPISTASVDSSSHDTSSNASQSMVPTVVNNHPMVTRGKHGIFKPKALMVSYDNTEPPSAKEAFKHPAWRKAME
        +SS+ VP+       P+ +   A P   P+ST     +S + + + +   V      H MVTR K+GIFKPK   V  +  EP + +EA  HP W++AM+
Subjt:  ESSTDVPL------SPAPAPTIASPAEQPISTASVDSSSHDTSSNASQSMVPTVVNNHPMVTRGKHGIFKPKALMVSYDNTEPPSAKEAFKHPAWRKAME

Query:  SEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGL
         E+ ALM N+TW+LV LP N+  VGC+WVFK+KRN DGS++RYKARLVAKG+ Q P  D+ ET+SP+VKP TIRV+L +A++  W +RQ+D+NNAFL+G 
Subjt:  SEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGL

Query:  LSENVYMEQPGGFE-CVSGFTPLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV
        L E VYM+QP GF+   +    LVCKL KALYGLKQA RAWF++L   L   GF ++K D SL  + +     ++LVYVDDIV
Subjt:  LSENVYMEQPGGFE-CVSGFTPLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV

A5BFT3 Integrase catalytic domain-containing protein1.0e-19337.57Show/hide
Query:  NKISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIWKCLQI
        N   +VKL+N NFL WK QI  A+ G+ L+ F+  D E      P      +  L          +  I+   LG  S+  L Q          +   Q 
Subjt:  NKISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIWKCLQI

Query:  FNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAKT
        F  Q RA+  + K++LQ  +KG  +++EY ++IK CVD+LA++G  +  +DH+  IL GL ++YES ++ +  + +D  V+E   LL+ HE+++E    +
Subjt:  FNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAKT

Query:  VNNDGS--VPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNR-GRGG-------------------GRSNRGG-------------RSWN--NNRNKPQCQ
        +++  S  V S+N + + N     D    N+Q + S YN   GRGG                   GRSNRGG               WN  N   KP CQ
Subjt:  VNNDGS--VPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNR-GRGG-------------------GRSNRGG-------------RSWN--NNRNKPQCQ

Query:  YCFKFEHTAIKC-YALAGRFPQQQSGNSTFHPQFGAGAFLPNQQMQEMLTASDMNQDNGWYPDS-------------------------------GLSIS
         C K  H   +C Y     F   Q+ +S         +F P  Q+  ++  S++  D+ WYPDS                               GLSI 
Subjt:  YCFKFEHTAIKC-YALAGRFPQQQSGNSTFHPQFGAGAFLPNQQMQEMLTASDMNQDNGWYPDS-------------------------------GLSIS

Query:  HSGNSSFNS--SNRLFHLKNLLHVPQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNLSILPSTSTRNIPAINKCLSSQNH
        H G S F S  S++   L +LLHVP ITKNL+SVS+FA+DN VFFEFH +  +VKD++T   L+ G + +GLY F+ S L    T+++      ++S   
Subjt:  HSGNSSFNS--SNRLFHLKNLLHVPQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNLSILPSTSTRNIPAINKCLSSQNH

Query:  SSVLTVNTIEKPSVVDIWHRRLGHPASFVVQTIVKLCNAVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWDRAYITSRNGF-------
        S V T +     S  D+WH+RLGHP++  ++ ++  CN      +  +FC++C +GK H  PF  S++ YTKPL+LI  D+W    + S +G+       
Subjt:  SSVLTVNTIEKPSVVDIWHRRLGHPASFVVQTIVKLCNAVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWDRAYITSRNGF-------

Query:  --------------------------------------------------------------KVTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIF
                                                                      +V+CP+T QQNG+AERKHR IV+ GL LL  ASLP  F
Subjt:  --------------------------------------------------------------KVTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIF

Query:  WDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLSSDGRLYVLHNVTFDELSFP
        WD++F T  Y+SNRLP+ +L+  CP++ LF   P+Y+FLK+FGC C+P LRPYN+HK+ +RS  CTFLGYS +HKGYKC+SS+GR+Y+ H+V F+E SFP
Subjt:  WDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLSSDGRLYVLHNVTFDELSFP

Query:  YKQ----QSTPLPSISPSVTQ-----NVPILSSTPMDTQTSSHESSTDV----------PLSPAPAPTIASPAEQPISTASVDSSSHDTSSNASQSMVPT
        Y +     S  L ++SPS +      + P+LS T + T TS   S+  +          P +P  A T  +PA Q +S        H  SS A  S+  T
Subjt:  YKQ----QSTPLPSISPSVTQ-----NVPILSSTPMDTQTSSHESSTDV----------PLSPAPAPTIASPAEQPISTASVDSSSHDTSSNASQSMVPT

Query:  VV----NNHPMVTRGKHGIFKPKALMVSYDNTEPPSAKEAFKHPAWRKAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVA
        +     N HPM+TR K GI KPK  + +    EP S   A +   W+KAM +EYDAL  N TW+LVPLP  ++ +GCKWV+K K N DG++ +YKARLVA
Subjt:  VV----NNHPMVTRGKHGIFKPKALMVSYDNTEPPSAKEAFKHPAWRKAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVA

Query:  KGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGLLSENVYMEQPGGFECVSGFTP-LVCKLRKALYGLKQALRAWFERLSAFL
        KGFHQ    D++ET+SP+VKP T+RV+ T+AL+  W+++Q+D+NNAFL+G L E V+M+QP GF  +    P LVC+L KALYGLKQA RAWFE+L   L
Subjt:  KGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGLLSENVYMEQPGGFECVSGFTP-LVCKLRKALYGLKQALRAWFERLSAFL

Query:  HSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV
         S GF ++K D SL  + +   + Y+LVYVDDI+
Subjt:  HSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV

A5BK17 Integrase catalytic domain-containing protein1.7e-19639.69Show/hide
Query:  KLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIWKCLQI-FNLQN
        KL+N NFL  + QI   L GH+L+ F+ E    P + +    D     +NP +  W++Q +LI S LL S++  +L ++++C TS  +WK L++ F  Q 
Subjt:  KLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIWKCLQI-FNLQN

Query:  RAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAK------
        RA++ + K++L   +KG+LS+++Y  +I++ VD LA +G ++  +DHI  I  GL  +YE+ I  + ++ +   V+E   LLL  E++IE   K      
Subjt:  RAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAK------

Query:  -------TVNNDGSVPSANLMTQSNHTPDPDPQKQNN--QYTSSNYNNRGRGGGRSNRGGRSWNNNRNKPQCQYCFKFEHTAIKCYALAGRFPQQQSGNS
               T N +GS P  N    + ++    P    N  Q+   N+  +GR  GR  RG  SW  N NKPQCQ C +  H  ++CY    RF Q  +G S
Subjt:  -------TVNNDGSVPSANLMTQSNHTPDPDPQKQNN--QYTSSNYNNRGRGGGRSNRGGRSWNNNRNKPQCQYCFKFEHTAIKCYALAGRFPQQQSGNS

Query:  TF---HPQFGAGAFLPNQQMQEML---------TASDMNQDNGWYPDS-------------------------------GLSISHSGNSSFNSS---NRL
              PQ G  A L +QQ+ E           T +++ QDN WYPDS                               GL I H G++SF+SS   ++ 
Subjt:  TF---HPQFGAGAFLPNQQMQEML---------TASDMNQDNGWYPDS-------------------------------GLSISHSGNSSFNSS---NRL

Query:  FHLKNLLHVPQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNLSILPSTSTRNIPAIN-KCLSSQNHSSVLTVNTIEKPSV
          LK LLHVP+ITKNL+SVS+FA DN VFFEFHP   +VKD  T   L+ G L  GLY F+ + L       +P  N  C +S    S          S 
Subjt:  FHLKNLLHVPQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNLSILPSTSTRNIPAIN-KCLSSQNHSSVLTVNTIEKPSV

Query:  VDIWHRRLGHPASFVVQTIVKLCNAVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWDRA-----------------------------
          +WH RLGHP+S +V  ++  CN      I    C+ C +GK H   F  S S YTKPL+LI  D+W  A                             
Subjt:  VDIWHRRLGHPASFVVQTIVKLCNAVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWDRA-----------------------------

Query:  ---------------------------------------YITSRNGF--KVTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISN
                                               Y+TS NG   +++CPYT +QNG+AERKHR IV+ G+ LL+QASLPF +WD+AF T+ Y+ N
Subjt:  ---------------------------------------YITSRNGF--KVTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISN

Query:  RLPSTVLNGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLSSDGRLYVLHNVTFDELSFPYKQ---QSTPLPS
        RLP+ VL    PL+ LFH  P+Y+ LK+FGC CYP LRP+N HK+ FRSIPCTFLGYS  HKGYKCLS +G + +  +V FDE +FP+ Q   Q     S
Subjt:  RLPSTVLNGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLSSDGRLYVLHNVTFDELSFPYKQ---QSTPLPS

Query:  ISPSVTQNVPILSSTPMDTQTSSHESSTDVPLSPAPAPTIASPAEQPISTASVDSSSHDTSSNASQSMVPTVVNNHPMVTRGKHGIFKPKALMVSYDNTE
         S S T ++P  +S P+    SS   ST  P +P+  P                +S+H+ +S    S  P   ++H M+TR K GIFKPKA ++S   T 
Subjt:  ISPSVTQNVPILSSTPMDTQTSSHESSTDVPLSPAPAPTIASPAEQPISTASVDSSSHDTSSNASQSMVPTVVNNHPMVTRGKHGIFKPKALMVSYDNTE

Query:  PPSAKEAFKHPAWRKAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALA
        P S  EA +   W++AM  EY AL+ N TW LVP P ++K++GCKWVFK+K N DG+I +YKARLVAKGFHQ    D++ET+SP+VKP TIR++LT+AL 
Subjt:  PPSAKEAFKHPAWRKAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALA

Query:  NKWSLRQVDINNAFLHGLLSENVYMEQPGGFECVSGFTP-LVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDI
         +W +RQ+D+NNAFL+G L E+++M QP GF  +    P  VCKL K+LYGLKQA RAWFE+L   L  +GF ++K D SL    + T   YILVYVDDI
Subjt:  NKWSLRQVDINNAFLHGLLSENVYMEQPGGFECVSGFTP-LVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDI

Query:  V
        +
Subjt:  V

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.3e-4127.84Show/hide
Query:  FKVTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVL--NGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKIS
        + +T P+T Q NG++ER  R I +    ++S A L   FW +A  TA Y+ NR+PS  L  +   P +   +  P    L++FG   Y  ++     K  
Subjt:  FKVTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVL--NGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKIS

Query:  FRSIPCTFLGYSNQHKGYKCLSS-DGRLYVLHNVTFDEL----SFPYKQQSTPLPSISPSVTQNVPILSSTPMDTQ--------------TSSHES----
         +S    F+GY  +  G+K   + + +  V  +V  DE     S   K ++  L     S  +N P  S   + T+                S ES    
Subjt:  FRSIPCTFLGYSNQHKGYKCLSS-DGRLYVLHNVTFDEL----SFPYKQQSTPLPSISPSVTQNVPILSSTPMDTQ--------------TSSHES----

Query:  ---------STDVPLSPAPAPTIASPAEQPISTASV-----------------------DSSSHDTSSNASQSMV--PTVVNNHPMVTRGKHGI-FKPKA
                  T+ P        I    +   S                           +S   +T+ +  +  +  PT  +   ++ R    +  KP+ 
Subjt:  ---------STDVPLSPAPAPTIASPAEQPISTASV-----------------------DSSSHDTSSNASQSMV--PTVVNNHPMVTRGKHGI-FKPKA

Query:  LMVSYDNT-------------EPPSAKEAFKH----PAWRKAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTP
             DN+             + P++ +  ++     +W +A+ +E +A   N TWT+   P N+ +V  +WVF +K N  G+  RYKARLVA+GF Q  
Subjt:  LMVSYDNT-------------EPPSAKEAFKH----PAWRKAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTP

Query:  DIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGLLSENVYMEQPGGFECVSGFTPLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTS
         IDY ET++P+ +  + R +L+L +     + Q+D+  AFL+G L E +YM  P G  C S     VCKL KA+YGLKQA R WFE     L    F  S
Subjt:  DIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGLLSENVYMEQPGGFECVSGFTPLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTS

Query:  KVDPS--LLFKHSATGVCYILVYVDDIV
         VD    +L K +     Y+L+YVDD+V
Subjt:  KVDPS--LLFKHSATGVCYILVYVDDIV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.2e-7626.59Show/hide
Query:  GNKISTVKLNNDN-FLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIWKCL
        G K    K N DN F TW+ ++   L    L   +L+     P T+                 W   D   +S +   +S D++  +I   T+R IW  L
Subjt:  GNKISTVKLNNDN-FLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIWKCL

Query:  Q-IFNLQNRAQIMRLKSKLQTIQKG-NLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIET
        + ++  +     + LK +L  +      +   + +     +  LA +G +++EED    +L  L S Y+++ + I      I +++  + LL +E     
Subjt:  Q-IFNLQNRAQIMRLKSKLQTIQKG-NLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIET

Query:  KAKTVNNDGSVPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNRGRGGGRSNRGGRSWNNNRNKPQCQYCFKFEHTAIKCYALAGRFP-QQQSGNSTFHPQ
          K   N G      L+T+           +   Y  S+ NN GR G R     RS +  RN   C     F+          G    Q+   N+    Q
Subjt:  KAKTVNNDGSVPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNRGRGGGRSNRGGRSWNNNRNKPQCQYCFKFEHTAIKCYALAGRFP-QQQSGNSTFHPQ

Query:  FGAGAFLPNQQMQEMLTASDMNQDNGWYPDSGLS-------------------ISHSGNSSFNSSNRL------------FHLKNLLHVPQITKNLISVS
              L   + +E +  S    ++ W  D+  S                       GN+S++    +              LK++ HVP +  NLIS  
Subjt:  FGAGAFLPNQQMQEMLTASDMNQDNGWYPDSGLS-------------------ISHSGNSSFNSSNRL------------FHLKNLLHVPQITKNLISVS

Query:  QFARDNSVFFEFHPEFFYVKDRLTSQTLL--QGPLHEGLYRFNLSILPSTSTRNIPAINKCLSSQNHSSVLTVNTIEKPSVVDIWHRRLGHPASFVVQTI
           RD      +   F   K RLT  +L+  +G     LYR N  I                          +N  +    VD+WH+R+GH +   +Q +
Subjt:  QFARDNSVFFEFHPEFFYVKDRLTSQTLL--QGPLHEGLYRFNLSILPSTSTRNIPAINKCLSSQNHSSVLTVNTIEKPSVVDIWHRRLGHPASFVVQTI

Query:  VK--LCNAVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWDRAYITSRNGFKV------------------------------------
         K  L +  K   +K   C+ C  GK H + FQTS+      L L+ +DV     I S  G K                                     
Subjt:  VK--LCNAVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWDRAYITSRNGFKV------------------------------------

Query:  -----------------------------------TCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSCPLKQL
                                           T P T Q NG+AER +R IV+    +L  A LP  FW +A  TA Y+ NR PS  L    P +  
Subjt:  -----------------------------------TCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSCPLKQL

Query:  FHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLSSDGRLYV-LHNVTFDELSFPYKQQSTPLPSISPSVTQNVPILSSTPMD
         + + +Y+ LK+FGCR +  +      K+  +SIPC F+GY ++  GY+      +  +   +V F E     +  +     +   +  N   + ST   
Subjt:  FHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLSSDGRLYV-LHNVTFDELSFPYKQQSTPLPSISPSVTQNVPILSSTPMD

Query:  TQTSSHESSTDVPLSPAPAP-TIASPAEQPISTASVDSSSHDTSSNASQSMVPTVVNNHPMVTRGKHGIFKPKALMVSYDNTEPPSAKEAFKHP---AWR
           +S ES+TD        P  +    EQ      V+   H T         P   +  P V   ++     + +++S D+ EP S KE   HP      
Subjt:  TQTSSHESSTDVPLSPAPAP-TIASPAEQPISTASVDSSSHDTSSNASQSMVPTVVNNHPMVTRGKHGIFKPKALMVSYDNTEPPSAKEAFKHP---AWR

Query:  KAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAF
        KAM+ E ++L  N T+ LV LP+ ++ + CKWVFK+K++ D  + RYKARLV KGF Q   ID+ E +SP+VK  +IR +L+LA +    + Q+D+  AF
Subjt:  KAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAF

Query:  LHGLLSENVYMEQPGGFECVSGFTPLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKH-SATGVCYILVYVDDIV
        LHG L E +YMEQP GFE V+G   +VCKL K+LYGLKQA R W+ +  +F+ S  +  +  DP + FK  S      +L+YVDD++
Subjt:  LHGLLSENVYMEQPGGFECVSGFTPLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKH-SATGVCYILVYVDDIV

P92520 Uncharacterized mitochondrial protein AtMg008204.2e-2752.8Show/hide
Query:  MVTRGKHGIFK--PK--ALMVSYDNTEPPSAKEAFKHPAWRKAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQT
        M+TR K GI K  PK    + +    EP S   A K P W +AM+ E DAL  N+TW LVP P NQ ++GCKWVFK K +SDG++ R KARLVAKGFHQ 
Subjt:  MVTRGKHGIFK--PK--ALMVSYDNTEPPSAKEAFKHPAWRKAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQT

Query:  PDIDYSETYSPMVKPVTIRVLLTLA
          I + ETYSP+V+  TIR +L +A
Subjt:  PDIDYSETYSPMVKPVTIRVLLTLA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-14433.02Show/hide
Query:  SSNSIVNPGNKISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTS
        ++ SI+N  N  +  KL + N+L W  Q+    +G+EL  F+      PP TI  GTD    ++NP Y +WK+QD+LI S +LG++S  +   V    T+
Subjt:  SSNSIVNPGNKISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTS

Query:  RAIWKCL-QIFNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTH
          IW+ L +I+   +   + +L+++L+   KG  ++++Y   +    D LA +GK +D ++ +  +L  L  EY+ +I  I A+     + E    LL H
Subjt:  RAIWKCL-QIFNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTH

Query:  ENQIETKAKTVNNDGSVP-SANLMTQSNHTP--DPDPQKQNNQYTSSNYNNRGRGGGRSNRGGRSWNNNRNKP---QCQYCFKFEHTAIKCYALAGRFPQ
        E++I      V++   +P +AN ++  N T   + +   +NN+Y + N NN  +   +S+      NNN++KP   +CQ C    H+A +C        Q
Subjt:  ENQIETKAKTVNNDGSVP-SANLMTQSNHTP--DPDPQKQNNQYTSSNYNNRGRGGGRSNRGGRSWNNNRNKP---QCQYCFKFEHTAIKCYALAGRFPQ

Query:  QQSGNSTFHPQFGAGAFLPNQQMQEMLTASDMNQDNGWYPDSG-------------------------------LSISHSGNSSFNSSNRLFHLKNLLHV
         Q   S+ + Q     F P Q    +   S  + +N W  DSG                               + ISH+G++S ++ +R  +L N+L+V
Subjt:  QQSGNSTFHPQFGAGAFLPNQQMQEMLTASDMNQDNGWYPDSG-------------------------------LSISHSGNSSFNSSNRLFHLKNLLHV

Query:  PQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNLSILPSTSTRNIPAINKCLSSQNHSSVLTVNTIEKPSVVDIWHRRLGH
        P I KNLISV +    N V  EF P  F VKD  T   LLQG   + LY +     P  S++ +       S   HSS               WH RLGH
Subjt:  PQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNLSILPSTSTRNIPAINKCLSSQNHSSVLTVNTIEKPSVVDIWHRRLGH

Query:  PASFVVQTIVKLCN-AVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWD-----------------------------------RAYIT
        PA  ++ +++   + +V +P  K   C+ C + KS+ +PF  S    T+PL+ I +DVW                                      +IT
Subjt:  PASFVVQTIVKLCN-AVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWD-----------------------------------RAYIT

Query:  SRN---------------------------------GFKVTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSC
         +N                                     + P+T + NG++ERKHR IV+ GL LLS AS+P  +W  AFA A Y+ NRLP+ +L    
Subjt:  SRN---------------------------------GFKVTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSC

Query:  PLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLS-SDGRLYVLHNVTFDELSFPY-------------KQQST----
        P ++LF   PNY  L++FGC CYP LRPYN HK+  +S  C FLGYS     Y CL     RLY+  +V FDE  FP+             +++S+    
Subjt:  PLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLS-SDGRLYVLHNVTFDELSFPY-------------KQQST----

Query:  ------------PLPSIS-------PSVTQNVPI----LSSTPMDTQTSSHESSTDVPLSP---APAPTI----------------------ASPAE--Q
                    P PS S       P  + + P     +SS+ +D+  SS   S+  P +P    P PT                        SP++  Q
Subjt:  ------------PLPSIS-------PSVTQNVPI----LSSTPMDTQTSSHESSTDVPLSP---APAPTI----------------------ASPAE--Q

Query:  PISTASVDSSS--HDTSSNASQSMVPT--------------VVNN--------HPMVTRGKHGIFKPK---ALMVSY-DNTEPPSAKEAFKHPAWRKAME
         +ST +  SSS    T+S +S S  PT              +VNN        H M TR K GI KP    +L VS    +EP +A +A K   WR AM 
Subjt:  PISTASVDSSS--HDTSSNASQSMVPT--------------VVNN--------HPMVTRGKHGIFKPK---ALMVSY-DNTEPPSAKEAFKHPAWRKAME

Query:  SEYDALMANQTWTLVPLPRNQ-KVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHG
        SE +A + N TW LVP P +   +VGC+W+F  K NSDGS+ RYKARLVAKG++Q P +DY+ET+SP++K  +IR++L +A+   W +RQ+D+NNAFL G
Subjt:  SEYDALMANQTWTLVPLPRNQ-KVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHG

Query:  LLSENVYMEQPGGFECVSGFTP-LVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV
         L+++VYM QP GF  +    P  VCKLRKALYGLKQA RAW+  L  +L +IGF  S  D SL        + Y+LVYVDDI+
Subjt:  LLSENVYMEQPGGFECVSGFTP-LVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-13731.83Show/hide
Query:  NKISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIWKCL-Q
        N  +  KL + N+L W  Q+    +G+EL  F+      PP TI  GTD  V ++NP Y +W++QD+LI S +LG++S  +   V    T+  IW+ L +
Subjt:  NKISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIWKCL-Q

Query:  IFNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAK
        I+   +   + +L+   +                    D LA +GK +D ++ +  +L  L  +Y+ +I  I A+     + E    L+      E+K  
Subjt:  IFNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAK

Query:  TVNNDGSVP-SANLMTQSNHTPDPDPQKQNNQYTSSNYNNRGRGGGRSNRGGRSWNNNRNKP---QCQYCFKFEHTAIKCYALAGRFPQQQSGNSTFHPQ
         +N+   VP +AN++T  N   + +   + +    +N NNR      S+ G RS +N + KP   +CQ C    H+A +C       PQ     ST + Q
Subjt:  TVNNDGSVP-SANLMTQSNHTPDPDPQKQNNQYTSSNYNNRGRGGGRSNRGGRSWNNNRNKP---QCQYCFKFEHTAIKCYALAGRFPQQQSGNSTFHPQ

Query:  FGAGAFLPNQQMQEMLTASDMNQDNGWYPDSG-------------------------------LSISHSGNSSFNSSNRLFHLKNLLHVPQITKNLISVS
             F P Q    +   S  N +N W  DSG                               + I+H+G++S  +S+R   L  +L+VP I KNLISV 
Subjt:  FGAGAFLPNQQMQEMLTASDMNQDNGWYPDSG-------------------------------LSISHSGNSSFNSSNRLFHLKNLLHVPQITKNLISVS

Query:  QFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNLSILPSTSTRNIPAINKCLSSQNHSSVLTVNTIEKPSVVDIWHRRLGHPASFVVQTIVK
        +    N V  EF P  F VKD  T   LLQG   + LY +     P  S++ +       S   HSS               WH RLGHP+  ++ +++ 
Subjt:  QFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNLSILPSTSTRNIPAINKCLSSQNHSSVLTVNTIEKPSVVDIWHRRLGHPASFVVQTIVK

Query:  LCN-AVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWD---------RAYI--------------------------------------
          +  V +P  K   C+ C + KSH +PF  S    +KPL+ I +DVW          R Y+                                      
Subjt:  LCN-AVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWD---------RAYI--------------------------------------

Query:  -------------------TSRNGFK--VTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPN
                            S++G     + P+T + NG++ERKHR IV++GL LLS AS+P  +W  AF+ A Y+ NRLP+ +L    P ++LF   PN
Subjt:  -------------------TSRNGFK--VTCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPN

Query:  YTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLS-SDGRLYVLHNVTFDELSFPYKQ-------------------------------
        Y  LK+FGC CYP LRPYN HK+  +S  C F+GYS     Y CL    GRLY   +V FDE  FP+                                 
Subjt:  YTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQHKGYKCLS-SDGRLYVLHNVTFDELSFPYKQ-------------------------------

Query:  -----------QSTPLPSISPS---------------------------VTQNVPILSSTPMDTQTSSHES---STDVPLSPAP-APTIASPAEQ-----
                    ++P P  SPS                            + N P  ++ P  TQ S+  S   +   P SP+P +P   SP  Q     
Subjt:  -----------QSTPLPSISPS---------------------------VTQNVPILSSTPMDTQTSSHES---STDVPLSPAP-APTIASPAEQ-----

Query:  ---PISTASVDSSSHDTSSNASQSMVPTV--------------VNNHPMVTRGKHGIFKPKALMVSY-----DNTEPPSAKEAFKHPAWRKAMESEYDAL
           P  + S+   +  +SS+ S   +P V              VN H M TR K GI KP     SY      N+EP +A +A K   WR+AM SE +A 
Subjt:  ---PISTASVDSSSHDTSSNASQSMVPTV--------------VNNHPMVTRGKHGIFKPKALMVSY-----DNTEPPSAKEAFKHPAWRKAMESEYDAL

Query:  MANQTWTLV-PLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGLLSENV
        + N TW LV P P +  +VGC+W+F  K NSDGS+ RYKARLVAKG++Q P +DY+ET+SP++K  +IR++L +A+   W +RQ+D+NNAFL G L++ V
Subjt:  MANQTWTLV-PLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLALANKWSLRQVDINNAFLHGLLSENV

Query:  YMEQPGGFECVSGFTP-LVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV
        YM QP GF  V    P  VC+LRKA+YGLKQA RAW+  L  +L ++GF  S  D SL        + Y+LVYVDDI+
Subjt:  YMEQPGGFECVSGFTP-LVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYVDDIV

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).9.1e-0925.76Show/hide
Query:  ISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIWKCL-QIF
        I  +  + DN++ WK++    L   +   FI             GT P     +P Y  W++ + ++  WL+ SM+  +LE V++ +T+  +W+ L ++F
Subjt:  ISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQVIHCKTSRAIWKCL-QIF

Query:  NLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQI
              +I +L+ +L T+++G  S+ EYF ++
Subjt:  NLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQI

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)3.8e-0724.27Show/hide
Query:  DPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILE-QVIHCKTSRAIW-KCLQIFNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQ
        D  +   N     W+K+D ++   L G+++    +   +   TSR IW +    F     A+ +RL S+L+T   G++ + +Y+ ++K   D+L  +   
Subjt:  DPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILE-QVIHCKTSRAIW-KCLQIFNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQ

Query:  VDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAKTVNNDGSVPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNRGRGGG-R
        V + + + Y+L GL  +++++I+VI  +       +A T+L   E++++   K         S++ +   +  P P    Q +      Y  RGRG    
Subjt:  VDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAKTVNNDGSVPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNRGRGGG-R

Query:  SNRGGR
          RGGR
Subjt:  SNRGGR

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.7e-4545.1Show/hide
Query:  EPPSAKEAFKHPAWRKAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLAL
        EP +  EA +   W  AM+ E  A+    TW +  LP N+K +GCKWV+KIK NSDG+I RYKARLVAKG+ Q   ID+ ET+SP+ K  +++++L ++ 
Subjt:  EPPSAKEAFKHPAWRKAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSPMVKPVTIRVLLTLAL

Query:  ANKWSLRQVDINNAFLHGLLSENVYMEQPGGFECVSGFT---PLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYV
           ++L Q+DI+NAFL+G L E +YM+ P G+    G +     VC L+K++YGLKQA R WF + S  L   GF  S  D +   K +AT    +LVYV
Subjt:  ANKWSLRQVDINNAFLHGLLSENVYMEQPGGFECVSGFT---PLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILVYV

Query:  DDIV
        DDI+
Subjt:  DDIV

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.3e-1227.18Show/hide
Query:  KWKKQDRLISSWLLGSMSYDILEQVIHCK-TSRAIWKCLQ-IFNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILT
        +WK++D L+  W+ G+++  +L+ +I    T+R +W  L+ +F     A+ ++ +++L+T    +LS++EY  ++K   D L  +   + +   + ++L 
Subjt:  KWKKQDRLISSWLLGSMSYDILEQVIHCK-TSRAIWKCLQ-IFNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILT

Query:  GLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAKTVNNDGSVPS-ANLMTQSNHTPDPDPQKQNNQYTSSNYNNRGRG-GGRSNRGGRS----
        GL  +Y+ +++VI  ++      EA ++LL  E+++  K+K+  +  + PS +N++       +  PQ+ +N     N +N GRG   + NRGG S    
Subjt:  GLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQIETKAKTVNNDGSVPS-ANLMTQSNHTPDPDPQKQNNQYTSSNYNNRGRG-GGRSNRGGRS----

Query:  WNNNRN
        +NNN N
Subjt:  WNNNRN

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.0e-2852.8Show/hide
Query:  MVTRGKHGIFK--PK--ALMVSYDNTEPPSAKEAFKHPAWRKAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQT
        M+TR K GI K  PK    + +    EP S   A K P W +AM+ E DAL  N+TW LVP P NQ ++GCKWVFK K +SDG++ R KARLVAKGFHQ 
Subjt:  MVTRGKHGIFK--PK--ALMVSYDNTEPPSAKEAFKHPAWRKAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQT

Query:  PDIDYSETYSPMVKPVTIRVLLTLA
          I + ETYSP+V+  TIR +L +A
Subjt:  PDIDYSETYSPMVKPVTIRVLLTLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGCATCTACCCATACTGAAGCTGACTCATATGAAGGAAATGAGAACTCATCAAACTCAATTGTGAATCCGGGCAACAAGATATCGACTGTCAAGCTCAACAATGA
CAACTTCCTCACGTGGAAAGTTCAAATCGAGTTTGCCCTAGAAGGCCATGAATTAGAGAGTTTCATTCTTGAAGATATTGAACCTCCACCGAAGACAATCCCTGTGGGAA
CTGATCCGAATGTCACCAAATTGAATCCTGCATACGCAAAATGGAAGAAACAAGATCGACTAATCTCGTCTTGGCTTCTTGGTTCTATGTCATACGACATTCTTGAACAG
GTAATTCATTGCAAGACTTCCAGAGCTATCTGGAAATGCCTCCAAATATTCAATTTGCAAAATAGGGCACAAATCATGCGCCTGAAGTCCAAACTTCAAACAATCCAGAA
GGGCAATTTGTCTCTAAATGAATACTTCTCTCAAATCAAACATTGTGTTGATGCACTTGCTGCAATTGGCAAACAAGTTGATGAAGAAGATCACATTACGTATATCCTAA
CTGGCCTCGGTTCTGAATATGAATCGATGATCTCAGTCATTACAGCTCAAACAGAGGATATTGGCGTTCAAGAAGCGATGACCCTACTCTTGACTCATGAAAACCAAATT
GAAACCAAAGCCAAGACAGTCAATAACGATGGATCAGTTCCTTCTGCCAATCTTATGACACAGAGCAATCATACTCCAGATCCGGATCCTCAAAAGCAAAACAATCAGTA
CACCTCGTCGAATTATAACAACCGAGGACGTGGAGGTGGCAGATCCAATCGTGGTGGCCGATCCTGGAATAACAATCGCAATAAACCACAGTGTCAGTACTGTTTTAAGT
TTGAGCATACTGCCATCAAATGCTATGCTCTTGCGGGTCGCTTCCCTCAACAACAGTCTGGCAACTCTACTTTTCATCCTCAGTTTGGAGCTGGTGCATTTCTGCCTAAT
CAACAGATGCAAGAAATGCTTACTGCCTCAGATATGAACCAAGATAATGGTTGGTACCCAGATTCGGGTCTGTCCATCTCTCATTCTGGCAATTCTTCGTTTAATTCCTC
AAATCGTCTCTTTCATCTTAAAAACTTACTCCATGTTCCCCAAATAACCAAGAATCTTATCTCTGTTAGCCAATTTGCTAGAGACAACTCAGTTTTCTTTGAATTTCACC
CTGAATTTTTCTATGTGAAGGATCGTCTTACTAGCCAAACGCTACTCCAAGGTCCACTCCATGAAGGGCTGTACAGGTTCAATCTTTCCATTTTGCCATCTACTTCTACT
AGAAATATACCTGCTATAAATAAATGCCTTTCGTCTCAAAATCATTCATCTGTACTTACTGTTAATACCATAGAAAAGCCATCTGTTGTAGATATCTGGCATAGGAGACT
TGGCCATCCAGCCTCTTTTGTTGTTCAAACCATTGTGAAACTTTGTAATGCTGTTAAGTCTCCATATATAAAACATTCTTTTTGTAATGCTTGTGCGGTTGGGAAATCAC
ATGCTTTACCTTTTCAGACCTCTAATAGTGTCTATACTAAACCATTACAATTAATTGTAGCTGATGTATGGGATCGAGCTTACATTACTTCTCGAAATGGCTTTAAAGTA
ACTTGCCCATACACCTCTCAGCAAAACGGGATTGCTGAGCGGAAACATCGACAAATTGTTGACATAGGGCTCATCTTGCTCTCCCAAGCTTCCCTTCCATTCATCTTTTG
GGATGATGCATTTGCTACCGCCGCATACATTAGTAATCGATTGCCTTCTACAGTCCTGAATGGTTCTTGTCCTCTAAAGCAACTATTTCACATTGACCCTAACTATACCT
TTCTCAAGATTTTTGGTTGTCGTTGCTACCCCTGCTTGAGACCCTATAATTCTCACAAAATTAGTTTTCGTTCTATTCCATGTACCTTCCTTGGCTATAGCAATCAGCAT
AAAGGATACAAATGTCTATCCTCTGATGGCCGCTTGTATGTCTTGCACAATGTAACATTTGATGAACTTTCTTTTCCATACAAACAACAATCCACACCTCTGCCCTCAAT
TTCACCTTCTGTTACACAGAATGTCCCCATCCTGTCCTCCACACCTATGGATACTCAAACATCCTCACATGAGTCATCGACTGATGTGCCTCTCAGCCCTGCACCTGCTC
CAACTATTGCATCACCTGCTGAACAGCCTATATCTACTGCCTCAGTTGACAGCAGCTCGCATGATACAAGTTCTAATGCTTCTCAATCTATGGTTCCTACTGTAGTGAAC
AATCATCCCATGGTGACTAGAGGTAAACATGGAATTTTTAAACCTAAAGCACTTATGGTATCGTATGACAATACTGAACCTCCAAGTGCTAAAGAGGCATTCAAGCATCC
TGCGTGGAGAAAGGCCATGGAATCTGAATATGATGCACTGATGGCGAATCAAACTTGGACCCTTGTCCCTCTGCCCAGAAATCAAAAAGTGGTTGGGTGTAAGTGGGTGT
TCAAAATCAAACGAAACTCTGATGGATCCATAGCTCGTTATAAGGCTAGACTGGTCGCAAAAGGGTTTCATCAAACACCAGACATAGACTATTCTGAAACTTATAGTCCA
ATGGTGAAACCAGTCACTATTAGAGTTCTACTTACTCTTGCCTTGGCAAATAAATGGTCCTTACGGCAGGTTGACATAAATAATGCGTTTCTCCATGGTCTCCTTTCTGA
AAATGTGTACATGGAGCAACCTGGGGGTTTTGAATGTGTCTCTGGTTTTACACCACTTGTCTGTAAACTTCGTAAAGCCTTATATGGACTCAAACAAGCCCTTCGTGCTT
GGTTTGAACGCCTAAGTGCCTTCTTACACTCTATTGGTTTTAAAACATCCAAAGTTGATCCGTCTCTATTGTTTAAGCACTCAGCTACTGGAGTTTGTTACATACTTGTG
TATGTGGATGACATAGTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGCATCTACCCATACTGAAGCTGACTCATATGAAGGAAATGAGAACTCATCAAACTCAATTGTGAATCCGGGCAACAAGATATCGACTGTCAAGCTCAACAATGA
CAACTTCCTCACGTGGAAAGTTCAAATCGAGTTTGCCCTAGAAGGCCATGAATTAGAGAGTTTCATTCTTGAAGATATTGAACCTCCACCGAAGACAATCCCTGTGGGAA
CTGATCCGAATGTCACCAAATTGAATCCTGCATACGCAAAATGGAAGAAACAAGATCGACTAATCTCGTCTTGGCTTCTTGGTTCTATGTCATACGACATTCTTGAACAG
GTAATTCATTGCAAGACTTCCAGAGCTATCTGGAAATGCCTCCAAATATTCAATTTGCAAAATAGGGCACAAATCATGCGCCTGAAGTCCAAACTTCAAACAATCCAGAA
GGGCAATTTGTCTCTAAATGAATACTTCTCTCAAATCAAACATTGTGTTGATGCACTTGCTGCAATTGGCAAACAAGTTGATGAAGAAGATCACATTACGTATATCCTAA
CTGGCCTCGGTTCTGAATATGAATCGATGATCTCAGTCATTACAGCTCAAACAGAGGATATTGGCGTTCAAGAAGCGATGACCCTACTCTTGACTCATGAAAACCAAATT
GAAACCAAAGCCAAGACAGTCAATAACGATGGATCAGTTCCTTCTGCCAATCTTATGACACAGAGCAATCATACTCCAGATCCGGATCCTCAAAAGCAAAACAATCAGTA
CACCTCGTCGAATTATAACAACCGAGGACGTGGAGGTGGCAGATCCAATCGTGGTGGCCGATCCTGGAATAACAATCGCAATAAACCACAGTGTCAGTACTGTTTTAAGT
TTGAGCATACTGCCATCAAATGCTATGCTCTTGCGGGTCGCTTCCCTCAACAACAGTCTGGCAACTCTACTTTTCATCCTCAGTTTGGAGCTGGTGCATTTCTGCCTAAT
CAACAGATGCAAGAAATGCTTACTGCCTCAGATATGAACCAAGATAATGGTTGGTACCCAGATTCGGGTCTGTCCATCTCTCATTCTGGCAATTCTTCGTTTAATTCCTC
AAATCGTCTCTTTCATCTTAAAAACTTACTCCATGTTCCCCAAATAACCAAGAATCTTATCTCTGTTAGCCAATTTGCTAGAGACAACTCAGTTTTCTTTGAATTTCACC
CTGAATTTTTCTATGTGAAGGATCGTCTTACTAGCCAAACGCTACTCCAAGGTCCACTCCATGAAGGGCTGTACAGGTTCAATCTTTCCATTTTGCCATCTACTTCTACT
AGAAATATACCTGCTATAAATAAATGCCTTTCGTCTCAAAATCATTCATCTGTACTTACTGTTAATACCATAGAAAAGCCATCTGTTGTAGATATCTGGCATAGGAGACT
TGGCCATCCAGCCTCTTTTGTTGTTCAAACCATTGTGAAACTTTGTAATGCTGTTAAGTCTCCATATATAAAACATTCTTTTTGTAATGCTTGTGCGGTTGGGAAATCAC
ATGCTTTACCTTTTCAGACCTCTAATAGTGTCTATACTAAACCATTACAATTAATTGTAGCTGATGTATGGGATCGAGCTTACATTACTTCTCGAAATGGCTTTAAAGTA
ACTTGCCCATACACCTCTCAGCAAAACGGGATTGCTGAGCGGAAACATCGACAAATTGTTGACATAGGGCTCATCTTGCTCTCCCAAGCTTCCCTTCCATTCATCTTTTG
GGATGATGCATTTGCTACCGCCGCATACATTAGTAATCGATTGCCTTCTACAGTCCTGAATGGTTCTTGTCCTCTAAAGCAACTATTTCACATTGACCCTAACTATACCT
TTCTCAAGATTTTTGGTTGTCGTTGCTACCCCTGCTTGAGACCCTATAATTCTCACAAAATTAGTTTTCGTTCTATTCCATGTACCTTCCTTGGCTATAGCAATCAGCAT
AAAGGATACAAATGTCTATCCTCTGATGGCCGCTTGTATGTCTTGCACAATGTAACATTTGATGAACTTTCTTTTCCATACAAACAACAATCCACACCTCTGCCCTCAAT
TTCACCTTCTGTTACACAGAATGTCCCCATCCTGTCCTCCACACCTATGGATACTCAAACATCCTCACATGAGTCATCGACTGATGTGCCTCTCAGCCCTGCACCTGCTC
CAACTATTGCATCACCTGCTGAACAGCCTATATCTACTGCCTCAGTTGACAGCAGCTCGCATGATACAAGTTCTAATGCTTCTCAATCTATGGTTCCTACTGTAGTGAAC
AATCATCCCATGGTGACTAGAGGTAAACATGGAATTTTTAAACCTAAAGCACTTATGGTATCGTATGACAATACTGAACCTCCAAGTGCTAAAGAGGCATTCAAGCATCC
TGCGTGGAGAAAGGCCATGGAATCTGAATATGATGCACTGATGGCGAATCAAACTTGGACCCTTGTCCCTCTGCCCAGAAATCAAAAAGTGGTTGGGTGTAAGTGGGTGT
TCAAAATCAAACGAAACTCTGATGGATCCATAGCTCGTTATAAGGCTAGACTGGTCGCAAAAGGGTTTCATCAAACACCAGACATAGACTATTCTGAAACTTATAGTCCA
ATGGTGAAACCAGTCACTATTAGAGTTCTACTTACTCTTGCCTTGGCAAATAAATGGTCCTTACGGCAGGTTGACATAAATAATGCGTTTCTCCATGGTCTCCTTTCTGA
AAATGTGTACATGGAGCAACCTGGGGGTTTTGAATGTGTCTCTGGTTTTACACCACTTGTCTGTAAACTTCGTAAAGCCTTATATGGACTCAAACAAGCCCTTCGTGCTT
GGTTTGAACGCCTAAGTGCCTTCTTACACTCTATTGGTTTTAAAACATCCAAAGTTGATCCGTCTCTATTGTTTAAGCACTCAGCTACTGGAGTTTGTTACATACTTGTG
TATGTGGATGACATAGTCTGA
Protein sequenceShow/hide protein sequence
MDASTHTEADSYEGNENSSNSIVNPGNKISTVKLNNDNFLTWKVQIEFALEGHELESFILEDIEPPPKTIPVGTDPNVTKLNPAYAKWKKQDRLISSWLLGSMSYDILEQ
VIHCKTSRAIWKCLQIFNLQNRAQIMRLKSKLQTIQKGNLSLNEYFSQIKHCVDALAAIGKQVDEEDHITYILTGLGSEYESMISVITAQTEDIGVQEAMTLLLTHENQI
ETKAKTVNNDGSVPSANLMTQSNHTPDPDPQKQNNQYTSSNYNNRGRGGGRSNRGGRSWNNNRNKPQCQYCFKFEHTAIKCYALAGRFPQQQSGNSTFHPQFGAGAFLPN
QQMQEMLTASDMNQDNGWYPDSGLSISHSGNSSFNSSNRLFHLKNLLHVPQITKNLISVSQFARDNSVFFEFHPEFFYVKDRLTSQTLLQGPLHEGLYRFNLSILPSTST
RNIPAINKCLSSQNHSSVLTVNTIEKPSVVDIWHRRLGHPASFVVQTIVKLCNAVKSPYIKHSFCNACAVGKSHALPFQTSNSVYTKPLQLIVADVWDRAYITSRNGFKV
TCPYTSQQNGIAERKHRQIVDIGLILLSQASLPFIFWDDAFATAAYISNRLPSTVLNGSCPLKQLFHIDPNYTFLKIFGCRCYPCLRPYNSHKISFRSIPCTFLGYSNQH
KGYKCLSSDGRLYVLHNVTFDELSFPYKQQSTPLPSISPSVTQNVPILSSTPMDTQTSSHESSTDVPLSPAPAPTIASPAEQPISTASVDSSSHDTSSNASQSMVPTVVN
NHPMVTRGKHGIFKPKALMVSYDNTEPPSAKEAFKHPAWRKAMESEYDALMANQTWTLVPLPRNQKVVGCKWVFKIKRNSDGSIARYKARLVAKGFHQTPDIDYSETYSP
MVKPVTIRVLLTLALANKWSLRQVDINNAFLHGLLSENVYMEQPGGFECVSGFTPLVCKLRKALYGLKQALRAWFERLSAFLHSIGFKTSKVDPSLLFKHSATGVCYILV
YVDDIV