; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G21930 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G21930
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRNA-directed DNA polymerase
Genome locationChr6:19876180..19881921
RNA-Seq ExpressionCSPI06G21930
SyntenyCSPI06G21930
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR012337 - Ribonuclease H-like superfamily
IPR036280 - Multiheme cytochrome superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7014963.1 unnamed protein product [Microthlaspi erraticum]5.5e-30454.9Show/hide
Query:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEENSKTRGQ-LFTTCSGKTL---LKERK---------------------QDILALKE-------
        RPWQ+D    ++GR+N   F W   +I + P+   ++   K +   L  T S K L    KE K                     +++L + E       
Subjt:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEENSKTRGQ-LFTTCSGKTL---LKERK---------------------QDILALKE-------

Query:  ---PDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGD
           P+ LPP+RDIQHHIDLIPG+SLPNL HYRM+PKE   L E IEDLLKKG I+ S+SPCAVP LL PKK   WRMCVDSRAIN+IT+KYRFPIPR+ D
Subjt:  ---PDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGD

Query:  LLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQ
        +LD+L  + IFSKIDLRSGYHQIRIRPGDEWKTAFK+ +GL+EW+VMPFGLSNAPSTFMRLMNQ+L PF   F+VVYFDDIL+YS   +EHL HLK++ Q
Subjt:  LLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQ

Query:  VLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEE
        VL E +LY+N KKC F  +++ FLGF++ +  + ++  KV AIR+W AP +V EV++F GL +FYR+F+++FS+I AP+T+CLKKG F W S Q +SF  
Subjt:  VLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEE

Query:  IKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNI
        IK++L ++PVL LPDF   F+V  DA   GIGAVLSQ   P+ + SEKLS ARQ WSTY+QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L SQK I
Subjt:  IKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNI

Query:  SRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLRE
        ++MHARW+S+LQ+F F+I+H++G  NKVADALSR+ SLL  L+ EI+ FE L ELYE DA+F ++W KC+    +  +HI +GFLFKGD+LCIP +SLRE
Subjt:  SRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLRE

Query:  ALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSK
         LI++ H  GL+GH  +DKT  ++  RYYWP LR+D+   VKRC +CQ +KG   N GLY PLP+P  IW+DLS+DFVLGLP+TQR  DSV VVVDRFSK
Subjt:  ALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSK

Query:  MAHFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQ
        M HFIACK+T DA                            FLSHFW TLWR F TTLK S+ AHPQTDGQTEVTNRTLGN+IR + G +PKQWDL+  Q
Subjt:  MAHFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQ

Query:  AEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPT
         EFA+N+  + +T K PF +VYT  P+   DL  LP        AETMAE+I    + V   L  +    K AADK+RR  VF +GD VMV LRK RFP 
Subjt:  AEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPT

Query:  GTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHA
        GTY KL+  + GPF++  K  DNA+ V+LP +M+I + FN+AD+  YHA
Subjt:  GTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHA

KAG7588770.1 Integrase catalytic core [Arabidopsis suecica]1.1e-0627.71Show/hide
Query:  MADNGEINNRRRQRGRGNFRNLHNPRYANRGRADLNPPPQFYADVHNEQEDESSSSDDQNNPWDALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGK
        +A   +I N  +  G G  RN +     NR +   N      +    E E+E     +QNN                         D+++K D+P +SG 
Subjt:  MADNGEINNRRRQRGRGNFRNLHNPRYANRGRADLNPPPQFYADVHNEQEDESSSSDDQNNPWDALDETQRGQHGRRFGTRAETHHDFKMKVDLPSYSGK

Query:  RDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRPWQHDTQTLHKGRE
          +E FLDW    + FF  M+ PE K+V++VA++LK  A+ WW++L   RQR  +   +T  + ++
Subjt:  RDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRPWQHDTQTLHKGRE

PWA81295.1 transposon Ty3-I Gag-Pol polyprotein [Artemisia annua]9.8e-30954.48Show/hide
Query:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEENSKTRGQLFTTCSG-KTLLKERKQ--------------------------------DILALK
        RPWQHD    H+G+ N Y F W GK I +LPL   +         L T  S  K    ERK+                                 ++A  
Subjt:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEENSKTRGQLFTTCSG-KTLLKERKQ--------------------------------DILALK

Query:  EPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLL
         PD LPPLR+IQH IDL+PGASLPNL HYRM+PKE   L E +E+LL+KGHI+ S+SPCAVPALLTPKKDGSWRMCVDSRAIN+ITV+YRFPIPR+ DLL
Subjt:  EPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLL

Query:  DQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVL
        DQL  A +FSKIDLRSGYHQIRI+PGDEWKTAFKT +GL+EW+VMPFGLSNAPSTFMRLM QVL PF+ KF+VVYFDDILVYS    EHL HL+K+ + L
Subjt:  DQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVL

Query:  TEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEEIK
        TE EL++N KKC FL +++ FLG+I+    + ++  KV+A+R+W +P T+ EV++F GLA+FYR+F++NFSSI AP+T+C+KKG FKWT   +ESF+ IK
Subjt:  TEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEEIK

Query:  KRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISR
        +RL ++PVL LP+F + FE+  DAC TGIGAVLSQ G P+ + SEKL+ ARQ WSTYEQELYA+V+A+K+WEHYL+ +EFV+ +DH +LKY Q+Q+++++
Subjt:  KRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISR

Query:  MHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLREAL
        +HARW S+L++F++VIKH++G  NKVADALSRK +LL  +S++++ FE +  LYE D DF   W +         + +L+G+LFKG++LCIP TSLR  L
Subjt:  MHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLREAL

Query:  IKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMA
        IKE H+ GL+ H  +DKT  ++  R+YWPQL++D  +FV+RC VCQ  KG   N GLY PLP+P+S W D+S+DFVLGLP+TQR  DSV VVVDRFSKMA
Subjt:  IKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMA

Query:  HFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQAE
        HFI CK+T+DA                            FL+HFW TLWR+  T+L +S+ AHPQTDGQTEV NRTLGN+IRCL G KPK WD+S AQAE
Subjt:  HFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQAE

Query:  FAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPTGT
        FA+N+  + ST   PF+VVY   PR   DL  LP   + + +A  M E+++  H+ V   + +S   YK AADK RR  +F  GD VMV LRK RFP GT
Subjt:  FAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPTGT

Query:  YNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHAPD
        Y+KL+ ++ GP++I  K  DNA+ V+LP  M I   FN++D+  +H  D
Subjt:  YNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHAPD

PWA81295.1 transposon Ty3-I Gag-Pol polyprotein [Artemisia annua]1.0e-0732.7Show/hide
Query:  NGEINNRRRQRGRGNFRNLHNPRYANRGRADLNPPPQFYADVHNEQEDESSSSDDQNNPWDALDE----TQRGQHGRRFGTRAETHHD-------FKMKV
        NG+  NR      GN+R   N     R  A    P Q +      + DE   SD++    +A DE     + G   RR   R   +H        +++K 
Subjt:  NGEINNRRRQRGRGNFRNLHNPRYANRGRADLNPPPQFYADVHNEQEDESSSSDDQNNPWDALDE----TQRGQHGRRFGTRAETHHD-------FKMKV

Query:  DLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQ
        ++P++ G  DIE+ LDWL   + FF  MD P+ ++V++VA KL+GGA AWW++ + NR+
Subjt:  DLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQ

PWA81295.1 transposon Ty3-I Gag-Pol polyprotein [Artemisia annua]5.5e-30454.9Show/hide
Query:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEENSKTRGQ-LFTTCSGKTL---LKERK---------------------QDILALKE-------
        RPWQ+D    ++GR+N   F W   +I + P+   ++   K +   L  T S K L    KE K                     +++L + E       
Subjt:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEENSKTRGQ-LFTTCSGKTL---LKERK---------------------QDILALKE-------

Query:  ---PDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGD
           P+ LPP+RDIQHHIDLIPG+SLPNL HYRM+PKE   L E IEDLLKKG I+ S+SPCAVP LL PKK   WRMCVDSRAIN+IT+KYRFPIPR+ D
Subjt:  ---PDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGD

Query:  LLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQ
        +LD+L  + IFSKIDLRSGYHQIRIRPGDEWKTAFK+ +GL+EW+VMPFGLSNAPSTFMRLMNQ+L PF   F+VVYFDDIL+YS   +EHL HLK++ Q
Subjt:  LLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQ

Query:  VLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEE
        VL E +LY+N KKC F  +++ FLGF++ +  + ++  KV AIR+W AP +V EV++F GL +FYR+F+++FS+I AP+T+CLKKG F W S Q +SF  
Subjt:  VLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEE

Query:  IKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNI
        IK++L ++PVL LPDF   F+V  DA   GIGAVLSQ   P+ + SEKLS ARQ WSTY+QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L SQK I
Subjt:  IKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNI

Query:  SRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLRE
        ++MHARW+S+LQ+F F+I+H++G  NKVADALSR+ SLL  L+ EI+ FE L ELYE DA+F ++W KC+    +  +HI +GFLFKGD+LCIP +SLRE
Subjt:  SRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLRE

Query:  ALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSK
         LI++ H  GL+GH  +DKT  ++  RYYWP LR+D+   VKRC +CQ +KG   N GLY PLP+P  IW+DLS+DFVLGLP+TQR  DSV VVVDRFSK
Subjt:  ALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSK

Query:  MAHFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQ
        M HFIACK+T DA                            FLSHFW TLWR F TTLK S+ AHPQTDGQTEVTNRTLGN+IR + G +PKQWDL+  Q
Subjt:  MAHFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQ

Query:  AEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPT
         EFA+N+  + +T K PF +VYT  P+   DL  LP        AETMAE+I    + V   L  +    K AADK+RR  VF +GD VMV LRK RFP 
Subjt:  AEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPT

Query:  GTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHA
        GTY KL+  + GPF++  K  DNA+ V+LP +M+I + FN+AD+  YHA
Subjt:  GTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHA

XP_025979678.1 uncharacterized protein LOC112997809 [Glycine max]1.1e-30154.15Show/hide
Query:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLT--KKNEENSKTRGQLFTTCSGK--------------------TLLKERK-------------QDILA
        RPWQ D    ++GR+N   F W   +I + P+    KN ++ K+   + T    K                    + +KE                +++A
Subjt:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLT--KKNEENSKTRGQLFTTCSGK--------------------TLLKERK-------------QDILA

Query:  LKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGD
         + P+ LPP+RDIQH IDLIPG+SLPNL HYRM+PKE   L E IEDLL+KG I+ S+SPCAVP LL PKK   WRMCVDSRAIN+IT+KYRFPIPR+ D
Subjt:  LKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGD

Query:  LLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQ
        +LD+L  + +FSKIDLRSGYHQIRIRPGDEWKTAFK+ +GL+EW+VMPFGLSNAPSTFMRLMNQVL PF+  F+VVYFDDIL+YS   +EHL H++ + Q
Subjt:  LLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQ

Query:  VLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEE
        VL E +LYIN KKC F  +++ FLGF++ +  + ++  KV AIR+W APT+V EV++F GLA+FYR+FI++FS+I AP+T+CLKKG + W   Q++SF  
Subjt:  VLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEE

Query:  IKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNI
        IK++L ++PVL LPDF   F+V  DA   GIGAVLSQ   PI + SEKLS AR+ WSTY+QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L SQK I
Subjt:  IKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNI

Query:  SRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLRE
        ++MHARW+S+LQ+F F+I+H++G  NKVADALSR+ SLL  L+ E++ FE L ELYE DA+F ++W KC  H   + +H+ EGFLFKG++LCIP +SLRE
Subjt:  SRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLRE

Query:  ALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSK
         LI++ H  GL+GH  +DKT  ++  R+YWP LRKD+   VK+C  CQ +KG   N GLY PLPIP  IW+DL++DFVLGLP+TQR  DSV VVVDRFSK
Subjt:  ALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSK

Query:  MAHFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQ
        M+HFIACK+T DA                            FLSHFW TLW+ FDT+L  S+ AHPQTDGQTEVTNRTLGN+IRC+ G KPKQWDL+  Q
Subjt:  MAHFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQ

Query:  AEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPT
         EFA+N+  + +T K PF +VYT  PR   DL  LP        AE MAE+I  +   V   L  +    K AADK++R  VF+ GD VMV LRK RFP 
Subjt:  AEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPT

Query:  GTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHAPD
        GTY+KL+ R+ GPF++T K  DNA+ V LP  M+I + FN+AD+  YHA +
Subjt:  GTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHAPD

XP_025979678.1 uncharacterized protein LOC112997809 [Glycine max]1.3e-1038.1Show/hide
Query:  ETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRPWQHDTQTLHKGRE
        + HHD+++K D+P + G   +E FLDW    + FF  MD PE K+V++VA++LK  A+ WW++L   R R  ++  +T  K ++
Subjt:  ETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRPWQHDTQTLHKGRE

XP_025979678.1 uncharacterized protein LOC112997809 [Glycine max]7.4e-30154.06Show/hide
Query:  RPWQHDTQTLHKGRENTYEFHWMGKRITL-----------------LPLTKKNEENSKT--RGQLFTTCSGKTLLK----------------ERKQDILA
        RPWQ+D    ++GR+N   F W   +I +                 L ++  ++E  K     Q F     K L+                 E  ++++A
Subjt:  RPWQHDTQTLHKGRENTYEFHWMGKRITL-----------------LPLTKKNEENSKT--RGQLFTTCSGKTLLK----------------ERKQDILA

Query:  LKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGD
         + PD LPP+RDIQHHIDLIPG+SLPNL HYRM+PKE   +   IEDLLKKG I+ S+SPCAVP LL PKK   WRMCVDSRAIN+IT+KYRFPIPR+ D
Subjt:  LKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGD

Query:  LLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQ
        +LD+L  + +FSKIDLRSGYHQIRIR GDEWKTAFK+ +GL+EW+VMPFGLSNAPSTFMRLMNQVL PF+  F+VVYFDDIL+YS   D+HL H++++ Q
Subjt:  LLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQ

Query:  VLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEE
        VL E +LY+N KKC F  +++ FLGF++ +  + ++  KV AI++W  P T  EV++F GLA+FYR+F+++FS+I AP+T+CLKKG F W   Q ESF  
Subjt:  VLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEE

Query:  IKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNI
        IK++L ++PVL LPDF   F+V  DA   GIGAVLSQ   PI + SEKLS ARQ WSTY+QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L SQK I
Subjt:  IKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNI

Query:  SRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLRE
        ++MHARW+S+LQ+F F+I+H++G  NKVADALSR+ SLLT L+ EI+ FE L ELYE DA+F ++W KC+    +  +HI EG+LFKGD+LCIP +SLRE
Subjt:  SRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLRE

Query:  ALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSK
         LI+E H  GL+GH  +DKT  ++  RYYWP LRKD+   V+RC VCQ +KG   N GLY PL +P  IW+DLS+DFVLGLP+TQR  DSV VVVD+FSK
Subjt:  ALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSK

Query:  MAHFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQ
        M HFIAC++T DA                            FLSHFW TLWR F T+LK S+ AHPQ+DGQTEVTNRTLGN+IR + G KPKQWDL+  Q
Subjt:  MAHFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQ

Query:  AEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPT
         EFA+N+  + +T K PF +VYT  P+   DL  LP        A+ MA+DI    + V   L  +    K+AADKK+R  VF +GD VMV L+K RFP 
Subjt:  AEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPT

Query:  GTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHA
        GTY KL+ R+ GPF+I +K  DNA+ V+LP DM I   FN+AD+  YHA
Subjt:  GTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHA

TrEMBL top hitse value%identityAlignment
A0A2U1P6A2 Transposon Ty3-I Gag-Pol polyprotein4.9e-0832.7Show/hide
Query:  NGEINNRRRQRGRGNFRNLHNPRYANRGRADLNPPPQFYADVHNEQEDESSSSDDQNNPWDALDE----TQRGQHGRRFGTRAETHHD-------FKMKV
        NG+  NR      GN+R   N     R  A    P Q +      + DE   SD++    +A DE     + G   RR   R   +H        +++K 
Subjt:  NGEINNRRRQRGRGNFRNLHNPRYANRGRADLNPPPQFYADVHNEQEDESSSSDDQNNPWDALDE----TQRGQHGRRFGTRAETHHD-------FKMKV

Query:  DLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQ
        ++P++ G  DIE+ LDWL   + FF  MD P+ ++V++VA KL+GGA AWW++ + NR+
Subjt:  DLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQ

A0A2U1P6A2 Transposon Ty3-I Gag-Pol polyprotein2.7e-30454.9Show/hide
Query:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEENSKTRGQ-LFTTCSGKTL---LKERK---------------------QDILALKE-------
        RPWQ+D    ++GR+N   F W   +I + P+   ++   K +   L  T S K L    KE K                     +++L + E       
Subjt:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEENSKTRGQ-LFTTCSGKTL---LKERK---------------------QDILALKE-------

Query:  ---PDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGD
           P+ LPP+RDIQHHIDLIPG+SLPNL HYRM+PKE   L E IEDLLKKG I+ S+SPCAVP LL PKK   WRMCVDSRAIN+IT+KYRFPIPR+ D
Subjt:  ---PDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGD

Query:  LLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQ
        +LD+L  + IFSKIDLRSGYHQIRIRPGDEWKTAFK+ +GL+EW+VMPFGLSNAPSTFMRLMNQ+L PF   F+VVYFDDIL+YS   +EHL HLK++ Q
Subjt:  LLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQ

Query:  VLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEE
        VL E +LY+N KKC F  +++ FLGF++ +  + ++  KV AIR+W AP +V EV++F GL +FYR+F+++FS+I AP+T+CLKKG F W S Q +SF  
Subjt:  VLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEE

Query:  IKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNI
        IK++L ++PVL LPDF   F+V  DA   GIGAVLSQ   P+ + SEKLS ARQ WSTY+QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L SQK I
Subjt:  IKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNI

Query:  SRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLRE
        ++MHARW+S+LQ+F F+I+H++G  NKVADALSR+ SLL  L+ EI+ FE L ELYE DA+F ++W KC+    +  +HI +GFLFKGD+LCIP +SLRE
Subjt:  SRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLRE

Query:  ALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSK
         LI++ H  GL+GH  +DKT  ++  RYYWP LR+D+   VKRC +CQ +KG   N GLY PLP+P  IW+DLS+DFVLGLP+TQR  DSV VVVDRFSK
Subjt:  ALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSK

Query:  MAHFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQ
        M HFIACK+T DA                            FLSHFW TLWR F TTLK S+ AHPQTDGQTEVTNRTLGN+IR + G +PKQWDL+  Q
Subjt:  MAHFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQ

Query:  AEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPT
         EFA+N+  + +T K PF +VYT  P+   DL  LP        AETMAE+I    + V   L  +    K AADK+RR  VF +GD VMV LRK RFP 
Subjt:  AEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPT

Query:  GTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHA
        GTY KL+  + GPF++  K  DNA+ V+LP +M+I + FN+AD+  YHA
Subjt:  GTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHA

A0A5B7BER3 Uncharacterized protein0.0e+0057.73Show/hide
Query:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNE--ENSKTRGQLFTTCSGKTLLKERK--------------------------------QDILAL
        RPWQ D    HKG++NTY F W  K++ L+P  K +   + SK  G+   T +G   +++ K                                QDI   
Subjt:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNE--ENSKTRGQLFTTCSGKTLLKERK--------------------------------QDILAL

Query:  KEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDL
        + PD LPP+RDIQHHIDL+PGASLPNL HYRM+PKE   L + +EDL+ KG I+ S+SPCAVPALLTPKKDGSWRMCVDSRAIN+ITVKYRFPIPR+ D+
Subjt:  KEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDL

Query:  LDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQV
        LD L  + IFSKIDLRSGYHQIRIRPGDEWKTAFKT EGL+EW+VMPFGLSNAPSTFMR+MNQVL PF+ KF+VVYFDDIL+YS    EHL H++++   
Subjt:  LDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQV

Query:  LTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEEI
        L E +LYIN KKC FL + + FLGFII    + ++  KV AIR+W  P TV ++++F GLA+FYR+FI+NFSSI AP+TDC+KKG F+W   Q+ SF  I
Subjt:  LTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEEI

Query:  KKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNIS
        K++L+++PVL LP F   F+V  DA  TGIGAVLSQ G P+E+ SEKL+ ARQ W+TYE EL+A+VRALK WEHYL+ +EFV+ +DH +LK++ +Q ++S
Subjt:  KKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNIS

Query:  RMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLREA
        RMH RWI++LQRF FV+KH+AG++NKVADALSR+ +LL ++SSEI +FE L ELY+ D DF   W KC     +  +HI +G+LFKG+QLCIP TSLRE 
Subjt:  RMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLREA

Query:  LIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKM
        ++++ HS GL GH  +DKT   +  RYYWPQL++D   FV++C +CQ AKG   N GLYTPLP+P+ IWEDL++DF+LGLP+TQR  DSV VVVDRFSKM
Subjt:  LIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKM

Query:  AHFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQA
        AHFI CK+T+DA                            FLSHFW+TLWRKFDT+L+YS+ AHPQTDGQTEVTNRTLGNLIRC SG +PKQWD+   Q 
Subjt:  AHFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQA

Query:  EFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPTG
        EFA+N M NRST K PFE+VYT+ P+   DLA LP    S   AE  A+    + +EV  +L ++ N YK AADK RR  VF++GDLVMV LRKNRFP G
Subjt:  EFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPTG

Query:  TYNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHAPDD
        TYNKLK+R+ GPFR+  K  DNA+ VELP DM I S FN+ADL  YH PD+
Subjt:  TYNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHAPDD

A0A5B7BER3 Uncharacterized protein1.3e-1335.36Show/hide
Query:  RRMDELPAPPQ-LEAEMADNGEINNRRRQRGRGNFRNLHNPRYANRGRADLNPPPQFYADVHNEQEDESSSSDDQNNPWDALDETQRGQHGRRFGTRAET
        +R D L    Q ++  +AD G   NR R+R R    ++   +   R           YA+        +S SD++   + A     RG +GR    R + 
Subjt:  RRMDELPAPPQ-LEAEMADNGEINNRRRQRGRGNFRNLHNPRYANRGRADLNPPPQFYADVHNEQEDESSSSDDQNNPWDALDETQRGQHGRRFGTRAET

Query:  HHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRPWQHDTQTLHKGR
          +++MK+DLPS++G   IESFLDW+   E FF  M+  + K+V+LVA KLKGGASAWW+Q++ NR+R  +   +T  K R
Subjt:  HHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRPWQHDTQTLHKGR

A0A5B7BER3 Uncharacterized protein5.2e-31054.74Show/hide
Query:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKK-NEENSKTRGQLFTTCSGKTLLKERKQD--------------------------------ILALK
        RPWQ+D    +KG++N Y F   G+++ L PL +       + + +      G+T L +  +D                                I+  +
Subjt:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKK-NEENSKTRGQLFTTCSGKTLLKERKQD--------------------------------ILALK

Query:  EPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLL
         P+GLPP+RDIQHHIDLIPGASLPN  HYRM+P+E A L   +E+L+KKG ++ S+SPCAVPALL PKKDGSWRMC+DSRAIN+IT+KYRFPIPR+ D+L
Subjt:  EPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLL

Query:  DQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVL
        D L  + IFSKIDLRSGYHQIRIRPGDEWKTAFKT EGL+EW+VMPFGLSNAPSTFMRLMNQVL PF   F+VVYFDDIL+YS    +H+ HL+++F VL
Subjt:  DQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVL

Query:  TEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEEIK
           +L++N  KC F+ S + FLGF++    + ++  KV AIR+W  P  + EV++F GLA+FYR+F+++FS I AP+T+C+KKG F W    + SF  IK
Subjt:  TEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEEIK

Query:  KRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISR
        ++LAS+PVL LPDF   FEV  DA   GIGAVLSQ   P+ + SEKLS AR+ WSTYE ELYA+ RA+K WEHYL+ +EF+L +DH +LK++ +Q N++R
Subjt:  KRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISR

Query:  MHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLREAL
        MHARW++++QRF+F +KH++G+ NKVADALSRK SLLT L +E+I FE + +LY  D DF + W KC   L  EG H  +G+LF+G+QLCIP +SLRE +
Subjt:  MHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLREAL

Query:  IKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMA
        I E H  GL GH  +DKT      RYYWPQL++D  N VKRC  CQ +KG   N GLY PLPIP   WEDLS+DF+LGLP+TQR  DSV VVVDRFSKMA
Subjt:  IKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMA

Query:  HFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQAE
        HFIACK+T+DA                            FLSHFW+TLWR+FDTTL +S+ +HPQTDGQTEV NRTLGNLIRCLSG +PKQWDL+ AQAE
Subjt:  HFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQAE

Query:  FAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPTGT
        FA+N+M NRST K PF+VVY + P+   DL  LP     +  AE MA+ +  + +EV  +L  S   YK AADKKRR  +F +GDLVMV+LRK R P GT
Subjt:  FAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPTGT

Query:  YNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHAPDD
         +KL D++ GP++I +K  DNA++V+LP DM I   FN+ADL  YH PD+
Subjt:  YNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHAPDD

A0A6D2IKM3 Reverse transcriptase2.7e-30454.9Show/hide
Query:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEENSKTRGQ-LFTTCSGKTL---LKERK---------------------QDILALKE-------
        RPWQ+D    ++GR+N   F W   +I + P+   ++   K +   L  T S K L    KE K                     +++L + E       
Subjt:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEENSKTRGQ-LFTTCSGKTL---LKERK---------------------QDILALKE-------

Query:  ---PDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGD
           P+ LPP+RDIQHHIDLIPG+SLPNL HYRM+PKE   L E IEDLLKKG I+ S+SPCAVP LL PKK   WRMCVDSRAIN+IT+KYRFPIPR+ D
Subjt:  ---PDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGD

Query:  LLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQ
        +LD+L  + IFSKIDLRSGYHQIRIRPGDEWKTAFK+ +GL+EW+VMPFGLSNAPSTFMRLMNQ+L PF   F+VVYFDDIL+YS   +EHL HLK++ Q
Subjt:  LLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQ

Query:  VLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEE
        VL E +LY+N KKC F  +++ FLGF++ +  + ++  KV AIR+W AP +V EV++F GL +FYR+F+++FS+I AP+T+CLKKG F W S Q +SF  
Subjt:  VLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEE

Query:  IKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNI
        IK++L ++PVL LPDF   F+V  DA   GIGAVLSQ   P+ + SEKLS ARQ WSTY+QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L SQK I
Subjt:  IKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNI

Query:  SRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLRE
        ++MHARW+S+LQ+F F+I+H++G  NKVADALSR+ SLL  L+ EI+ FE L ELYE DA+F ++W KC+    +  +HI +GFLFKGD+LCIP +SLRE
Subjt:  SRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLRE

Query:  ALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSK
         LI++ H  GL+GH  +DKT  ++  RYYWP LR+D+   VKRC +CQ +KG   N GLY PLP+P  IW+DLS+DFVLGLP+TQR  DSV VVVDRFSK
Subjt:  ALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSK

Query:  MAHFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQ
        M HFIACK+T DA                            FLSHFW TLWR F TTLK S+ AHPQTDGQTEVTNRTLGN+IR + G +PKQWDL+  Q
Subjt:  MAHFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQ

Query:  AEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPT
         EFA+N+  + +T K PF +VYT  P+   DL  LP        AETMAE+I    + V   L  +    K AADK+RR  VF +GD VMV LRK RFP 
Subjt:  AEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPT

Query:  GTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHA
        GTY KL+  + GPF++  K  DNA+ V+LP +M+I + FN+AD+  YHA
Subjt:  GTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHA

A0A6N2LVR1 Uncharacterized protein9.3e-1556.41Show/hide
Query:  FKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRPWQHDTQTLHKGR
        F+MK+DLPS++G+  IE FLDWL   E FF YM+ PE KKV+LVA +L GGASAWWEQL+  R R  +   Q+  K R
Subjt:  FKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRPWQHDTQTLHKGR

A0A6N2LVR1 Uncharacterized protein4.7e-30954.48Show/hide
Query:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEENSKTRGQLFTTCSG-KTLLKERKQ--------------------------------DILALK
        RPWQHD    H+G+ N Y F W GK I +LPL   +         L T  S  K    ERK+                                 ++A  
Subjt:  RPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTKKNEENSKTRGQLFTTCSG-KTLLKERKQ--------------------------------DILALK

Query:  EPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLL
         PD LPPLR+IQH IDL+PGASLPNL HYRM+PKE   L E +E+LL+KGHI+ S+SPCAVPALLTPKKDGSWRMCVDSRAIN+ITV+YRFPIPR+ DLL
Subjt:  EPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLL

Query:  DQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVL
        DQL  A +FSKIDLRSGYHQIRI+PGDEWKTAFKT +GL+EW+VMPFGLSNAPSTFMRLM QVL PF+ KF+VVYFDDILVYS    EHL HL+K+ + L
Subjt:  DQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVL

Query:  TEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEEIK
        TE EL++N KKC FL +++ FLG+I+    + ++  KV+A+R+W +P T+ EV++F GLA+FYR+F++NFSSI AP+T+C+KKG FKWT   +ESF+ IK
Subjt:  TEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFEEIK

Query:  KRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISR
        +RL ++PVL LP+F + FE+  DAC TGIGAVLSQ G P+ + SEKL+ ARQ WSTYEQELYA+V+A+K+WEHYL+ +EFV+ +DH +LKY Q+Q+++++
Subjt:  KRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISR

Query:  MHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLREAL
        +HARW S+L++F++VIKH++G  NKVADALSRK +LL  +S++++ FE +  LYE D DF   W +         + +L+G+LFKG++LCIP TSLR  L
Subjt:  MHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLREAL

Query:  IKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMA
        IKE H+ GL+ H  +DKT  ++  R+YWPQL++D  +FV+RC VCQ  KG   N GLY PLP+P+S W D+S+DFVLGLP+TQR  DSV VVVDRFSKMA
Subjt:  IKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMA

Query:  HFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQAE
        HFI CK+T+DA                            FL+HFW TLWR+  T+L +S+ AHPQTDGQTEV NRTLGN+IRCL G KPK WD+S AQAE
Subjt:  HFIACKRTNDA----------------------------FLSHFWKTLWRKFDTTLKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQAE

Query:  FAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPTGT
        FA+N+  + ST   PF+VVY   PR   DL  LP   + + +A  M E+++  H+ V   + +S   YK AADK RR  +F  GD VMV LRK RFP GT
Subjt:  FAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPTGT

Query:  YNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHAPD
        Y+KL+ ++ GP++I  K  DNA+ V+LP  M I   FN++D+  +H  D
Subjt:  YNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHAPD

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein6.9e-12431.91Show/hide
Query:  QDILALKEPDGLP-PLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFP
        +DI A    + LP P++ ++  ++L        + +Y + P +  A+++ I   LK G I+ S +  A P +  PKK+G+ RM VD + +N+      +P
Subjt:  QDILALKEPDGLP-PLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFP

Query:  IPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLH
        +P +  LL ++  +TIF+K+DL+S YH IR+R GDE K AF+   G+FE++VMP+G+S AP+ F   +N +L       +V Y DDIL++S    EH+ H
Subjt:  IPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLH

Query:  LKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKG-NFKWTSF
        +K + Q L    L INQ KCEF +S++ F+G+ I +   +     ++ + +W  P   KE++ FLG  ++ RKFI   S +  PL + LKK   +KWT  
Subjt:  LKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKG-NFKWTSF

Query:  QQESFEEIKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRG-----HPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLS--KEFVLLT
        Q ++ E IK+ L S PVL+  DFS    +  DA    +GAVLSQ+      +P+ Y S K+S A+  +S  ++E+ A++++LK W HYL S  + F +LT
Subjt:  QQESFEEIKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRG-----HPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLS--KEFVLLT

Query:  DHFSL--KYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSR------------KGSLLTLLSSEIIAFEHLPEL---YERDADFADIWHKC
        DH +L  +     +  ++  ARW  +LQ F+F I ++ G  N +ADALSR            + + +  ++   I  +   ++   Y  D    ++ +  
Subjt:  DHFSL--KYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSR------------KGSLLTLLSSEIIAFEHLPEL---YERDADFADIWHKC

Query:  SNHLRAEGYHILEGFLFKG-DQLCIPH-TSLREALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKG-SQTNAGLYTPLPIP
           +  E   + +G L    DQ+ +P+ T L   +IK+ H  G   H   +   + I  R+ W  +RK    +V+ C  CQ  K  +    G   P+P  
Subjt:  SNHLRAEGYHILEGFLFKG-DQLCIPH-TSLREALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKG-SQTNAGLYTPLPIP

Query:  QSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKRT---------------------------ND-AFLSHFWKTLWRKFDTTLKYSTIAHP
        +  WE LS+DF+  LP++   ++++ VVVDRFSKMA  + C ++                           ND  F S  WK    K++  +K+S    P
Subjt:  QSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKRT---------------------------ND-AFLSHFWKTLWRKFDTTLKYSTIAHP

Query:  QTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQAEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQS
        QTDGQTE TN+T+  L+RC+  + P  W    +  + ++NN  + +T   PFE+V+   P     L+ L +   S K  E   E I+ + + V +HL  +
Subjt:  QTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQAEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQS

Query:  TNSYKKAADKKRRQ-TVFSKGDLVMVHLRKNRFPTGTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMH--IHSVFNIADLKPYHAPDDFQLA
            KK  D K ++   F  GDLVMV   K  F     NKL     GPF + +K G N ++++LP  +     S F+++ L+ Y    +   A
Subjt:  TNSYKKAADKKRRQ-TVFSKGDLVMVHLRKNRFPTGTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMH--IHSVFNIADLKPYHAPDDFQLA

P0CT35 Transposon Tf2-2 polyprotein6.9e-12431.91Show/hide
Query:  QDILALKEPDGLP-PLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFP
        +DI A    + LP P++ ++  ++L        + +Y + P +  A+++ I   LK G I+ S +  A P +  PKK+G+ RM VD + +N+      +P
Subjt:  QDILALKEPDGLP-PLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFP

Query:  IPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLH
        +P +  LL ++  +TIF+K+DL+S YH IR+R GDE K AF+   G+FE++VMP+G+S AP+ F   +N +L       +V Y DDIL++S    EH+ H
Subjt:  IPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLH

Query:  LKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKG-NFKWTSF
        +K + Q L    L INQ KCEF +S++ F+G+ I +   +     ++ + +W  P   KE++ FLG  ++ RKFI   S +  PL + LKK   +KWT  
Subjt:  LKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKG-NFKWTSF

Query:  QQESFEEIKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRG-----HPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLS--KEFVLLT
        Q ++ E IK+ L S PVL+  DFS    +  DA    +GAVLSQ+      +P+ Y S K+S A+  +S  ++E+ A++++LK W HYL S  + F +LT
Subjt:  QQESFEEIKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRG-----HPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLS--KEFVLLT

Query:  DHFSL--KYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSR------------KGSLLTLLSSEIIAFEHLPEL---YERDADFADIWHKC
        DH +L  +     +  ++  ARW  +LQ F+F I ++ G  N +ADALSR            + + +  ++   I  +   ++   Y  D    ++ +  
Subjt:  DHFSL--KYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSR------------KGSLLTLLSSEIIAFEHLPEL---YERDADFADIWHKC

Query:  SNHLRAEGYHILEGFLFKG-DQLCIPH-TSLREALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKG-SQTNAGLYTPLPIP
           +  E   + +G L    DQ+ +P+ T L   +IK+ H  G   H   +   + I  R+ W  +RK    +V+ C  CQ  K  +    G   P+P  
Subjt:  SNHLRAEGYHILEGFLFKG-DQLCIPH-TSLREALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKG-SQTNAGLYTPLPIP

Query:  QSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKRT---------------------------ND-AFLSHFWKTLWRKFDTTLKYSTIAHP
        +  WE LS+DF+  LP++   ++++ VVVDRFSKMA  + C ++                           ND  F S  WK    K++  +K+S    P
Subjt:  QSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKRT---------------------------ND-AFLSHFWKTLWRKFDTTLKYSTIAHP

Query:  QTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQAEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQS
        QTDGQTE TN+T+  L+RC+  + P  W    +  + ++NN  + +T   PFE+V+   P     L+ L +   S K  E   E I+ + + V +HL  +
Subjt:  QTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQAEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQS

Query:  TNSYKKAADKKRRQ-TVFSKGDLVMVHLRKNRFPTGTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMH--IHSVFNIADLKPYHAPDDFQLA
            KK  D K ++   F  GDLVMV   K  F     NKL     GPF + +K G N ++++LP  +     S F+++ L+ Y    +   A
Subjt:  TNSYKKAADKKRRQ-TVFSKGDLVMVHLRKNRFPTGTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMH--IHSVFNIADLKPYHAPDDFQLA

P0CT41 Transposon Tf2-12 polyprotein6.9e-12431.91Show/hide
Query:  QDILALKEPDGLP-PLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFP
        +DI A    + LP P++ ++  ++L        + +Y + P +  A+++ I   LK G I+ S +  A P +  PKK+G+ RM VD + +N+      +P
Subjt:  QDILALKEPDGLP-PLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFP

Query:  IPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLH
        +P +  LL ++  +TIF+K+DL+S YH IR+R GDE K AF+   G+FE++VMP+G+S AP+ F   +N +L       +V Y DDIL++S    EH+ H
Subjt:  IPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLH

Query:  LKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKG-NFKWTSF
        +K + Q L    L INQ KCEF +S++ F+G+ I +   +     ++ + +W  P   KE++ FLG  ++ RKFI   S +  PL + LKK   +KWT  
Subjt:  LKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKG-NFKWTSF

Query:  QQESFEEIKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRG-----HPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLS--KEFVLLT
        Q ++ E IK+ L S PVL+  DFS    +  DA    +GAVLSQ+      +P+ Y S K+S A+  +S  ++E+ A++++LK W HYL S  + F +LT
Subjt:  QQESFEEIKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRG-----HPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLS--KEFVLLT

Query:  DHFSL--KYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSR------------KGSLLTLLSSEIIAFEHLPEL---YERDADFADIWHKC
        DH +L  +     +  ++  ARW  +LQ F+F I ++ G  N +ADALSR            + + +  ++   I  +   ++   Y  D    ++ +  
Subjt:  DHFSL--KYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSR------------KGSLLTLLSSEIIAFEHLPEL---YERDADFADIWHKC

Query:  SNHLRAEGYHILEGFLFKG-DQLCIPH-TSLREALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKG-SQTNAGLYTPLPIP
           +  E   + +G L    DQ+ +P+ T L   +IK+ H  G   H   +   + I  R+ W  +RK    +V+ C  CQ  K  +    G   P+P  
Subjt:  SNHLRAEGYHILEGFLFKG-DQLCIPH-TSLREALIKEAHSNGLAGHFEQDKTFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKG-SQTNAGLYTPLPIP

Query:  QSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKRT---------------------------ND-AFLSHFWKTLWRKFDTTLKYSTIAHP
        +  WE LS+DF+  LP++   ++++ VVVDRFSKMA  + C ++                           ND  F S  WK    K++  +K+S    P
Subjt:  QSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKRT---------------------------ND-AFLSHFWKTLWRKFDTTLKYSTIAHP

Query:  QTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQAEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQS
        QTDGQTE TN+T+  L+RC+  + P  W    +  + ++NN  + +T   PFE+V+   P     L+ L +   S K  E   E I+ + + V +HL  +
Subjt:  QTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQAEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQS

Query:  TNSYKKAADKKRRQ-TVFSKGDLVMVHLRKNRFPTGTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMH--IHSVFNIADLKPYHAPDDFQLA
            KK  D K ++   F  GDLVMV   K  F     NKL     GPF + +K G N ++++LP  +     S F+++ L+ Y    +   A
Subjt:  TNSYKKAADKKRRQ-TVFSKGDLVMVHLRKNRFPTGTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMH--IHSVFNIADLKPYHAPDDFQLA

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.2e-13733.8Show/hide
Query:  GKRITLLPLTKKNEENSKTRGQLFTTCSGKTLLKERKQDILALKEPDGLPPLRDI--QHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKP
        GK   ++   +  E N+       T C+    L+++ ++I+    P     + +I  +H I++ PGA LP L  Y +T K    +++ ++ LL    I P
Subjt:  GKRITLLPLTKKNEENSKTRGQLFTTCSGKTLLKERKQDILALKEPDGLPPLRDI--QHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKP

Query:  SLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPS
        S SPC+ P +L PKKDG++R+CVD R +N+ T+   FP+PR+ +LL ++G A IF+ +DL SGYHQI + P D +KTAF T  G +E+ VMPFGL NAPS
Subjt:  SLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPS

Query:  TFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQ
        TF R M         +F+ VY DDIL++S   +EH  HL  + + L  + L + +KKC+F   E  FLG+ I   +++    K  AIR++  P TVK+ Q
Subjt:  TFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQ

Query:  AFLGLASFYRKFIKNFSSICAP--LTDCLKKGNFKWTSFQQESFEEIKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHP------IEYLSEK
         FLG+ ++YR+FI N S I  P  L  C K    +WT  Q ++ E++K  L +SPVL   +  + + +  DA   GIGAVL +  +       + Y S+ 
Subjt:  AFLGLASFYRKFIKNFSSICAP--LTDCLKKGNFKWTSFQQESFEEIKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHP------IEYLSEK

Query:  LSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIA
        L +A++ +   E EL  +++AL  + + L  K F L TDH SL  LQ++   +R   RW+  L  +DF +++ AG +N VADA+SR    +T  +S  I 
Subjt:  LSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIA

Query:  FE-----------------HLPELYERDADFADIWHKCSNHLRAE-------GYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGL-AGHFEQDKTFDT
         E                 H+ EL + +    D+    S   + E        Y + +  ++  D+L +P    + A+++  H + L  GHF    T   
Subjt:  FE-----------------HLPELYERDADFADIWHKCSNHLRAE-------GYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGL-AGHFEQDKTFDT

Query:  ISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNA-GLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKRTNDA---------
        IS  YYWP+L+     +++ C  CQ  K  +    GL  PLPI +  W D+S+DFV GLP T  N + ++VVVDRFSK AHFIA ++T DA         
Subjt:  ISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNA-GLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKRTNDA---------

Query:  --FLSH-FWKTLWRKFD---TTLKY-------------STIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQAEFAFNNMKNRSTDKCPFEVVY
          F  H F +T+    D   T  KY             S+  HPQTDGQ+E T +TL  L+R    +  + W +   Q EF +N+   R+  K PFE+  
Subjt:  --FLSH-FWKTLWRKFD---TTLKY-------------STIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQAEFAFNNMKNRSTDKCPFEVVY

Query:  TRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPTGTYNKLKDRQIGPFRITEKYGD
           P      +   V   S    E +A+ ++ L  +  + L  +    +   +++R+  + + GD V+VH R   F  G Y K++   +GPFR+ +K  D
Subjt:  TRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPTGTYNKLKDRQIGPFRITEKYGD

Query:  NAFKVELPPDMHIHSVFNIADLK
        NA++++L      H V N+  LK
Subjt:  NAFKVELPPDMHIHSVFNIADLK

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.1e-13733.58Show/hide
Query:  GKRITLLPLTKKNEENSKTRGQLFTTCSGKTLLKERKQDILALKEPDGLPPLRDI--QHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKP
        GK   ++   +  E N+       T C+    L+++ ++I+    P     + +I  +H I++ PGA LP L  Y +T K    +++ ++ LL    I P
Subjt:  GKRITLLPLTKKNEENSKTRGQLFTTCSGKTLLKERKQDILALKEPDGLPPLRDI--QHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKP

Query:  SLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPS
        S SPC+ P +L PKKDG++R+CVD R +N+ T+   FP+PR+ +LL ++G A IF+ +DL SGYHQI + P D +KTAF T  G +E+ VMPFGL NAPS
Subjt:  SLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPS

Query:  TFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQ
        TF R M         +F+ VY DDIL++S   +EH  HL  + + L  + L + +KKC+F   E  FLG+ I   +++    K  AIR++  P TVK+ Q
Subjt:  TFMRLMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQ

Query:  AFLGLASFYRKFIKNFSSICAP--LTDCLKKGNFKWTSFQQESFEEIKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHP------IEYLSEK
         FLG+ ++YR+FI N S I  P  L  C K    +WT  Q ++ +++K  L +SPVL   +  + + +  DA   GIGAVL +  +       + Y S+ 
Subjt:  AFLGLASFYRKFIKNFSSICAP--LTDCLKKGNFKWTSFQQESFEEIKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHP------IEYLSEK

Query:  LSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIA
        L +A++ +   E EL  +++AL  + + L  K F L TDH SL  LQ++   +R   RW+  L  +DF +++ AG +N VADA+SR    +T  +S  I 
Subjt:  LSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWISYLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIA

Query:  FE-----------------HLPELYERDADFADIWHKCSNHLRAE-------GYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGL-AGHFEQDKTFDT
         E                 H+ EL + +    D+    S   + E        Y + +  ++  D+L +P    + A+++  H + L  GHF    T   
Subjt:  FE-----------------HLPELYERDADFADIWHKCSNHLRAE-------GYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGL-AGHFEQDKTFDT

Query:  ISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNA-GLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKRTNDA---------
        IS  YYWP+L+     +++ C  CQ  K  +    GL  PLPI +  W D+S+DFV GLP T  N + ++VVVDRFSK AHFIA ++T DA         
Subjt:  ISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNA-GLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKRTNDA---------

Query:  --FLSH-FWKTLWRKFD---TTLKY-------------STIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQAEFAFNNMKNRSTDKCPFEVVY
          F  H F +T+    D   T  KY             S+  HPQTDGQ+E T +TL  L+R  + +  + W +   Q EF +N+   R+  K PFE+  
Subjt:  --FLSH-FWKTLWRKFD---TTLKY-------------STIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQAEFAFNNMKNRSTDKCPFEVVY

Query:  TRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPTGTYNKLKDRQIGPFRITEKYGD
           P      +   V   S    E +A+ ++ L  +  + L  +    +   +++R+  + + GD V+VH R   F  G Y K++   +GPFR+ +K  D
Subjt:  TRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQSTNSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPTGTYNKLKDRQIGPFRITEKYGD

Query:  NAFKVELPPDMHIHSVFNIADLKPY-HAPDDF
        NA++++L      H V N+  LK + + PD +
Subjt:  NAFKVELPPDMHIHSVFNIADLKPY-HAPDDF

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein9.2e-2339.85Show/hide
Query:  HLKKLFQVLTEKELYINQKKCEFLKSEITFLG--FIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWT
        HL  + Q+  + + Y N+KKC F + +I +LG   II    VS +P K+EA+  W  P    E++ FLGL  +YR+F+KN+  I  PLT+ LKK + KWT
Subjt:  HLKKLFQVLTEKELYINQKKCEFLKSEITFLG--FIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWT

Query:  SFQQESFEEIKKRLASSPVLQLPDFSSPFEVAV
             +F+ +K  + + PVL LPD   PF   V
Subjt:  SFQQESFEEIKKRLASSPVLQLPDFSSPFEVAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGCAGCTTGGTGCTCTTCAAATGCAGAATTCAAAGATTATTCAATACAGGACATCTGCCTCAACTGGACTGCCTTCATCCAAAGAGAATCAGGGCTGTTATTTTTC
CAGATTGCTAGAACTGGCTTTTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAG
TAGCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTA
GCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGC
TGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTG
TTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTT
TTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTT
GTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTTGT
CTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTCTTCTGTTAATAAGCTGTTGGCAGCTTCCAGTGCTGAACCCCCGAATGGACTGCAAGGA
TCACGGGTGAAGAATTTAAAGTTATGGGTTTTGATGATCCATAACTTTGAACTTTTGGCAAGAAGGATGGACGAATTACCGGCACCACCGCAGCTCGAAGCAGAAATGGC
AGACAATGGGGAGATCAACAACAGACGAAGGCAACGTGGACGGGGAAATTTCAGAAACCTCCACAATCCACGATACGCTAATCGAGGAAGGGCTGATCTCAACCCACCAC
CACAATTCTACGCCGACGTCCATAATGAGCAAGAAGACGAATCTTCGAGCAGTGACGACCAGAACAACCCCTGGGACGCCTTAGACGAAACACAGAGAGGACAACACGGC
AGGAGATTCGGCACGCGGGCAGAAACTCATCATGACTTCAAGATGAAGGTTGACTTACCTTCATATAGTGGCAAGAGAGATATCGAATCCTTTTTGGATTGGCTAAAAAG
TACTGAGAACTTTTTCAGTTACATGGACACCCCTGAACAGAAAAAGGTACGTCTCGTGGCCTTGAAACTCAAAGGGGGCGCATCAGCATGGTGGGAGCAACTGGAAGCCA
ACAGGCAAAGACCGTGGCAACACGACACTCAAACCTTGCATAAGGGGAGAGAAAACACATACGAATTCCACTGGATGGGTAAACGGATAACTCTACTGCCCTTGACCAAA
AAGAACGAGGAGAACAGTAAGACAAGGGGCCAGTTATTCACAACATGCAGTGGCAAAACCCTTTTAAAAGAAAGAAAGCAGGATATTTTAGCCCTTAAAGAACCTGACGG
ACTGCCACCTCTTCGAGACATTCAGCACCATATAGATCTGATCCCCGGAGCATCACTGCCAAACCTGGCTCATTACAGAATGACCCCTAAAGAGTATGCAGCGCTCCATG
AACATATCGAAGACCTACTCAAGAAAGGGCATATTAAGCCAAGCCTCAGTCCTTGTGCTGTCCCAGCTCTCCTCACCCCCAAGAAGGATGGAAGTTGGAGAATGTGTGTA
GATAGCCGAGCCATTAACCGAATCACAGTAAAATACAGATTCCCTATCCCAAGAGTTGGAGACCTCTTGGATCAACTCGGCAAGGCCACCATCTTCTCGAAAATTGACCT
AAGGAGTGGATACCATCAAATACGTATCCGACCAGGTGATGAATGGAAGACTGCCTTCAAAACGAATGAAGGGCTGTTTGAATGGATGGTAATGCCCTTCGGCCTATCCA
ATGCTCCCAGTACCTTCATGAGGCTAATGAATCAGGTACTGCACCCCTTCCTCAACAAATTCATTGTGGTTTATTTCGACGATATATTGGTGTACAGCAGTGGGAACGAC
GAACACTTGCTCCACCTTAAAAAGTTGTTTCAAGTATTGACAGAAAAGGAACTATACATCAATCAAAAGAAGTGTGAATTCTTGAAGTCTGAAATTACATTTCTTGGTTT
TATAATCAAGAAAGGAGAAGTAAGCATGGAACCAAGAAAGGTTGAAGCAATACGAGAGTGGTTGGCTCCAACCACTGTCAAAGAAGTTCAAGCCTTCCTAGGACTGGCTT
CTTTTTACAGAAAGTTTATAAAGAACTTCAGCTCCATCTGCGCACCACTAACCGACTGCTTAAAGAAGGGAAACTTTAAGTGGACTTCATTCCAACAAGAGAGCTTCGAG
GAAATAAAGAAAAGGTTAGCTTCTAGCCCTGTTCTGCAACTGCCAGATTTCTCTTCTCCTTTCGAAGTAGCAGTAGACGCCTGTTGCACAGGAATTGGGGCAGTCTTATC
CCAACGAGGTCACCCGATTGAATACCTCAGTGAGAAATTGAGCACCGCACGACAAACATGGAGCACATACGAGCAAGAATTATATGCCTTAGTTCGAGCTCTCAAGCAGT
GGGAACATTACTTGCTCTCCAAAGAGTTTGTTCTCTTAACAGATCATTTCTCCCTGAAATATCTTCAATCTCAAAAGAATATCAGCCGGATGCATGCCCGTTGGATATCT
TATTTACAGAGATTCGATTTTGTTATCAAACACCAGGCTGGAAAAGAGAACAAGGTGGCTGACGCACTGAGTAGAAAAGGCTCCCTACTCACACTTCTCTCCTCAGAAAT
AATTGCTTTCGAGCACCTGCCAGAACTATACGAAAGAGATGCTGACTTCGCAGACATCTGGCATAAATGCTCCAATCACCTAAGGGCTGAAGGATACCACATCCTAGAAG
GATTCCTCTTCAAGGGAGACCAATTATGCATACCACACACTTCCCTGCGGGAAGCCTTAATAAAGGAAGCTCATTCCAACGGATTAGCCGGACATTTCGAGCAAGACAAG
ACCTTTGACACAATCTCCATACGGTACTACTGGCCACAGCTAAGAAAAGACTCCAATAACTTTGTGAAGAGATGTTCCGTTTGTCAACGGGCCAAGGGCTCCCAAACTAA
TGCAGGGTTATATACCCCGCTGCCGATTCCACAATCAATCTGGGAAGATCTCTCAATTGACTTTGTACTCGGACTCCCCAAGACTCAAAGAAACCATGATTCAGTCATGG
TGGTTGTTGACCGATTCAGCAAAATGGCTCACTTCATCGCTTGCAAGAGAACGAATGATGCTTTCCTAAGCCATTTCTGGAAGACACTGTGGAGAAAGTTTGATACAACG
CTGAAGTATAGCACAATAGCTCACCCTCAGACCGACGGACAGACTGAGGTAACAAACAGAACACTGGGTAACTTGATACGCTGCCTCAGTGGCTCTAAACCTAAACAGTG
GGATTTGTCCCACGCACAAGCAGAGTTTGCCTTCAACAACATGAAGAACCGATCGACTGACAAATGCCCCTTTGAAGTCGTGTACACTAGACGACCGAGGTTGACATTTG
ACCTCGCATCCCTCCCTGTTACTGTAGAAAGTCACAAAGAGGCCGAAACCATGGCAGAGGATATCGAGAAACTACACAAGGAAGTTCATGACCACCTTGTCCAATCCACT
AATTCCTATAAGAAAGCAGCAGACAAAAAGAGGAGGCAAACTGTTTTCTCCAAAGGGGACTTAGTAATGGTACACCTAAGAAAAAACAGATTCCCCACTGGAACGTACAA
CAAACTGAAGGATAGACAGATCGGCCCATTTCGCATTACAGAAAAATATGGAGATAATGCTTTTAAGGTCGAACTTCCCCCAGACATGCACATCCATTCGGTATTCAATA
TTGCAGACTTGAAACCCTATCATGCCCCAGACGACTTCCAGCTTGCTGACTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGCAGCTTGGTGCTCTTCAAATGCAGAATTCAAAGATTATTCAATACAGGACATCTGCCTCAACTGGACTGCCTTCATCCAAAGAGAATCAGGGCTGTTATTTTTC
CAGATTGCTAGAACTGGCTTTTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAG
TAGCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTA
GCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGC
TGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTG
TTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTT
TTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTT
GTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTTGTCTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTTGT
CTGTTAGTAGCTGTTATTCCTGTTTTCAACTGTTGTTGTTAGTAGCTGTTTCTTCTGTTAATAAGCTGTTGGCAGCTTCCAGTGCTGAACCCCCGAATGGACTGCAAGGA
TCACGGGTGAAGAATTTAAAGTTATGGGTTTTGATGATCCATAACTTTGAACTTTTGGCAAGAAGGATGGACGAATTACCGGCACCACCGCAGCTCGAAGCAGAAATGGC
AGACAATGGGGAGATCAACAACAGACGAAGGCAACGTGGACGGGGAAATTTCAGAAACCTCCACAATCCACGATACGCTAATCGAGGAAGGGCTGATCTCAACCCACCAC
CACAATTCTACGCCGACGTCCATAATGAGCAAGAAGACGAATCTTCGAGCAGTGACGACCAGAACAACCCCTGGGACGCCTTAGACGAAACACAGAGAGGACAACACGGC
AGGAGATTCGGCACGCGGGCAGAAACTCATCATGACTTCAAGATGAAGGTTGACTTACCTTCATATAGTGGCAAGAGAGATATCGAATCCTTTTTGGATTGGCTAAAAAG
TACTGAGAACTTTTTCAGTTACATGGACACCCCTGAACAGAAAAAGGTACGTCTCGTGGCCTTGAAACTCAAAGGGGGCGCATCAGCATGGTGGGAGCAACTGGAAGCCA
ACAGGCAAAGACCGTGGCAACACGACACTCAAACCTTGCATAAGGGGAGAGAAAACACATACGAATTCCACTGGATGGGTAAACGGATAACTCTACTGCCCTTGACCAAA
AAGAACGAGGAGAACAGTAAGACAAGGGGCCAGTTATTCACAACATGCAGTGGCAAAACCCTTTTAAAAGAAAGAAAGCAGGATATTTTAGCCCTTAAAGAACCTGACGG
ACTGCCACCTCTTCGAGACATTCAGCACCATATAGATCTGATCCCCGGAGCATCACTGCCAAACCTGGCTCATTACAGAATGACCCCTAAAGAGTATGCAGCGCTCCATG
AACATATCGAAGACCTACTCAAGAAAGGGCATATTAAGCCAAGCCTCAGTCCTTGTGCTGTCCCAGCTCTCCTCACCCCCAAGAAGGATGGAAGTTGGAGAATGTGTGTA
GATAGCCGAGCCATTAACCGAATCACAGTAAAATACAGATTCCCTATCCCAAGAGTTGGAGACCTCTTGGATCAACTCGGCAAGGCCACCATCTTCTCGAAAATTGACCT
AAGGAGTGGATACCATCAAATACGTATCCGACCAGGTGATGAATGGAAGACTGCCTTCAAAACGAATGAAGGGCTGTTTGAATGGATGGTAATGCCCTTCGGCCTATCCA
ATGCTCCCAGTACCTTCATGAGGCTAATGAATCAGGTACTGCACCCCTTCCTCAACAAATTCATTGTGGTTTATTTCGACGATATATTGGTGTACAGCAGTGGGAACGAC
GAACACTTGCTCCACCTTAAAAAGTTGTTTCAAGTATTGACAGAAAAGGAACTATACATCAATCAAAAGAAGTGTGAATTCTTGAAGTCTGAAATTACATTTCTTGGTTT
TATAATCAAGAAAGGAGAAGTAAGCATGGAACCAAGAAAGGTTGAAGCAATACGAGAGTGGTTGGCTCCAACCACTGTCAAAGAAGTTCAAGCCTTCCTAGGACTGGCTT
CTTTTTACAGAAAGTTTATAAAGAACTTCAGCTCCATCTGCGCACCACTAACCGACTGCTTAAAGAAGGGAAACTTTAAGTGGACTTCATTCCAACAAGAGAGCTTCGAG
GAAATAAAGAAAAGGTTAGCTTCTAGCCCTGTTCTGCAACTGCCAGATTTCTCTTCTCCTTTCGAAGTAGCAGTAGACGCCTGTTGCACAGGAATTGGGGCAGTCTTATC
CCAACGAGGTCACCCGATTGAATACCTCAGTGAGAAATTGAGCACCGCACGACAAACATGGAGCACATACGAGCAAGAATTATATGCCTTAGTTCGAGCTCTCAAGCAGT
GGGAACATTACTTGCTCTCCAAAGAGTTTGTTCTCTTAACAGATCATTTCTCCCTGAAATATCTTCAATCTCAAAAGAATATCAGCCGGATGCATGCCCGTTGGATATCT
TATTTACAGAGATTCGATTTTGTTATCAAACACCAGGCTGGAAAAGAGAACAAGGTGGCTGACGCACTGAGTAGAAAAGGCTCCCTACTCACACTTCTCTCCTCAGAAAT
AATTGCTTTCGAGCACCTGCCAGAACTATACGAAAGAGATGCTGACTTCGCAGACATCTGGCATAAATGCTCCAATCACCTAAGGGCTGAAGGATACCACATCCTAGAAG
GATTCCTCTTCAAGGGAGACCAATTATGCATACCACACACTTCCCTGCGGGAAGCCTTAATAAAGGAAGCTCATTCCAACGGATTAGCCGGACATTTCGAGCAAGACAAG
ACCTTTGACACAATCTCCATACGGTACTACTGGCCACAGCTAAGAAAAGACTCCAATAACTTTGTGAAGAGATGTTCCGTTTGTCAACGGGCCAAGGGCTCCCAAACTAA
TGCAGGGTTATATACCCCGCTGCCGATTCCACAATCAATCTGGGAAGATCTCTCAATTGACTTTGTACTCGGACTCCCCAAGACTCAAAGAAACCATGATTCAGTCATGG
TGGTTGTTGACCGATTCAGCAAAATGGCTCACTTCATCGCTTGCAAGAGAACGAATGATGCTTTCCTAAGCCATTTCTGGAAGACACTGTGGAGAAAGTTTGATACAACG
CTGAAGTATAGCACAATAGCTCACCCTCAGACCGACGGACAGACTGAGGTAACAAACAGAACACTGGGTAACTTGATACGCTGCCTCAGTGGCTCTAAACCTAAACAGTG
GGATTTGTCCCACGCACAAGCAGAGTTTGCCTTCAACAACATGAAGAACCGATCGACTGACAAATGCCCCTTTGAAGTCGTGTACACTAGACGACCGAGGTTGACATTTG
ACCTCGCATCCCTCCCTGTTACTGTAGAAAGTCACAAAGAGGCCGAAACCATGGCAGAGGATATCGAGAAACTACACAAGGAAGTTCATGACCACCTTGTCCAATCCACT
AATTCCTATAAGAAAGCAGCAGACAAAAAGAGGAGGCAAACTGTTTTCTCCAAAGGGGACTTAGTAATGGTACACCTAAGAAAAAACAGATTCCCCACTGGAACGTACAA
CAAACTGAAGGATAGACAGATCGGCCCATTTCGCATTACAGAAAAATATGGAGATAATGCTTTTAAGGTCGAACTTCCCCCAGACATGCACATCCATTCGGTATTCAATA
TTGCAGACTTGAAACCCTATCATGCCCCAGACGACTTCCAGCTTGCTGACTAA
Protein sequenceShow/hide protein sequence
MQQLGALQMQNSKIIQYRTSASTGLPSSKENQGCYFSRLLELAFSCYSCFQLLLLVAVLSVSSCYSCFQLLLLVAVLSVSSCYSCFQLLLLVAVLSVSSCYSCFQLLLLV
AVLSVSSCYSCFQLLLLVAVLSVSSCYSCFQLLLLVAVLSVSSCYSCFQLLLLVAVLSVSSCYSCFQLLLLVAVLSVSSCYSCFQLLLLVAVLSVSSCYSCFQLLLLVAV
LSVSSCYSCFQLLLLVAVLSVSSCYSCFQLLLLVAVLSVSSCYSCFQLLLLVAVLSVSSCYSCFQLLLLVAVLSVSSCYSCFQLLLLVAVSSVNKLLAASSAEPPNGLQG
SRVKNLKLWVLMIHNFELLARRMDELPAPPQLEAEMADNGEINNRRRQRGRGNFRNLHNPRYANRGRADLNPPPQFYADVHNEQEDESSSSDDQNNPWDALDETQRGQHG
RRFGTRAETHHDFKMKVDLPSYSGKRDIESFLDWLKSTENFFSYMDTPEQKKVRLVALKLKGGASAWWEQLEANRQRPWQHDTQTLHKGRENTYEFHWMGKRITLLPLTK
KNEENSKTRGQLFTTCSGKTLLKERKQDILALKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPKEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCV
DSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDLRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSSGND
EHLLHLKKLFQVLTEKELYINQKKCEFLKSEITFLGFIIKKGEVSMEPRKVEAIREWLAPTTVKEVQAFLGLASFYRKFIKNFSSICAPLTDCLKKGNFKWTSFQQESFE
EIKKRLASSPVLQLPDFSSPFEVAVDACCTGIGAVLSQRGHPIEYLSEKLSTARQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKNISRMHARWIS
YLQRFDFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFEHLPELYERDADFADIWHKCSNHLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFEQDK
TFDTISIRYYWPQLRKDSNNFVKRCSVCQRAKGSQTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKRTNDAFLSHFWKTLWRKFDTT
LKYSTIAHPQTDGQTEVTNRTLGNLIRCLSGSKPKQWDLSHAQAEFAFNNMKNRSTDKCPFEVVYTRRPRLTFDLASLPVTVESHKEAETMAEDIEKLHKEVHDHLVQST
NSYKKAADKKRRQTVFSKGDLVMVHLRKNRFPTGTYNKLKDRQIGPFRITEKYGDNAFKVELPPDMHIHSVFNIADLKPYHAPDDFQLAD