; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0004698 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0004698
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionReverse transcriptase
Genome locationchr01:18282413..18295447
RNA-Seq ExpressionPay0004698
SyntenyPay0004698
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004518 - nuclease activity (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040689.1 pol protein [Cucumis melo var. makuwa]1.3e-26772.29Show/hide
Query:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG
        MTVE+YDAEFD+LSRFAPEMIA +  RADKFVRGL+LDIQGLVRAF PATHADALRLAVDLSLQERANSSK + RG  SG KRKAEQQ + VPQRNFRSG
Subjt:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY
        GEFRRFQQKPFEAGEAA G       G        FGTRT FKCRQEGHTADRCP+RLTGNAQNQ AGAPHQG+V ATNKTE E+A TVV GTLPVLGHY
Subjt:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY

Query:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------
        ALV F+S SSHS IS AFVLHARLEV+PLHHVLSVSTPSGE MLSKEK                                                    
Subjt:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------

Query:  -------------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGAPVLFVK
                                   PDVFP+ELP LPP RE++FAIELEPGT PI RAPYRMAPAELK LKVQLQELL KGFIRP++SPWGAPVLFVK
Subjt:  -------------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGAPVLFVK

Query:  KKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMKRVFKNF
        KK GSMRLC+DYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKI+LRSGYHQLRI+D D+PKTAFRS YGHYEF+ MSFGLTNAPAVFMDLM RVF+ F
Subjt:  KKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMKRVFKNF

Query:  LDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAGYYK---
        LDT VIVFI+DIL+YSKT+A+HE+HL  VL+TLR NKLYAKFSKCE WLK+V+FLGHVVS  GVSVDP KIEAVT W RPSTVSE+RSFLGLAGYY+   
Subjt:  LDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAGYYK---

Query:  -------SPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK
               +PLTQLTRKG PFVWS A E SFQ LKQKLVTAPVL VPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK
Subjt:  -------SPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK

KAA0043391.1 pol protein [Cucumis melo var. makuwa]1.1e-26669.61Show/hide
Query:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG
        MTVE+YDAEFD+LSRFAPEMIA +  RADKFVRGL+LDIQGLVRAF PATHADALRLAVDLSLQERANSSK + RGL SG KRKAEQQ + VPQRNFRSG
Subjt:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY
        GEFRRFQQKPFEAGEAA G       G        FGTRT FKCRQEGHTADRCP+RLTGNAQNQGAGAPHQG+V ATNKTE E+A TVV GTLPVLGHY
Subjt:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY

Query:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------
        ALV F+S SSHS ISSAFVLHARLEV+PLHHVLSVSTPSGE MLSKEK                                                    
Subjt:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------

Query:  ---------------------------------------------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAE
                                                                   PDVFP+ELP LPP RE++FAIELEPGT PI RAPYRMAPAE
Subjt:  ---------------------------------------------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAE

Query:  LKVLKVQLQELLAKGFIRPNVSPWGAPVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRS
        LK LKVQLQELL KGFIRP+VSPWGAPVLFVKKK GSMRLC+DYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKI+LRSGYHQLRI+D D+PKTAFRS
Subjt:  LKVLKVQLQELLAKGFIRPNVSPWGAPVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRS

Query:  GYGHYEFVGMSFGLTNAPAVFMDLMKRVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDP
         YGHYEF+ MSFGLTNAPAVFMDLM RVF+ FLDT VIVFI+DIL+YSKT+A+HE+HL  VL+TLR NKLYAKFSKCE WLK+V+FLGHVVS  GVSVDP
Subjt:  GYGHYEFVGMSFGLTNAPAVFMDLMKRVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDP

Query:  SKIEAVTSWPRPSTVSEIRSFLGLAGYY----------KSPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLM
        +KIEAVT W RPSTVSE+RSFLGLAGYY           +PLTQLTRKG PFVWS A E SFQ LKQKLVTAPVL VPDGSGSFVIYSDASKKGLGCVLM
Subjt:  SKIEAVTSWPRPSTVSEIRSFLGLAGYY----------KSPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLM

Query:  QQGKVVAYASRQLK
        QQGKVVAYASRQLK
Subjt:  QQGKVVAYASRQLK

KAA0046185.1 pol protein [Cucumis melo var. makuwa]1.6e-26571.37Show/hide
Query:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG
        MTVE+YDAEFD+LSRFAPEMIA +  RADKFVRGL+LDIQGLVRAF PATHADALRLAVDLSLQERANSSK + RG  SG KRKAEQQ + VPQRNFRSG
Subjt:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY
        GEFRRFQQKPFEA EAA G       G        FGTRT FKCRQEGHTADRCP+RLTGN QNQGAGAPHQG+V ATNKTE E+A TVV GTLPVLGHY
Subjt:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY

Query:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------
        ALV F+S SSHS ISSAFVLHARLEV+PLHHVLSVSTPSGE MLSKEK                                                    
Subjt:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------

Query:  -------------------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGA
                                         PDVFP+ELP LPP RE++FAIELEPGT PI RAPYRMAPAELK LKVQLQELL KGFIRP+VSPWGA
Subjt:  -------------------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGA

Query:  PVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMK
        PVLFVKKK  SMRLC+DYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKI+LRSGYHQLRI+D D+PKTAFRS YGHYEF+ MSFGLTNAPAVFMDLM 
Subjt:  PVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMK

Query:  RVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAG
        RVF+ FLDT VIVFI+DIL+YSKT+A+HE+HL  VL+TLR NKLYAKFSKCE WLK+V+FLGH+VS  GVSVDP+KIEAVT W RPSTVSE+RSFLGLAG
Subjt:  RVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAG

Query:  YYK----------SPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK
        YY+          +PLTQLTRKG PFVWS A E SFQ LKQKLVTAPVL V DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK
Subjt:  YYK----------SPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK

KAA0051051.1 reverse transcriptase [Cucumis melo var. makuwa]2.1e-26872.71Show/hide
Query:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG
        MTVE+YDAEFD+LSRFAPEMIA +  RADKFVRGL+LDIQGLVRAF P THADALRLAVDLSLQERANSSK + RG  SG KRKAEQQ + +PQRNFRSG
Subjt:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY
        GEFRRFQQKPFEAGEAA G       G        FGT T FKCRQEGHTADRCP+RLTGNAQNQGAGAPHQG+V ATNKTE ERA TVV GTLPVLGHY
Subjt:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY

Query:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------
        ALV F+S SSHS ISSAFVLHARLEV+PLHHVLSVSTPSGE MLSKEK                                                    
Subjt:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------

Query:  ---------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGAPVLFVKKKGG
                               PDVFP+ELP LPP RE++FAIELEPGT PI RAPYRMAPAELK LKVQLQELL KGFIRP+VSPWGAPVLFVKKK G
Subjt:  ---------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGAPVLFVKKKGG

Query:  SMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMKRVFKNFLDTI
        SMRLC+DYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKI+LRSGYHQLRI++ D+PKTAFRS YGHYEF+ MSFGLTNAPAVFMDLM RVF+ FLDT 
Subjt:  SMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMKRVFKNFLDTI

Query:  VIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAGYYK-------
        VIVFI+DIL+YSKT+A+HE+HL  VL+TLR NKLYAKFSKCE WLK+V+FLGHVVS  GVSVDP+KIEAVT W RPSTVSE+RSFLGLAGYY+       
Subjt:  VIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAGYYK-------

Query:  ---SPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK
           +PLTQLTRKG PFVWS A E SFQ LKQKLVTAPVL VPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK
Subjt:  ---SPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK

KAA0062112.1 pol protein [Cucumis melo var. makuwa]2.2e-26571.51Show/hide
Query:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG
        MTVE+YDAEFD+LSRFAPEMIA +  RADKFVRGL+LDIQGLVRAF PATHADALRLAVDLSLQERANSSK   RG  SG KRKAEQQ + VPQRNFRSG
Subjt:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY
        GEFRRFQQKPFEAGEAA G       G        FGTRT FKCRQEGHTADRCP+RLTGNAQNQGAGAPHQG+V ATNKTE E+A TVV GTLPVLGHY
Subjt:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY

Query:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------
        ALV F+S SSHS ISS FVLHA LEV+PLHHVLSVSTPSGE MLSKEK                                                    
Subjt:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------

Query:  -------------------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGA
                                         PDVFP+ELP LPP RE++FAIELEPGT PI RAPYRMAPAELK LKVQLQELL KGFIRP+VSP GA
Subjt:  -------------------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGA

Query:  PVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMK
        PVLFVKKK GSMRLC+DYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKI+LRSGYHQLRI+D D+PKTAFRS YGHYEF+ MSFGLTNAPAVFMDLM 
Subjt:  PVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMK

Query:  RVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAG
        RVF+ FLDT VIVFI+DIL+YSKT+A+HE HL  VL+TLR NKLYAKFSKCE WLK+V+FLGHVVS  GVSVDP+KIEAVT W RPSTVSE+RSFLGLAG
Subjt:  RVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAG

Query:  YYK----------SPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK
        YY+          +PLTQLTRKG PFVWS A E SFQ LKQKLVTAPVL VPDGSGSFVIYSD SKKGLGCVLMQQGKVVAYASRQLK
Subjt:  YYK----------SPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK

TrEMBL top hitse value%identityAlignment
A0A5A7THE6 Reverse transcriptase6.5e-26872.29Show/hide
Query:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG
        MTVE+YDAEFD+LSRFAPEMIA +  RADKFVRGL+LDIQGLVRAF PATHADALRLAVDLSLQERANSSK + RG  SG KRKAEQQ + VPQRNFRSG
Subjt:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY
        GEFRRFQQKPFEAGEAA G       G        FGTRT FKCRQEGHTADRCP+RLTGNAQNQ AGAPHQG+V ATNKTE E+A TVV GTLPVLGHY
Subjt:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY

Query:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------
        ALV F+S SSHS IS AFVLHARLEV+PLHHVLSVSTPSGE MLSKEK                                                    
Subjt:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------

Query:  -------------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGAPVLFVK
                                   PDVFP+ELP LPP RE++FAIELEPGT PI RAPYRMAPAELK LKVQLQELL KGFIRP++SPWGAPVLFVK
Subjt:  -------------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGAPVLFVK

Query:  KKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMKRVFKNF
        KK GSMRLC+DYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKI+LRSGYHQLRI+D D+PKTAFRS YGHYEF+ MSFGLTNAPAVFMDLM RVF+ F
Subjt:  KKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMKRVFKNF

Query:  LDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAGYYK---
        LDT VIVFI+DIL+YSKT+A+HE+HL  VL+TLR NKLYAKFSKCE WLK+V+FLGHVVS  GVSVDP KIEAVT W RPSTVSE+RSFLGLAGYY+   
Subjt:  LDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAGYYK---

Query:  -------SPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK
               +PLTQLTRKG PFVWS A E SFQ LKQKLVTAPVL VPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK
Subjt:  -------SPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK

A0A5A7TP96 Reverse transcriptase5.5e-26769.61Show/hide
Query:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG
        MTVE+YDAEFD+LSRFAPEMIA +  RADKFVRGL+LDIQGLVRAF PATHADALRLAVDLSLQERANSSK + RGL SG KRKAEQQ + VPQRNFRSG
Subjt:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY
        GEFRRFQQKPFEAGEAA G       G        FGTRT FKCRQEGHTADRCP+RLTGNAQNQGAGAPHQG+V ATNKTE E+A TVV GTLPVLGHY
Subjt:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY

Query:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------
        ALV F+S SSHS ISSAFVLHARLEV+PLHHVLSVSTPSGE MLSKEK                                                    
Subjt:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------

Query:  ---------------------------------------------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAE
                                                                   PDVFP+ELP LPP RE++FAIELEPGT PI RAPYRMAPAE
Subjt:  ---------------------------------------------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAE

Query:  LKVLKVQLQELLAKGFIRPNVSPWGAPVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRS
        LK LKVQLQELL KGFIRP+VSPWGAPVLFVKKK GSMRLC+DYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKI+LRSGYHQLRI+D D+PKTAFRS
Subjt:  LKVLKVQLQELLAKGFIRPNVSPWGAPVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRS

Query:  GYGHYEFVGMSFGLTNAPAVFMDLMKRVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDP
         YGHYEF+ MSFGLTNAPAVFMDLM RVF+ FLDT VIVFI+DIL+YSKT+A+HE+HL  VL+TLR NKLYAKFSKCE WLK+V+FLGHVVS  GVSVDP
Subjt:  GYGHYEFVGMSFGLTNAPAVFMDLMKRVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDP

Query:  SKIEAVTSWPRPSTVSEIRSFLGLAGYY----------KSPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLM
        +KIEAVT W RPSTVSE+RSFLGLAGYY           +PLTQLTRKG PFVWS A E SFQ LKQKLVTAPVL VPDGSGSFVIYSDASKKGLGCVLM
Subjt:  SKIEAVTSWPRPSTVSEIRSFLGLAGYY----------KSPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLM

Query:  QQGKVVAYASRQLK
        QQGKVVAYASRQLK
Subjt:  QQGKVVAYASRQLK

A0A5A7TXM6 Reverse transcriptase8.0e-26671.37Show/hide
Query:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG
        MTVE+YDAEFD+LSRFAPEMIA +  RADKFVRGL+LDIQGLVRAF PATHADALRLAVDLSLQERANSSK + RG  SG KRKAEQQ + VPQRNFRSG
Subjt:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY
        GEFRRFQQKPFEA EAA G       G        FGTRT FKCRQEGHTADRCP+RLTGN QNQGAGAPHQG+V ATNKTE E+A TVV GTLPVLGHY
Subjt:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY

Query:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------
        ALV F+S SSHS ISSAFVLHARLEV+PLHHVLSVSTPSGE MLSKEK                                                    
Subjt:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------

Query:  -------------------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGA
                                         PDVFP+ELP LPP RE++FAIELEPGT PI RAPYRMAPAELK LKVQLQELL KGFIRP+VSPWGA
Subjt:  -------------------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGA

Query:  PVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMK
        PVLFVKKK  SMRLC+DYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKI+LRSGYHQLRI+D D+PKTAFRS YGHYEF+ MSFGLTNAPAVFMDLM 
Subjt:  PVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMK

Query:  RVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAG
        RVF+ FLDT VIVFI+DIL+YSKT+A+HE+HL  VL+TLR NKLYAKFSKCE WLK+V+FLGH+VS  GVSVDP+KIEAVT W RPSTVSE+RSFLGLAG
Subjt:  RVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAG

Query:  YYK----------SPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK
        YY+          +PLTQLTRKG PFVWS A E SFQ LKQKLVTAPVL V DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK
Subjt:  YYK----------SPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK

A0A5A7UC07 Reverse transcriptase1.0e-26872.71Show/hide
Query:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG
        MTVE+YDAEFD+LSRFAPEMIA +  RADKFVRGL+LDIQGLVRAF P THADALRLAVDLSLQERANSSK + RG  SG KRKAEQQ + +PQRNFRSG
Subjt:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY
        GEFRRFQQKPFEAGEAA G       G        FGT T FKCRQEGHTADRCP+RLTGNAQNQGAGAPHQG+V ATNKTE ERA TVV GTLPVLGHY
Subjt:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY

Query:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------
        ALV F+S SSHS ISSAFVLHARLEV+PLHHVLSVSTPSGE MLSKEK                                                    
Subjt:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------

Query:  ---------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGAPVLFVKKKGG
                               PDVFP+ELP LPP RE++FAIELEPGT PI RAPYRMAPAELK LKVQLQELL KGFIRP+VSPWGAPVLFVKKK G
Subjt:  ---------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGAPVLFVKKKGG

Query:  SMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMKRVFKNFLDTI
        SMRLC+DYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKI+LRSGYHQLRI++ D+PKTAFRS YGHYEF+ MSFGLTNAPAVFMDLM RVF+ FLDT 
Subjt:  SMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMKRVFKNFLDTI

Query:  VIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAGYYK-------
        VIVFI+DIL+YSKT+A+HE+HL  VL+TLR NKLYAKFSKCE WLK+V+FLGHVVS  GVSVDP+KIEAVT W RPSTVSE+RSFLGLAGYY+       
Subjt:  VIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAGYYK-------

Query:  ---SPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK
           +PLTQLTRKG PFVWS A E SFQ LKQKLVTAPVL VPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK
Subjt:  ---SPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK

A0A5A7V1N3 Reverse transcriptase1.0e-26571.51Show/hide
Query:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG
        MTVE+YDAEFD+LSRFAPEMIA +  RADKFVRGL+LDIQGLVRAF PATHADALRLAVDLSLQERANSSK   RG  SG KRKAEQQ + VPQRNFRSG
Subjt:  MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY
        GEFRRFQQKPFEAGEAA G       G        FGTRT FKCRQEGHTADRCP+RLTGNAQNQGAGAPHQG+V ATNKTE E+A TVV GTLPVLGHY
Subjt:  GEFRRFQQKPFEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHY

Query:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------
        ALV F+S SSHS ISS FVLHA LEV+PLHHVLSVSTPSGE MLSKEK                                                    
Subjt:  ALVFFNSVSSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEK----------------------------------------------------

Query:  -------------------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGA
                                         PDVFP+ELP LPP RE++FAIELEPGT PI RAPYRMAPAELK LKVQLQELL KGFIRP+VSP GA
Subjt:  -------------------------------GVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGA

Query:  PVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMK
        PVLFVKKK GSMRLC+DYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKI+LRSGYHQLRI+D D+PKTAFRS YGHYEF+ MSFGLTNAPAVFMDLM 
Subjt:  PVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMK

Query:  RVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAG
        RVF+ FLDT VIVFI+DIL+YSKT+A+HE HL  VL+TLR NKLYAKFSKCE WLK+V+FLGHVVS  GVSVDP+KIEAVT W RPSTVSE+RSFLGLAG
Subjt:  RVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAG

Query:  YYK----------SPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK
        YY+          +PLTQLTRKG PFVWS A E SFQ LKQKLVTAPVL VPDGSGSFVIYSD SKKGLGCVLMQQGKVVAYASRQLK
Subjt:  YYK----------SPLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLK

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.6e-5635.15Show/hide
Query:  LRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGAPVLFVKKKGGS-----MRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYH
        L + Y    A  + ++ Q+Q++L +G IR + SP+ +P+  V KK  +      R+ +DYR+LN++TV +R+P+  +D++  +L     F+ I+L  G+H
Subjt:  LRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGAPVLFVKKKGGS-----MRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYH

Query:  QLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMKRVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKV
        Q+ +    + KTAF + +GHYE++ M FGL NAPA F   M  + +  L+   +V+++DI+V+S +  +H + L  V E L    L  +  KCE   ++ 
Subjt:  QLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMKRVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKV

Query:  TFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAGYYK----------SPLTQLTRKGTPF-VWSPAYERSFQELKQKLVTAPVLPVPDGSGS
        TFLGHV++ +G+  +P KIEA+  +P P+   EI++FLGL GYY+           P+T+  +K       +P Y+ +F++LK  +   P+L VPD +  
Subjt:  TFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAGYYK----------SPLTQLTRKGTPF-VWSPAYERSFQELKQKLVTAPVLPVPDGSGS

Query:  FVIYSDASKKGLGCVLMQQGKVVAYASRQL
        F + +DAS   LG VL Q G  ++Y SR L
Subjt:  FVIYSDASKKGLGCVLMQQGKVVAYASRQL

P0CT34 Transposon Tf2-1 polyprotein6.0e-5330.71Show/hide
Query:  SSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEKGVPDVFPD--------ELPKLPPP-REIDFAIELEPGTAPILRAPYRMAPAELKVLKVQL
        S  ++IS   +    +E+    H L  S  +  S + KE  +PD++ +           KLP P + ++F +EL      +    Y + P +++ +  ++
Subjt:  SSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEKGVPDVFPD--------ELPKLPPP-REIDFAIELEPGTAPILRAPYRMAPAELKVLKVQL

Query:  QELLAKGFIRPNVSPWGAPVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFV
         + L  G IR + +    PV+FV KK G++R+ +DY+ LNK    N YPL  I+ L  ++QG+T+F+K++L+S YH +R+R  D  K AFR   G +E++
Subjt:  QELLAKGFIRPNVSPWGAPVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFV

Query:  GMSFGLTNAPAVFMDLMKRVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTS
         M +G++ APA F   +  +     ++ V+ +++DIL++SK++++H KH+  VL+ L+   L    +KCE    +V F+G+ +S +G +     I+ V  
Subjt:  GMSFGLTNAPAVFMDLMKRVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTS

Query:  WPRPSTVSEIRSFLGLAGYYKS----------PLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQ
        W +P    E+R FLG   Y +           PL  L +K   + W+P   ++ + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+
Subjt:  WPRPSTVSEIRSFLGLAGYYKS----------PLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQ

P0CT35 Transposon Tf2-2 polyprotein6.0e-5330.71Show/hide
Query:  SSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEKGVPDVFPD--------ELPKLPPP-REIDFAIELEPGTAPILRAPYRMAPAELKVLKVQL
        S  ++IS   +    +E+    H L  S  +  S + KE  +PD++ +           KLP P + ++F +EL      +    Y + P +++ +  ++
Subjt:  SSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEKGVPDVFPD--------ELPKLPPP-REIDFAIELEPGTAPILRAPYRMAPAELKVLKVQL

Query:  QELLAKGFIRPNVSPWGAPVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFV
         + L  G IR + +    PV+FV KK G++R+ +DY+ LNK    N YPL  I+ L  ++QG+T+F+K++L+S YH +R+R  D  K AFR   G +E++
Subjt:  QELLAKGFIRPNVSPWGAPVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFV

Query:  GMSFGLTNAPAVFMDLMKRVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTS
         M +G++ APA F   +  +     ++ V+ +++DIL++SK++++H KH+  VL+ L+   L    +KCE    +V F+G+ +S +G +     I+ V  
Subjt:  GMSFGLTNAPAVFMDLMKRVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTS

Query:  WPRPSTVSEIRSFLGLAGYYKS----------PLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQ
        W +P    E+R FLG   Y +           PL  L +K   + W+P   ++ + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+
Subjt:  WPRPSTVSEIRSFLGLAGYYKS----------PLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQ

P0CT41 Transposon Tf2-12 polyprotein6.0e-5330.71Show/hide
Query:  SSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEKGVPDVFPD--------ELPKLPPP-REIDFAIELEPGTAPILRAPYRMAPAELKVLKVQL
        S  ++IS   +    +E+    H L  S  +  S + KE  +PD++ +           KLP P + ++F +EL      +    Y + P +++ +  ++
Subjt:  SSHSSISSAFVLHARLEVKPLHHVLSVSTPSGESMLSKEKGVPDVFPD--------ELPKLPPP-REIDFAIELEPGTAPILRAPYRMAPAELKVLKVQL

Query:  QELLAKGFIRPNVSPWGAPVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFV
         + L  G IR + +    PV+FV KK G++R+ +DY+ LNK    N YPL  I+ L  ++QG+T+F+K++L+S YH +R+R  D  K AFR   G +E++
Subjt:  QELLAKGFIRPNVSPWGAPVLFVKKKGGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFV

Query:  GMSFGLTNAPAVFMDLMKRVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTS
         M +G++ APA F   +  +     ++ V+ +++DIL++SK++++H KH+  VL+ L+   L    +KCE    +V F+G+ +S +G +     I+ V  
Subjt:  GMSFGLTNAPAVFMDLMKRVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTS

Query:  WPRPSTVSEIRSFLGLAGYYKS----------PLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQ
        W +P    E+R FLG   Y +           PL  L +K   + W+P   ++ + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+
Subjt:  WPRPSTVSEIRSFLGLAGYYKS----------PLTQLTRKGTPFVWSPAYERSFQELKQKLVTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQ

P20825 Retrovirus-related Pol polyprotein from transposon 2975.8e-5634.53Show/hide
Query:  APILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGAPVLFVKKK-----GGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRS
        +PI    Y +A      ++ Q+QE+L +G IR + SP+ +P   V KK         R+ +DYR+LN++T+ +RYP+  +D++  +L     F+ I+L  
Subjt:  APILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGAPVLFVKKK-----GGSMRLCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRS

Query:  GYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMKRVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWL
        G+HQ+ + +  I KTAF +  GHYE++ M FGL NAPA F   M  + +  L+   +V+++DI+++S +  +H   +  V   L    L  +  KCE   
Subjt:  GYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMKRVFKNFLDTIVIVFINDILVYSKTKAKHEKHLHQVLETLRANKLYAKFSKCELWL

Query:  KKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAGYYK----------SPLTQLTRKGTPF-VWSPAYERSFQELKQKLVTAPVLPVPDG
        K+  FLGH+V+ +G+  +P K++A+ S+P P+   EIR+FLGL GYY+           P+T   +K T        Y  +F++LK  ++  P+L +PD 
Subjt:  KKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAGYYK----------SPLTQLTRKGTPF-VWSPAYERSFQELKQKLVTAPVLPVPDG

Query:  SGSFVIYSDASKKGLGCVLMQQGKVVAYASRQL
           FV+ +DAS   LG VL Q G  +++ SR L
Subjt:  SGSFVIYSDASKKGLGCVLMQQGKVVAYASRQL

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein3.4e-1938.17Show/hide
Query:  HLHQVLETLRANKLYAKFSKCELWLKKVTFLG--HVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAGYYK----------SPLTQLTRKGTPFVW
        HL  VL+    ++ YA   KC     ++ +LG  H++S EGVS DP+K+EA+  WP P   +E+R FLGL GYY+           PLT+L +K +   W
Subjt:  HLHQVLETLRANKLYAKFSKCELWLKKVTFLG--HVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAGYYK----------SPLTQLTRKGTPFVW

Query:  SPAYERSFQELKQKLVTAPVLPVPDGSGSFV
        +     +F+ LK  + T PVL +PD    FV
Subjt:  SPAYERSFQELKQKLVTAPVLPVPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGTGGAGCGGTATGATGCGGAGTTTGACATTTTATCCCGCTTCGCTCCCGAGATGATAGCAATTAAGACAACCAGAGCTGATAAGTTTGTTAGAGGCCTCAAGCT
AGACATCCAGGGTTTGGTTCGAGCCTTCGGACCTGCTACTCATGCCGATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCTAACTCGTCCAAGGTTTCAG
ATAGAGGTTTGGCCTCAGGACATAAAAGGAAGGCTGAGCAGCAGTCTATTTCAGTGCCACAGCGGAACTTCAGATCAGGTGGTGAGTTTCGCCGCTTCCAGCAGAAACCG
TTTGAGGCAGGGGAAGCTGCAGAGGGAAGCCGTTGTGTACCACTGGTGGGAAGCACCATCTGGGCCGTTGATTATTTTGGGACCAGGACTTTATTTAAGTGTAGGCAAGA
GGGGCACACAGCTGACAGATGCCCGATGAGACTTACCGGGAATGCGCAGAATCAGGGAGCAGGTGCTCCACATCAGGGTAAAGTCATTGCTACCAACAAGACTGAGGTTG
AGAGGGCAGACACGGTGGTGATAGGTACGCTTCCAGTATTGGGGCATTACGCCTTAGTTTTTTTCAATTCGGTTTCGTCACATTCCTCTATCTCTTCTGCATTTGTGTTG
CATGCCCGCTTAGAGGTAAAGCCCCTACACCATGTTCTATCCGTATCTACTCCTTCCGGGGAAAGTATGTTGTCGAAAGAAAAGGGAGTACCCGACGTTTTTCCTGATGA
GCTTCCAAAACTTCCACCTCCTAGGGAGATAGACTTCGCCATCGAGTTAGAGCCGGGCACTGCTCCTATCTTGAGGGCCCCTTACAGGATGGCTCCAGCCGAGCTAAAAG
TGTTGAAGGTCCAGTTACAGGAGTTGCTGGCCAAAGGCTTCATCCGGCCCAATGTGTCACCTTGGGGAGCACCAGTATTGTTCGTGAAGAAGAAGGGTGGGTCGATGCGC
CTTTGTATGGACTACCGAGAGCTGAACAAGGTGACGGTCAAAAACCGCTACCCCTTGCACAGGATTGATGACTTGTTCGATCAGTTGCAGGGAGCCACTGTTTTCTCTAA
GATCAATTTACGTTCAGGCTATCACCAACTGAGGATCAGGGACAGTGACATTCCCAAGACGGCCTTTCGATCGGGGTACGGACATTACGAATTCGTTGGGATGTCTTTCG
GCTTGACTAACGCTCCCGCAGTATTCATGGACTTGATGAAAAGAGTGTTTAAGAATTTCTTAGACACTATCGTGATTGTATTTATTAATGATATCTTAGTTTACTCCAAG
ACAAAGGCTAAGCATGAGAAGCATTTACATCAGGTTTTAGAGACTCTTCGAGCCAACAAGTTGTACGCCAAGTTCTCAAAGTGTGAGTTATGGCTGAAGAAGGTGACTTT
CCTCGGCCACGTGGTTTCTAGTGAGGGAGTTTCAGTAGATCCCTCAAAGATTGAAGCGGTGACCAGCTGGCCTCGACCGTCCACAGTTAGTGAAATTCGAAGTTTTTTGG
GCTTGGCAGGTTACTACAAGAGCCCGTTGACACAGTTGACCAGGAAGGGAACACCTTTTGTCTGGAGCCCAGCATACGAGCGTAGCTTTCAGGAGCTCAAACAAAAGCTA
GTGACTGCACCAGTCCTGCCAGTGCCTGATGGGTCGGGAAGCTTCGTGATCTACAGTGATGCCTCAAAAAAAGGACTGGGTTGTGTTTTAATGCAACAGGGAAAGGTCGT
TGCTTATGCCTCCCGCCAGTTGAAGATCCTGAGCAGAACTACCTACCCATAG
mRNA sequenceShow/hide mRNA sequence
ATGACAGTGGAGCGGTATGATGCGGAGTTTGACATTTTATCCCGCTTCGCTCCCGAGATGATAGCAATTAAGACAACCAGAGCTGATAAGTTTGTTAGAGGCCTCAAGCT
AGACATCCAGGGTTTGGTTCGAGCCTTCGGACCTGCTACTCATGCCGATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCTAACTCGTCCAAGGTTTCAG
ATAGAGGTTTGGCCTCAGGACATAAAAGGAAGGCTGAGCAGCAGTCTATTTCAGTGCCACAGCGGAACTTCAGATCAGGTGGTGAGTTTCGCCGCTTCCAGCAGAAACCG
TTTGAGGCAGGGGAAGCTGCAGAGGGAAGCCGTTGTGTACCACTGGTGGGAAGCACCATCTGGGCCGTTGATTATTTTGGGACCAGGACTTTATTTAAGTGTAGGCAAGA
GGGGCACACAGCTGACAGATGCCCGATGAGACTTACCGGGAATGCGCAGAATCAGGGAGCAGGTGCTCCACATCAGGGTAAAGTCATTGCTACCAACAAGACTGAGGTTG
AGAGGGCAGACACGGTGGTGATAGGTACGCTTCCAGTATTGGGGCATTACGCCTTAGTTTTTTTCAATTCGGTTTCGTCACATTCCTCTATCTCTTCTGCATTTGTGTTG
CATGCCCGCTTAGAGGTAAAGCCCCTACACCATGTTCTATCCGTATCTACTCCTTCCGGGGAAAGTATGTTGTCGAAAGAAAAGGGAGTACCCGACGTTTTTCCTGATGA
GCTTCCAAAACTTCCACCTCCTAGGGAGATAGACTTCGCCATCGAGTTAGAGCCGGGCACTGCTCCTATCTTGAGGGCCCCTTACAGGATGGCTCCAGCCGAGCTAAAAG
TGTTGAAGGTCCAGTTACAGGAGTTGCTGGCCAAAGGCTTCATCCGGCCCAATGTGTCACCTTGGGGAGCACCAGTATTGTTCGTGAAGAAGAAGGGTGGGTCGATGCGC
CTTTGTATGGACTACCGAGAGCTGAACAAGGTGACGGTCAAAAACCGCTACCCCTTGCACAGGATTGATGACTTGTTCGATCAGTTGCAGGGAGCCACTGTTTTCTCTAA
GATCAATTTACGTTCAGGCTATCACCAACTGAGGATCAGGGACAGTGACATTCCCAAGACGGCCTTTCGATCGGGGTACGGACATTACGAATTCGTTGGGATGTCTTTCG
GCTTGACTAACGCTCCCGCAGTATTCATGGACTTGATGAAAAGAGTGTTTAAGAATTTCTTAGACACTATCGTGATTGTATTTATTAATGATATCTTAGTTTACTCCAAG
ACAAAGGCTAAGCATGAGAAGCATTTACATCAGGTTTTAGAGACTCTTCGAGCCAACAAGTTGTACGCCAAGTTCTCAAAGTGTGAGTTATGGCTGAAGAAGGTGACTTT
CCTCGGCCACGTGGTTTCTAGTGAGGGAGTTTCAGTAGATCCCTCAAAGATTGAAGCGGTGACCAGCTGGCCTCGACCGTCCACAGTTAGTGAAATTCGAAGTTTTTTGG
GCTTGGCAGGTTACTACAAGAGCCCGTTGACACAGTTGACCAGGAAGGGAACACCTTTTGTCTGGAGCCCAGCATACGAGCGTAGCTTTCAGGAGCTCAAACAAAAGCTA
GTGACTGCACCAGTCCTGCCAGTGCCTGATGGGTCGGGAAGCTTCGTGATCTACAGTGATGCCTCAAAAAAAGGACTGGGTTGTGTTTTAATGCAACAGGGAAAGGTCGT
TGCTTATGCCTCCCGCCAGTTGAAGATCCTGAGCAGAACTACCTACCCATAG
Protein sequenceShow/hide protein sequence
MTVERYDAEFDILSRFAPEMIAIKTTRADKFVRGLKLDIQGLVRAFGPATHADALRLAVDLSLQERANSSKVSDRGLASGHKRKAEQQSISVPQRNFRSGGEFRRFQQKP
FEAGEAAEGSRCVPLVGSTIWAVDYFGTRTLFKCRQEGHTADRCPMRLTGNAQNQGAGAPHQGKVIATNKTEVERADTVVIGTLPVLGHYALVFFNSVSSHSSISSAFVL
HARLEVKPLHHVLSVSTPSGESMLSKEKGVPDVFPDELPKLPPPREIDFAIELEPGTAPILRAPYRMAPAELKVLKVQLQELLAKGFIRPNVSPWGAPVLFVKKKGGSMR
LCMDYRELNKVTVKNRYPLHRIDDLFDQLQGATVFSKINLRSGYHQLRIRDSDIPKTAFRSGYGHYEFVGMSFGLTNAPAVFMDLMKRVFKNFLDTIVIVFINDILVYSK
TKAKHEKHLHQVLETLRANKLYAKFSKCELWLKKVTFLGHVVSSEGVSVDPSKIEAVTSWPRPSTVSEIRSFLGLAGYYKSPLTQLTRKGTPFVWSPAYERSFQELKQKL
VTAPVLPVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKILSRTTYP