; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0016105 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0016105
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionReverse transcriptase
Genome locationchr07:8492772..8495208
RNA-Seq ExpressionPay0016105
SyntenyPay0016105
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035681.1 reverse transcriptase [Cucumis melo var. makuwa]2.1e-29587.33Show/hide
Query:  AISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDKGKAIDIQETNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQRK
        AISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDKGKAIDIQETNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQRK
Subjt:  AISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDKGKAIDIQETNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQRK

Query:  TIALAEDEDTYMSGTDR-EEEEETELIEADDGDRISCIVQRVLITPKEETNPQHHSLFKTRCTINGKNFVARKLVTALNLKTDPHPDPYKIGWVKKGGET
        TIALAEDEDTYMSGTDR EEEEETELIEADDGDRISCIVQRVLITPKEETNPQHHSLFKTRCTINGK +              PHPDPYKIGWVKKGGET
Subjt:  TIALAEDEDTYMSGTDR-EEEEETELIEADDGDRISCIVQRVLITPKEETNPQHHSLFKTRCTINGKNFVARKLVTALNLKTDPHPDPYKIGWVKKGGET

Query:  LINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKRQLFITVSGKNLLKEREQDL
        LINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKRQLFITVSGKNLLKEREQDL
Subjt:  LINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKRQLFITVSGKNLLKEREQDL

Query:  LGLL-----------------------------EPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTP
        LGLL                             EPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTP
Subjt:  LGLL-----------------------------EPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTP

Query:  NKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFECTFMRLMNQVLHPFLNQFIVVFFDDI
        NKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKA IFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFEC                         
Subjt:  NKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFECTFMRLMNQVLHPFLNQFIVVFFDDI

Query:  LVCSSSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTD
            SSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTD
Subjt:  LVCSSSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTD

KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]8.3e-21550.62Show/hide
Query:  YRGQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFKYMVPPDRKKVHL--------------------------------------------
        Y   E ++    +YKMKIDLP+Y+GKR+IE+FLDW+KNTENFF YM     KKVHL                                            
Subjt:  YRGQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFKYMVPPDRKKVHL--------------------------------------------

Query:  ----------------------------------------------------------------------AISLAETVEEMMTVRLKNSNRRTAWETNPS
                                                                              AI+ AETVEEM+  R K S R+  WE + S
Subjt:  ----------------------------------------------------------------------AISLAETVEEMMTVRLKNSNRRTAWETNPS

Query:  KKQSYGKKTDEQPSTSMVDKGKAIDIQETNKKKESLVRG--KTQNNYTRPSLGKCFRCGEPGHLSNNCSQRKTIALAEDEDTYMSGTDREEEEETELIEA
        KK + G    +  ++      K ++ +E++ KKE +  G  K +N Y RP  G C+RCG+ GH SN C QRKTIA+A+D D   + +  E +EETE+IEA
Subjt:  KKQSYGKKTDEQPSTSMVDKGKAIDIQETNKKKESLVRG--KTQNNYTRPSLGKCFRCGEPGHLSNNCSQRKTIALAEDEDTYMSGTDREEEEETELIEA

Query:  DDGDRISCIVQRVLITPKEETNPQHHSLFKTRCTINGK------------NFVARKLVTALNLKTDPHPDPYKIGWVKKGGETLINEICTIPLSIGNSYK
        D+GD +SCI+QRVLI+PKEE   Q HSLFKTRCTI GK            NFV++KLVTALNLKT PH  PYKIGW+KKGGETLI+EIC +PLSIGNSYK
Subjt:  DDGDRISCIVQRVLITPKEETNPQHHSLFKTRCTINGK------------NFVARKLVTALNLKTDPHPDPYKIGWVKKGGETLINEICTIPLSIGNSYK

Query:  DQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKR-QLFITVSGKNLLKEREQDLLGLL------------
        DQ+VCDVIEMDVCH+LLGRPWQ D Q++HRGRENTYEF WM KKVILLPL K+  ++I +  K+  LF+T+SGK  L+ERE ++LG++            
Subjt:  DQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKR-QLFITVSGKNLLKEREQDLLGLL------------

Query:  -----------------EPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAIN
                         EP  LPPLRDI H I+L+  AS P+LPHY MSP EY++LHD IE+LLKKGHIKPS S C VPALLTP KDG+WRMCVDSRAIN
Subjt:  -----------------EPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAIN

Query:  RVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFE------------CTFMRLMNQVLHPFLNQFIVVFFDDILVCS
        ++T KYRFPIPR+ DLLDQLG A IFSKIDLR+ YHQI+IRPGDEWKTAFKTNEGLFE             TFMRLMN+VLHPFLN+FI+V+FDDILV S
Subjt:  RVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFE------------CTFMRLMNQVLHPFLNQFIVVFFDDILVCS

Query:  SSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTDCLKK
         + + HLQ++ +LF+VL   ELY+N KKC + + EI FLGF+I++  + M+ KK+EAI +  TPT++ +VQAFLGLASFYR+FI+N S I AP+TDCLKK
Subjt:  SSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTDCLKK

Query:  GNFKWEHMRQ
        G F+W   +Q
Subjt:  GNFKWEHMRQ

KAA0062943.1 Retrovirus-related Pol polyprotein from transposon 17.6 [Cucumis melo var. makuwa]1.2e-23271.91Show/hide
Query:  LAISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDKGKAIDIQETNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQR
        L ++  ++ EEMM VRLKNSN+R  WETNPSKKQS GKKTDEQPSTS+VDKGKAIDIQETN KKES+VRGKTQNNYTRPSLGKCFRCGEP HLSNNC QR
Subjt:  LAISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDKGKAIDIQETNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQR

Query:  KTIALAEDEDTYMSGTDREEEEETELIEADDGDRISCIVQRVLITPKEETNPQHHSLFKTRCTINGK------------NFVARKLVTALNLKTDPHPDP
        KTIALAEDEDTYMS  D+EE+EE ELIEAD+GDRISCIVQRVLIT KEE NPQ HSLFKTRCTI+GK            NFVARKLV +LNLK DPHPDP
Subjt:  KTIALAEDEDTYMSGTDREEEEETELIEADDGDRISCIVQRVLITPKEETNPQHHSLFKTRCTINGK------------NFVARKLVTALNLKTDPHPDP

Query:  YKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKRQLFITVS
        YKIGWVKK GETLINEICTIPLSI NSYKDQIVCDVIEMDVCHLLL RPW+ D   L                 ++   ++     I +   ++LF    
Subjt:  YKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKRQLFITVS

Query:  GKNLLKEREQDLLGLLEPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAINR
            LK+         EPQGLPPL DIQHQIDLVP ASLP+LPHYRMSPEEYQVLHD+IE+LLKKGHIKPSLSPC VPALLTP KD SWRMCVDSRAINR
Subjt:  GKNLLKEREQDLLGLLEPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAINR

Query:  VTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFECTFMRLMNQVLHPF-LNQFIVVFFDDILVCSSSREDHLQYLRK
        +T KY FPIP++GDLLDQLGKA +FSKIDLR+ YHQI+IRP DEWKT FK NEGLFE   M        PF L+     F       S SREDHLQ+LRK
Subjt:  VTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFECTFMRLMNQVLHPF-LNQFIVVFFDDILVCSSSREDHLQYLRK

Query:  LFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTDCLKKGNFKWEHMRQ
        LF+VL EIELYINPKKCT+L KEIVFLGFLIKEGKI MEPKK+EAI S P PTSIKEVQAFLGLASFY+RFIRNFS IV PLTD LKK NFKWEHM+Q
Subjt:  LFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTDCLKKGNFKWEHMRQ

TYK30863.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]0.0e+0078.12Show/hide
Query:  MAQGYRGQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFKYMVPPDRKKVHL----------------------------------------
        MAQGYRGQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFKYMVPPDRKKVHL                                        
Subjt:  MAQGYRGQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFKYMVPPDRKKVHL----------------------------------------

Query:  ----------------------------------------------AISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDKGKAI
                                                      AISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDKGKAI
Subjt:  ----------------------------------------------AISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDKGKAI

Query:  DIQETNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQRKTIALAEDEDTYMSGTDREEEEETELIEADDGDRISCIVQRVLITPKEETNPQHH
        DIQETNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQRKTIALAEDEDTYMSGTD EEEEETELIEADDGDRISCIVQRVLITPKEETNPQHH
Subjt:  DIQETNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQRKTIALAEDEDTYMSGTDREEEEETELIEADDGDRISCIVQRVLITPKEETNPQHH

Query:  SLFKTRCTINGKNFVARKLVTALNLKTDPHPDPYKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEF
        SLFKTRCTINGK +              PHPDPYKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEF
Subjt:  SLFKTRCTINGKNFVARKLVTALNLKTDPHPDPYKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEF

Query:  QWMGKKVILLPLAKKNTESIRQKNKRQLFITVSGKNLLKEREQDLLGLL-----------------------------EPQGLPPLRDIQHQIDLVPRAS
        QWMGKKVILLPLAKKNTESIRQKNKRQLFITVSGKNLLKEREQDLLGLL                             EPQGLPPLRDIQHQIDLVPRAS
Subjt:  QWMGKKVILLPLAKKNTESIRQKNKRQLFITVSGKNLLKEREQDLLGLL-----------------------------EPQGLPPLRDIQHQIDLVPRAS

Query:  LPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQ
        LPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKA IFSKIDLRNGYHQIQ
Subjt:  LPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQ

Query:  IRPGDEWKTAFKTNEGLFECTFMRLMNQVLHPFLNQFIVVFFDDILVCSSSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRME
        IRPGDEWKTAFKTNEGLFEC                             SSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRME
Subjt:  IRPGDEWKTAFKTNEGLFECTFMRLMNQVLHPFLNQFIVVFFDDILVCSSSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRME

Query:  PKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTD
        PKKIEAI SRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTD
Subjt:  PKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTD

XP_011648447.2 uncharacterized protein LOC105434464 [Cucumis sativus]1.4e-20665.26Show/hide
Query:  EARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFKYMVPPDRKKVHL-AISL-AETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTS
        E RR  YHDYKMKIDL  Y+GK++IE+FLDWIK+TENFF YM  P+ KKVHL A+ L AETVEEM+ VR KN  RR AW+T  ++  +Y  KT++QPSTS
Subjt:  EARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFKYMVPPDRKKVHL-AISL-AETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTS

Query:  MVDKGKAIDIQE--TNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQRKTIALAEDEDTYMSGTDREEEEETELIEADDGDRISCIVQRVLIT
           KGK ++ QE    +K E   +  +QNNY+RP LGK FRCG+  HLSNNC QRKTIA+AE E   MS   +  E+E ELIEADDG+R+SC++QRVLIT
Subjt:  MVDKGKAIDIQE--TNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQRKTIALAEDEDTYMSGTDREEEEETELIEADDGDRISCIVQRVLIT

Query:  PKEETNPQHHSLFKTRCTING------------KNFVARKLVTALNLKTDPHPDPYKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDVIEMDVCHLL
        PKEE   Q H LFK RCTING            KNFVA+KLVT LNLK + HP  YKIGWV+K GE  ++EICT+PLSI N+YKDQIVCDVIEMDVCHLL
Subjt:  PKEETNPQHHSLFKTRCTING------------KNFVARKLVTALNLKTDPHPDPYKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDVIEMDVCHLL

Query:  LGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKRQLFITVSGKNLLKE--REQDLLGLLEPQGLPPLRDIQHQIDLVPRASLPNLP
        LGRPWQ+DTQ+LH+GRENTYE Q MG+KV+LLP+ +KN E +R ++     I    + LL E  R ++     EP+GLPPLRDIQH IDL+P ASLPNL 
Subjt:  LGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKRQLFITVSGKNLLKE--REQDLLGLLEPQGLPPLRDIQHQIDLVPRASLPNLP

Query:  HYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGD
        HYRMSP+EY+ LHDHIE+LLKKGHIKPSLSPCAVPALLT  KDGSWRMCVDSRAINR+T KYRF IPRI DLLDQLGKA+IFSKIDL++GYHQI+IRPGD
Subjt:  HYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGD

Query:  EWKTAFKTNEGLFEC------------TFMRLMNQVLHPFLNQFIVVFFDDILVCSSSREDHLQYLRKLF
        EWKT FKT EGLFE             TFMRLMNQ+LHPFLN+FIVV+FDDILV S++ E+HL +LRK+F
Subjt:  EWKTAFKTNEGLFEC------------TFMRLMNQVLHPFLNQFIVVFFDDILVCSSSREDHLQYLRKLF

TrEMBL top hitse value%identityAlignment
A0A5A7T256 Reverse transcriptase1.0e-29587.33Show/hide
Query:  AISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDKGKAIDIQETNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQRK
        AISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDKGKAIDIQETNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQRK
Subjt:  AISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDKGKAIDIQETNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQRK

Query:  TIALAEDEDTYMSGTDR-EEEEETELIEADDGDRISCIVQRVLITPKEETNPQHHSLFKTRCTINGKNFVARKLVTALNLKTDPHPDPYKIGWVKKGGET
        TIALAEDEDTYMSGTDR EEEEETELIEADDGDRISCIVQRVLITPKEETNPQHHSLFKTRCTINGK +              PHPDPYKIGWVKKGGET
Subjt:  TIALAEDEDTYMSGTDR-EEEEETELIEADDGDRISCIVQRVLITPKEETNPQHHSLFKTRCTINGKNFVARKLVTALNLKTDPHPDPYKIGWVKKGGET

Query:  LINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKRQLFITVSGKNLLKEREQDL
        LINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKRQLFITVSGKNLLKEREQDL
Subjt:  LINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKRQLFITVSGKNLLKEREQDL

Query:  LGLL-----------------------------EPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTP
        LGLL                             EPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTP
Subjt:  LGLL-----------------------------EPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTP

Query:  NKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFECTFMRLMNQVLHPFLNQFIVVFFDDI
        NKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKA IFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFEC                         
Subjt:  NKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFECTFMRLMNQVLHPFLNQFIVVFFDDI

Query:  LVCSSSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTD
            SSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTD
Subjt:  LVCSSSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTD

A0A5A7V4G7 Retrovirus-related Pol polyprotein from transposon 17.65.6e-23371.91Show/hide
Query:  LAISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDKGKAIDIQETNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQR
        L ++  ++ EEMM VRLKNSN+R  WETNPSKKQS GKKTDEQPSTS+VDKGKAIDIQETN KKES+VRGKTQNNYTRPSLGKCFRCGEP HLSNNC QR
Subjt:  LAISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDKGKAIDIQETNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQR

Query:  KTIALAEDEDTYMSGTDREEEEETELIEADDGDRISCIVQRVLITPKEETNPQHHSLFKTRCTINGK------------NFVARKLVTALNLKTDPHPDP
        KTIALAEDEDTYMS  D+EE+EE ELIEAD+GDRISCIVQRVLIT KEE NPQ HSLFKTRCTI+GK            NFVARKLV +LNLK DPHPDP
Subjt:  KTIALAEDEDTYMSGTDREEEEETELIEADDGDRISCIVQRVLITPKEETNPQHHSLFKTRCTINGK------------NFVARKLVTALNLKTDPHPDP

Query:  YKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKRQLFITVS
        YKIGWVKK GETLINEICTIPLSI NSYKDQIVCDVIEMDVCHLLL RPW+ D   L                 ++   ++     I +   ++LF    
Subjt:  YKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKRQLFITVS

Query:  GKNLLKEREQDLLGLLEPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAINR
            LK+         EPQGLPPL DIQHQIDLVP ASLP+LPHYRMSPEEYQVLHD+IE+LLKKGHIKPSLSPC VPALLTP KD SWRMCVDSRAINR
Subjt:  GKNLLKEREQDLLGLLEPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAINR

Query:  VTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFECTFMRLMNQVLHPF-LNQFIVVFFDDILVCSSSREDHLQYLRK
        +T KY FPIP++GDLLDQLGKA +FSKIDLR+ YHQI+IRP DEWKT FK NEGLFE   M        PF L+     F       S SREDHLQ+LRK
Subjt:  VTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFECTFMRLMNQVLHPF-LNQFIVVFFDDILVCSSSREDHLQYLRK

Query:  LFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTDCLKKGNFKWEHMRQ
        LF+VL EIELYINPKKCT+L KEIVFLGFLIKEGKI MEPKK+EAI S P PTSIKEVQAFLGLASFY+RFIRNFS IV PLTD LKK NFKWEHM+Q
Subjt:  LFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTDCLKKGNFKWEHMRQ

A0A5B7BER3 Uncharacterized protein5.9e-17440.93Show/hide
Query:  QGYRGQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFKYMVPPDRKKVHLAI----------------------------------------
        +GY G++ R +   +Y+MKIDLP++NG   IESFLDWI   E FF  M   D K+V L                                          
Subjt:  QGYRGQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFKYMVPPDRKKVHLAI----------------------------------------

Query:  ---------------------SLAETVEEMMTVRLKNS------------------------NRRTAWETNPSKKQSYGKKTDE--QPSTSMVDKGKAID
                             S++E  +E  T+  +N+                        N RT W  N +   +   +  +  QP  S        D
Subjt:  ---------------------SLAETVEEMMTVRLKNS------------------------NRRTAWETNPSKKQSYGKKTDE--QPSTSMVDKGKAID

Query:  IQETNKKKESLVRG----------------------------KTQNNYTRPSLGKCFRCGEPGHLSNNCSQRKTIALAEDEDTYMSGTDREEEEE-----
             + ++  + G                            K+ N Y RP  GKCFRC +PGH SN C  R+ + +    +      + EEE E     
Subjt:  IQETNKKKESLVRG----------------------------KTQNNYTRPSLGKCFRCGEPGHLSNNCSQRKTIALAEDEDTYMSGTDREEEEE-----

Query:  --TELIEADDGDRISCIVQRVLITPKEETNPQHHSLFKTRCTINGK------------NFVARKLVTALNLKTDPHPDPYKIGWVKKGGETLINEICTIP
           E+ E D+G+ +SC+VQR+L+ PK+E +PQ H++F+TRCTIN K            N V++ LV AL LKT+ HP+PYKIGW+KKG ET + EIC +P
Subjt:  --TELIEADDGDRISCIVQRVLITPKEETNPQHHSLFKTRCTINGK------------NFVARKLVTALNLKTDPHPDPYKIGWVKKGGETLINEICTIP

Query:  LSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKRQLFITVSGKNLLKERE------------
         SIG  YKD++ CD+++MD CH+LLGRPWQ D    H+G++NTY F W  KKV+L+P  K +      K + +  +TV+G   +++ +            
Subjt:  LSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKRQLFITVSGKNLLKERE------------

Query:  --------------------QDLLGLLEPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSW
                            QD+     P  LPP+RDIQH IDLVP ASLPNLPHYRMSP+E ++L   +EDL+ KG I+ S+SPCAVPALLTP KDGSW
Subjt:  --------------------QDLLGLLEPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSW

Query:  RMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFE------------CTFMRLMNQVLHPFLNQFIV
        RMCVDSRAIN++T KYRFPIPR+ D+LD L  + IFSKIDLR+GYHQI+IRPGDEWKTAFKT EGL+E             TFMR+MNQVL PF+ +F+V
Subjt:  RMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFE------------CTFMRLMNQVLHPFLNQFIV

Query:  VFFDDILVCSSSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLI
        V+FDDIL+ S S  +HL+++R++   L E +LYIN KKC +LT  ++FLGF+I    I+++ +K+ AI   PTP ++ ++++F GLA+FYRRFIRNFS I
Subjt:  VFFDDILVCSSSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLI

Query:  VAPLTDCLKKGNFKWE
        VAP+TDC+KKG F+WE
Subjt:  VAPLTDCLKKGNFKWE

A0A5D3DGR0 Reverse transcriptase4.0e-21550.62Show/hide
Query:  YRGQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFKYMVPPDRKKVHL--------------------------------------------
        Y   E ++    +YKMKIDLP+Y+GKR+IE+FLDW+KNTENFF YM     KKVHL                                            
Subjt:  YRGQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFKYMVPPDRKKVHL--------------------------------------------

Query:  ----------------------------------------------------------------------AISLAETVEEMMTVRLKNSNRRTAWETNPS
                                                                              AI+ AETVEEM+  R K S R+  WE + S
Subjt:  ----------------------------------------------------------------------AISLAETVEEMMTVRLKNSNRRTAWETNPS

Query:  KKQSYGKKTDEQPSTSMVDKGKAIDIQETNKKKESLVRG--KTQNNYTRPSLGKCFRCGEPGHLSNNCSQRKTIALAEDEDTYMSGTDREEEEETELIEA
        KK + G    +  ++      K ++ +E++ KKE +  G  K +N Y RP  G C+RCG+ GH SN C QRKTIA+A+D D   + +  E +EETE+IEA
Subjt:  KKQSYGKKTDEQPSTSMVDKGKAIDIQETNKKKESLVRG--KTQNNYTRPSLGKCFRCGEPGHLSNNCSQRKTIALAEDEDTYMSGTDREEEEETELIEA

Query:  DDGDRISCIVQRVLITPKEETNPQHHSLFKTRCTINGK------------NFVARKLVTALNLKTDPHPDPYKIGWVKKGGETLINEICTIPLSIGNSYK
        D+GD +SCI+QRVLI+PKEE   Q HSLFKTRCTI GK            NFV++KLVTALNLKT PH  PYKIGW+KKGGETLI+EIC +PLSIGNSYK
Subjt:  DDGDRISCIVQRVLITPKEETNPQHHSLFKTRCTINGK------------NFVARKLVTALNLKTDPHPDPYKIGWVKKGGETLINEICTIPLSIGNSYK

Query:  DQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKR-QLFITVSGKNLLKEREQDLLGLL------------
        DQ+VCDVIEMDVCH+LLGRPWQ D Q++HRGRENTYEF WM KKVILLPL K+  ++I +  K+  LF+T+SGK  L+ERE ++LG++            
Subjt:  DQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKR-QLFITVSGKNLLKEREQDLLGLL------------

Query:  -----------------EPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAIN
                         EP  LPPLRDI H I+L+  AS P+LPHY MSP EY++LHD IE+LLKKGHIKPS S C VPALLTP KDG+WRMCVDSRAIN
Subjt:  -----------------EPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAIN

Query:  RVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFE------------CTFMRLMNQVLHPFLNQFIVVFFDDILVCS
        ++T KYRFPIPR+ DLLDQLG A IFSKIDLR+ YHQI+IRPGDEWKTAFKTNEGLFE             TFMRLMN+VLHPFLN+FI+V+FDDILV S
Subjt:  RVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFE------------CTFMRLMNQVLHPFLNQFIVVFFDDILVCS

Query:  SSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTDCLKK
         + + HLQ++ +LF+VL   ELY+N KKC + + EI FLGF+I++  + M+ KK+EAI +  TPT++ +VQAFLGLASFYR+FI+N S I AP+TDCLKK
Subjt:  SSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTDCLKK

Query:  GNFKWEHMRQ
        G F+W   +Q
Subjt:  GNFKWEHMRQ

A0A5D3E417 Transposon Ty3-I Gag-Pol polyprotein isoform X10.0e+0078.12Show/hide
Query:  MAQGYRGQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFKYMVPPDRKKVHL----------------------------------------
        MAQGYRGQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFKYMVPPDRKKVHL                                        
Subjt:  MAQGYRGQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFKYMVPPDRKKVHL----------------------------------------

Query:  ----------------------------------------------AISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDKGKAI
                                                      AISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDKGKAI
Subjt:  ----------------------------------------------AISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDKGKAI

Query:  DIQETNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQRKTIALAEDEDTYMSGTDREEEEETELIEADDGDRISCIVQRVLITPKEETNPQHH
        DIQETNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQRKTIALAEDEDTYMSGTD EEEEETELIEADDGDRISCIVQRVLITPKEETNPQHH
Subjt:  DIQETNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQRKTIALAEDEDTYMSGTDREEEEETELIEADDGDRISCIVQRVLITPKEETNPQHH

Query:  SLFKTRCTINGKNFVARKLVTALNLKTDPHPDPYKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEF
        SLFKTRCTINGK +              PHPDPYKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEF
Subjt:  SLFKTRCTINGKNFVARKLVTALNLKTDPHPDPYKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEF

Query:  QWMGKKVILLPLAKKNTESIRQKNKRQLFITVSGKNLLKEREQDLLGLL-----------------------------EPQGLPPLRDIQHQIDLVPRAS
        QWMGKKVILLPLAKKNTESIRQKNKRQLFITVSGKNLLKEREQDLLGLL                             EPQGLPPLRDIQHQIDLVPRAS
Subjt:  QWMGKKVILLPLAKKNTESIRQKNKRQLFITVSGKNLLKEREQDLLGLL-----------------------------EPQGLPPLRDIQHQIDLVPRAS

Query:  LPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQ
        LPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKA IFSKIDLRNGYHQIQ
Subjt:  LPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQ

Query:  IRPGDEWKTAFKTNEGLFECTFMRLMNQVLHPFLNQFIVVFFDDILVCSSSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRME
        IRPGDEWKTAFKTNEGLFEC                             SSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRME
Subjt:  IRPGDEWKTAFKTNEGLFECTFMRLMNQVLHPFLNQFIVVFFDDILVCSSSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRME

Query:  PKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTD
        PKKIEAI SRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTD
Subjt:  PKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTD

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.8e-4839.7Show/hide
Query:  NLPHYR--MSPEEY-QVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGS-----WRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRN
        NLP Y     P+ Y Q +   I+D+L +G I+ S SP   P  + P K  +     +R+ +D R +N +T   R PIP + ++L +LG+   F+ IDL  
Subjt:  NLPHYR--MSPEEY-QVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGS-----WRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRN

Query:  GYHQIQIRPGDEWKTAFKTNEGLFE------------CTFMRLMNQVLHPFLNQFIVVFFDDILVCSSSREDHLQYLRKLFRVLTEIELYINPKKCTYLT
        G+HQI++ P    KTAF T  G +E             TF R MN +L P LN+  +V+ DDI+V S+S ++HLQ L  +F  L +  L +   KC +L 
Subjt:  GYHQIQIRPGDEWKTAFKTNEGLFE------------CTFMRLMNQVLHPFLNQFIVVFFDDILVCSSSREDHLQYLRKLFRVLTEIELYINPKKCTYLT

Query:  KEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTDCLKK
        +E  FLG ++    I+  P+KIEAI   P PT  KE++AFLGL  +YR+FI NF+ I  P+T CLKK
Subjt:  KEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTDCLKK

P20825 Retrovirus-related Pol polyprotein from transposon 2972.5e-4432.84Show/hide
Query:  LAKKNTESIRQKNKRQLFITVSGKNLLKEREQDLLGLL---------EPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIK
        +A  + ESI++ +  Q  +     +L +E    L GLL         E + L     I+H ++    + + +  +      E +V  + ++++L +G I+
Subjt:  LAKKNTESIRQKNKRQLFITVSGKNLLKEREQDLLGLL---------EPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIK

Query:  PSLSPCAVPALLTPNKD-----GSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFE-------
         S SP   P  + P K        +R+ +D R +N +T   R+PIP + ++L +LGK   F+ IDL  G+HQI++      KTAF T  G +E       
Subjt:  PSLSPCAVPALLTPNKD-----GSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFE-------

Query:  -----CTFMRLMNQVLHPFLNQFIVVFFDDILVCSSSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPT
              TF R MN +L P LN+  +V+ DDI++ S+S  +HL  ++ +F  L +  L +   KC +L KE  FLG ++    I+  P K++AI S P PT
Subjt:  -----CTFMRLMNQVLHPFLNQFIVVFFDDILVCSSSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPT

Query:  SIKEVQAFLGLASFYRRFIRNFSLIVAPLTDCLKK
          KE++AFLGL  +YR+FI N++ I  P+T CLKK
Subjt:  SIKEVQAFLGLASFYRRFIRNFSLIVAPLTDCLKK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein6.7e-5038.95Show/hide
Query:  IQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFS
        ++H I++ P A LP L  Y ++ +  Q ++  ++ LL    I PS SPC+ P +L P KDG++R+CVD R +N+ T    FP+PRI +LL ++G A IF+
Subjt:  IQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFS

Query:  KIDLRNGYHQIQIRPGDEWKTAFKTNEGLFE------------CTFMRLMNQVLHPFLNQFIVVFFDDILVCSSSREDHLQYLRKLFRVLTEIELYINPK
         +DL +GYHQI + P D +KTAF T  G +E             TF R M         +F+ V+ DDIL+ S S E+H ++L  +   L    L +  K
Subjt:  KIDLRNGYHQIQIRPGDEWKTAFKTNEGLFE------------CTFMRLMNQVLHPFLNQFIVVFFDDILVCSSSREDHLQYLRKLFRVLTEIELYINPK

Query:  KCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPL
        KC + ++E  FLG+ I   KI     K  AI   PTP ++K+ Q FLG+ ++YRRFI N S I  P+
Subjt:  KCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.4e-3935.54Show/hide
Query:  IEDLLKKGHIKPSLSP-----CAVPALLTPNKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNE
        I++LL+ G I+PS SP       VP    PN +  +RM VD + +N VT    +PIP I   L  LG A  F+ +DL +G+HQI ++  D  KTAF T  
Subjt:  IEDLLKKGHIKPSLSP-----CAVPALLTPNKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNE

Query:  GLFE------------CTFMRLMNQVLHPFLNQFIVVFFDDILVCSSSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKK
        G +E              F R+++ +L   + +   V+ DDI+V S   + H + LR +   L++  L +N +K  +L  ++ FLG+++    I+ +PKK
Subjt:  GLFE------------CTFMRLMNQVLHPFLNQFIVVFFDDILVCSSSREDHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKK

Query:  IEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTD
        + AI   P PTS+KE++ FLG+ S+YR+FI++++ +  PLT+
Subjt:  IEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTD

Q99315 Transposon Ty3-G Gag-Pol polyprotein6.7e-5038.95Show/hide
Query:  IQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFS
        ++H I++ P A LP L  Y ++ +  Q ++  ++ LL    I PS SPC+ P +L P KDG++R+CVD R +N+ T    FP+PRI +LL ++G A IF+
Subjt:  IQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKATIFS

Query:  KIDLRNGYHQIQIRPGDEWKTAFKTNEGLFE------------CTFMRLMNQVLHPFLNQFIVVFFDDILVCSSSREDHLQYLRKLFRVLTEIELYINPK
         +DL +GYHQI + P D +KTAF T  G +E             TF R M         +F+ V+ DDIL+ S S E+H ++L  +   L    L +  K
Subjt:  KIDLRNGYHQIQIRPGDEWKTAFKTNEGLFE------------CTFMRLMNQVLHPFLNQFIVVFFDDILVCSSSREDHLQYLRKLFRVLTEIELYINPK

Query:  KCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPL
        KC + ++E  FLG+ I   KI     K  AI   PTP ++K+ Q FLG+ ++YRRFI N S I  P+
Subjt:  KCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPL

Arabidopsis top hitse value%identityAlignment
AT4G13320.1 unknown protein2.8e-1130Show/hide
Query:  LFKTRCTIN----------GKNFVARKLVTALNLKTDPHPDPYKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDVIEM--DVCHLLLGRPWQHDTQT
        +F+T+C IN          G N +++ LV  L LKT       ++    +  + +  E C +P+SIG+ YKD++ C V+ M  +   LL G PW +  Q 
Subjt:  LFKTRCTIN----------GKNFVARKLVTALNLKTDPHPDPYKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDVIEM--DVCHLLLGRPWQHDTQT

Query:  LHRGRENTYEFQWMGKKVIL
         H GR+++    W    ++L
Subjt:  LHRGRENTYEFQWMGKKVIL

ATMG00860.1 DNA/RNA polymerases superfamily protein7.2e-1535.58Show/hide
Query:  LQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLG--FLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTDCLKKGNFK
        + +L  + ++  + + Y N KKC +   +I +LG   +I    +  +P K+EA+   P P +  E++ FLGL  +YRRF++N+  IV PLT+ LKK + K
Subjt:  LQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLG--FLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTDCLKKGNFK

Query:  WEHM
        W  M
Subjt:  WEHM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCAAGGCTATAGAGGACAAGAAGCACGGAGAGAAACTTATCATGATTACAAGATGAAAATCGACCTACCAACTTACAATGGTAAACGCGATATTGAGTCTTTCCT
AGATTGGATTAAAAACACCGAAAACTTCTTTAAATATATGGTTCCTCCGGACAGAAAGAAGGTACACCTAGCCATCTCCCTTGCAGAAACTGTTGAAGAAATGATGACTG
TACGCCTGAAGAACTCCAACAGAAGGACAGCATGGGAGACGAACCCCTCCAAGAAACAATCTTATGGCAAGAAGACAGATGAACAACCTTCAACGTCAATGGTAGATAAG
GGTAAAGCTATTGATATCCAAGAGACAAACAAGAAGAAGGAAAGTTTAGTCAGAGGAAAGACCCAAAACAACTACACACGCCCATCTTTGGGCAAGTGTTTTCGATGTGG
AGAGCCCGGTCACTTATCCAACAACTGTTCGCAAAGGAAGACCATAGCACTAGCTGAAGATGAAGACACCTACATGAGCGGAACAGATAGAGAAGAAGAAGAAGAAACAG
AGCTAATTGAAGCCGATGATGGAGATCGCATCTCCTGCATTGTTCAAAGAGTCCTCATTACTCCTAAAGAAGAAACGAATCCCCAACATCACAGCCTATTCAAAACGAGG
TGCACTATCAATGGAAAAAACTTCGTGGCAAGAAAACTTGTGACTGCTTTGAATCTGAAGACAGATCCACACCCTGATCCATATAAAATTGGATGGGTAAAGAAGGGAGG
AGAAACCTTAATCAACGAAATCTGCACTATACCACTCTCCATCGGTAATAGCTATAAGGATCAGATTGTGTGTGATGTAATTGAGATGGATGTGTGTCACTTACTACTAG
GCAGACCTTGGCAACATGATACTCAAACCCTACATAGGGGGAGAGAAAATACCTACGAGTTCCAGTGGATGGGAAAGAAAGTTATCTTACTCCCATTAGCAAAGAAAAAC
ACAGAAAGCATAAGGCAAAAGAATAAAAGGCAGCTTTTCATCACAGTAAGTGGAAAAAACCTATTAAAAGAAAGGGAGCAAGATCTTTTGGGATTATTAGAGCCACAAGG
ACTGCCACCACTTCGTGACATTCAGCATCAAATTGATCTCGTTCCAAGAGCATCACTACCTAATCTACCCCATTACAGAATGAGCCCCGAAGAATACCAAGTCTTACACG
ATCATATTGAAGACTTACTAAAAAAGGGCCACATCAAGCCAAGTCTAAGTCCATGCGCTGTACCAGCACTACTCACACCAAATAAAGATGGAAGTTGGAGGATGTGTGTA
GACAGCAGAGCTATTAACCGAGTTACTGGGAAGTATCGGTTTCCTATCCCTCGGATTGGAGATCTTTTGGACCAACTAGGCAAGGCTACGATTTTCTCAAAAATTGATTT
AAGGAACGGCTATCATCAAATACAAATCAGACCAGGAGATGAGTGGAAGACAGCCTTCAAGACAAATGAAGGATTGTTTGAATGCACTTTTATGAGGTTGATGAATCAAG
TACTACATCCATTCTTAAACCAGTTTATAGTGGTTTTCTTTGATGATATCCTTGTGTGTAGCAGCAGCCGTGAAGACCATCTACAGTACCTAAGGAAACTGTTTCGAGTT
TTGACTGAAATAGAATTATATATAAATCCAAAGAAGTGCACATATCTCACCAAGGAAATCGTCTTTCTTGGATTCTTGATCAAAGAAGGAAAGATAAGGATGGAACCTAA
GAAAATAGAAGCTATACATTCTCGGCCAACACCAACATCCATCAAAGAAGTGCAAGCTTTTCTTGGCTTGGCATCTTTCTACAGAAGATTCATCAGGAATTTTAGCTTAA
TAGTGGCCCCCCTAACCGACTGTTTAAAGAAAGGAAATTTCAAATGGGAGCATATGCGGCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCCAAGGCTATAGAGGACAAGAAGCACGGAGAGAAACTTATCATGATTACAAGATGAAAATCGACCTACCAACTTACAATGGTAAACGCGATATTGAGTCTTTCCT
AGATTGGATTAAAAACACCGAAAACTTCTTTAAATATATGGTTCCTCCGGACAGAAAGAAGGTACACCTAGCCATCTCCCTTGCAGAAACTGTTGAAGAAATGATGACTG
TACGCCTGAAGAACTCCAACAGAAGGACAGCATGGGAGACGAACCCCTCCAAGAAACAATCTTATGGCAAGAAGACAGATGAACAACCTTCAACGTCAATGGTAGATAAG
GGTAAAGCTATTGATATCCAAGAGACAAACAAGAAGAAGGAAAGTTTAGTCAGAGGAAAGACCCAAAACAACTACACACGCCCATCTTTGGGCAAGTGTTTTCGATGTGG
AGAGCCCGGTCACTTATCCAACAACTGTTCGCAAAGGAAGACCATAGCACTAGCTGAAGATGAAGACACCTACATGAGCGGAACAGATAGAGAAGAAGAAGAAGAAACAG
AGCTAATTGAAGCCGATGATGGAGATCGCATCTCCTGCATTGTTCAAAGAGTCCTCATTACTCCTAAAGAAGAAACGAATCCCCAACATCACAGCCTATTCAAAACGAGG
TGCACTATCAATGGAAAAAACTTCGTGGCAAGAAAACTTGTGACTGCTTTGAATCTGAAGACAGATCCACACCCTGATCCATATAAAATTGGATGGGTAAAGAAGGGAGG
AGAAACCTTAATCAACGAAATCTGCACTATACCACTCTCCATCGGTAATAGCTATAAGGATCAGATTGTGTGTGATGTAATTGAGATGGATGTGTGTCACTTACTACTAG
GCAGACCTTGGCAACATGATACTCAAACCCTACATAGGGGGAGAGAAAATACCTACGAGTTCCAGTGGATGGGAAAGAAAGTTATCTTACTCCCATTAGCAAAGAAAAAC
ACAGAAAGCATAAGGCAAAAGAATAAAAGGCAGCTTTTCATCACAGTAAGTGGAAAAAACCTATTAAAAGAAAGGGAGCAAGATCTTTTGGGATTATTAGAGCCACAAGG
ACTGCCACCACTTCGTGACATTCAGCATCAAATTGATCTCGTTCCAAGAGCATCACTACCTAATCTACCCCATTACAGAATGAGCCCCGAAGAATACCAAGTCTTACACG
ATCATATTGAAGACTTACTAAAAAAGGGCCACATCAAGCCAAGTCTAAGTCCATGCGCTGTACCAGCACTACTCACACCAAATAAAGATGGAAGTTGGAGGATGTGTGTA
GACAGCAGAGCTATTAACCGAGTTACTGGGAAGTATCGGTTTCCTATCCCTCGGATTGGAGATCTTTTGGACCAACTAGGCAAGGCTACGATTTTCTCAAAAATTGATTT
AAGGAACGGCTATCATCAAATACAAATCAGACCAGGAGATGAGTGGAAGACAGCCTTCAAGACAAATGAAGGATTGTTTGAATGCACTTTTATGAGGTTGATGAATCAAG
TACTACATCCATTCTTAAACCAGTTTATAGTGGTTTTCTTTGATGATATCCTTGTGTGTAGCAGCAGCCGTGAAGACCATCTACAGTACCTAAGGAAACTGTTTCGAGTT
TTGACTGAAATAGAATTATATATAAATCCAAAGAAGTGCACATATCTCACCAAGGAAATCGTCTTTCTTGGATTCTTGATCAAAGAAGGAAAGATAAGGATGGAACCTAA
GAAAATAGAAGCTATACATTCTCGGCCAACACCAACATCCATCAAAGAAGTGCAAGCTTTTCTTGGCTTGGCATCTTTCTACAGAAGATTCATCAGGAATTTTAGCTTAA
TAGTGGCCCCCCTAACCGACTGTTTAAAGAAAGGAAATTTCAAATGGGAGCATATGCGGCAGTAG
Protein sequenceShow/hide protein sequence
MAQGYRGQEARRETYHDYKMKIDLPTYNGKRDIESFLDWIKNTENFFKYMVPPDRKKVHLAISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDK
GKAIDIQETNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQRKTIALAEDEDTYMSGTDREEEEETELIEADDGDRISCIVQRVLITPKEETNPQHHSLFKTR
CTINGKNFVARKLVTALNLKTDPHPDPYKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKN
TESIRQKNKRQLFITVSGKNLLKEREQDLLGLLEPQGLPPLRDIQHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCV
DSRAINRVTGKYRFPIPRIGDLLDQLGKATIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFECTFMRLMNQVLHPFLNQFIVVFFDDILVCSSSREDHLQYLRKLFRV
LTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTDCLKKGNFKWEHMRQ