; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G19140 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G19140
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRNA-directed DNA polymerase
Genome locationChr1:14518722..14522696
RNA-Seq ExpressionCSPI01G19140
SyntenyCSPI01G19140
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PKU63120.1 RNA-directed DNA polymerase [Dendrobium catenatum]0.0e+0047.32Show/hide
Query:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI
        M+ P +K+V+ VA +LKGGASAWW QL+ NRQR  K PVR+W +MK++++G FLP +YEQ LY QYQ+C QG R+V+ Y EEF+RLSAR NL E++   +
Subjt:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI

Query:  ARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPTKKTSYTSKTNDQPMAPIHGKGKEA-DSQTATNEKKAEIINKSKNQ
        AR+ GGL+  +++K++L  L  LS+A++ A   E  ++ ++K  + R       T+   Y+      P++  + +   A DSQTA   +           
Subjt:  ARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPTKKTSYTSKTNDQPMAPIHGKGKEA-DSQTATNEKKAEIINKSKNQ

Query:  NNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAEEEGNLPGEDESEPREET-EEIEVDEGDRISCVIHKVLIAPKEEKSPQRHSLFKTRCTINGKVCDVI
        N Y +P+  KCFRC QPGH SN CP R  + + E +     E   +  ++  EE+  DEG+ + C++ ++L+AP++  + QR+++F+TRCTI G+VC+++
Subjt:  NNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAEEEGNLPGEDESEPREET-EEIEVDEGDRISCVIHKVLIAPKEEKSPQRHSLFKTRCTINGKVCDVI

Query:  IDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGK
        ID G  EN +++ +V  L LK   +P PYKI WVKKG E ++ ++C V  SIG SY  +++CDVIDMDVCH++LGRPWQ+D   ++  R NTY F W G+
Subjt:  IDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGK

Query:  KVALLPLTKKNEENSKTRGQLFTTVSGKTLLKERKQDILALVVTGSTNGEQAGELEPQLQQLFEEFPHL--KKEPDGLPPLRDIQHHIDLIPGASLPNLA
        K+ LLP    +  N      +   VSG  LL +    I ALV       +      PQ+ +L  EF  +   + P  LPP+  IQH IDL+PGA+LPNL 
Subjt:  KVALLPLTKKNEENSKTRGQLFTTVSGKTLLKERKQDILALVVTGSTNGEQAGELEPQLQQLFEEFPHL--KKEPDGLPPLRDIQHHIDLIPGASLPNLA

Query:  HYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPGD
        HYRM+P+E+  L E ++DLL++  I+PSLSPCAVPALL PKKD  WRMC+DSRAIN+IT KYRFP+PR+ D+LD+L  + +FSK+D RSGYHQIRIRPGD
Subjt:  HYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPGD

Query:  EWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFIIK
        EWKTAFKT +GLFEW VMPFGL NAP+TFM +M +VL    N+F VVYFDDIL+YSS  ++H+ HL K+ Q L E  LY+N  +CEF  A++ FLGFI+ 
Subjt:  EWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFIIK

Query:  KGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACGT
           ++ +PRK+ AI++W +P T+ +V +F GLA+FYR+FIR FS I AP+ DCLK     W  S Q S++ IK+ L+S+PVL LP+F  PF+   DA   
Subjt:  KGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACGT

Query:  GIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALK-----------------------------------------QFNFVIKHQAGKENKVA
        GI AVLSQ   PIE+ SEKLS  RQ W+ YEQELYA+VRALK                                         +F FV+KH++G +N+VA
Subjt:  GIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALK-----------------------------------------QFNFVIKHQAGKENKVA

Query:  DALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKCSNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFGQDKIFETVSIRYY
        DALSR+ +LLT L +E+   + L ELY  D DFA  W +C+       Y +  G+LFKG+ LCIP +S R+ LIKEAH+ GLA H G++K  + +  R++
Subjt:  DALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKCSNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFGQDKIFETVSIRYY

Query:  WPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHG
        WP+L +D + FV+RC++CQ  KG++ NAGLY PL +P SIWED+SIDFVLGLP+TQR  DS+MVVVDRFSKMAHF+AC+K+ DA+ +A LFF E++RLHG
Subjt:  WPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHG

Query:  IPKTIVSDRDVKFLSHFWKTLWKN----------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQVVYTKRPRLT
        +P++I SDRDVKF+SHFW+ LWK                                     +PKQW+ +L+QAEFA+N+M NRST +CPF +VYTK P   
Subjt:  IPKTIVSDRDVKFLSHFWKTLWKN----------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQVVYTKRPRLT

Query:  FDLASLPITVESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAADKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKYGDNAFKVEL
        FD+A LP    ++K A  + +  + +  EV   LI S  +YKKAAD  RR   F  GDLV++ LRK+RFPAG  +KL  ++ GP  I+++  DNA+ V+L
Subjt:  FDLASLPITVESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAADKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKYGDNAFKVEL

Query:  PPDMHIHSVFNIADLKPYYAPDD
        P  MH    FN+ DL PY+ PDD
Subjt:  PPDMHIHSVFNIADLKPYYAPDD

PKU71894.1 RNA-directed DNA polymerase [Dendrobium catenatum]0.0e+0048.02Show/hide
Query:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI
        M+   +K+V+ VA +LKGGASAWW QL  +R+R  +  VRSW +MK+LL+G FLP +YEQ LY +YQ+C QG+R+V +Y EEF+RLSAR NL E+E   +
Subjt:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI

Query:  ARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPTKKTSYTSKTNDQPMAPIHGKGKEADSQTATNEKKAEIINKSKNQN
        AR+VGGL+  I++K++L  +  LS+A++ A   E  +   +++ ++R +    P +  S  SK + QP          +   +A N   A  +     +N
Subjt:  ARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPTKKTSYTSKTNDQPMAPIHGKGKEADSQTATNEKKAEIINKSKNQN

Query:  NYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAE----EEGNLPGEDESEPREETEEIEVDEGDRISCVIHKVLIAPKEEKSPQRHSLFKTRCTINGKVCD
         Y+RP+  KCFRC QPGH SN CP R+ I + +    E+G +  +  ++  +  E+++ DEG+ I C++ K+L+AP++ +  QR+++F+T+CTI GKVC+
Subjt:  NYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAE----EEGNLPGEDESEPREETEEIEVDEGDRISCVIHKVLIAPKEEKSPQRHSLFKTRCTINGKVCD

Query:  VIIDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWM
        ++ID G  EN I++ +V  L LK    P PYKI WVK+G E TV E C V  S+G  Y  +++CDV++MDVCH++LGRPWQ DTQ +H  R N Y F W 
Subjt:  VIIDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWM

Query:  GKKVALLPLT-----KKNEENSKTRGQLFTT---VSGKTLLKE--RKQDILALVVTGSTNGEQAGELEPQLQQLFEEF----PHLKKEPDGLPPLRDIQH
        GKK+ LLP T     K N  N       F     VSG  LL+E   K  +LALV    +       L   +QQL  EF    PH  + P  LPPLR+IQH
Subjt:  GKKVALLPLT-----KKNEENSKTRGQLFTT---VSGKTLLKE--RKQDILALVVTGSTNGEQAGELEPQLQQLFEEF----PHLKKEPDGLPPLRDIQH

Query:  HIDLIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKID
         IDLIPGA+LPNL +YRM+P+E+  L E ++DLLK+  I+ SLSPCAVPALL PKKDG WRMC+DSRAIN+IT K+RFP+PRV DLLD+L  A IFSK+D
Subjt:  HIDLIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKID

Query:  PRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCE
         RSGYHQ+RIRPGDEWK+AFKT EGLFEW VMPFGL NAPSTFMR+M++VL PF  KF V YFDDILVYS+  ++H+LHL +LFQ L   +LY+N  +CE
Subjt:  PRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCE

Query:  FLKAEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPD
        F   ++ FLGF++ +  I ++PRKV A++ W VP ++ ++ +F GLA+FYR+FIR FS I AP+TD LK  +F W+ +QQ+SFE IKK L+S+P+L LP+
Subjt:  FLKAEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPD

Query:  FSSPFEAAVDACGTGIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALKQ-----------------------------------------FN
        F  PF+   DA G GI AVL Q   P+EY SEKLS +RQ W+ YEQELYA+VRALKQ                                         F+
Subjt:  FSSPFEAAVDACGTGIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALKQ-----------------------------------------FN

Query:  FVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKCSNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHF
        FVI+H+ GK N+VADALSR+ +LL  L +E+   + +  LY+ D DFA  W  C+       + +  GFLFKG+ LC+P +S R  LI+E H NGLA H 
Subjt:  FVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKCSNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHF

Query:  GQDKIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIY
        G+DK  + + +R++WP L++D    ++RC+ CQ  KG+  N GLY PLP+P SIWEDLS+DFVLGLP+T+R  DS+MVVVDRFSKMAHFI CKKT DA+ 
Subjt:  GQDKIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIY

Query:  IANLFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWKN----------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDK
        IA LFFKE++RLHGIP+++ SDRDVKF+SHFW+ LWK                                      PK W+  L QAEFAFN+M NRST +
Subjt:  IANLFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWKN----------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDK

Query:  CPFQVVYTKRPRLTFDLASLPITVESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAADKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFR
        CPF VVYTK P    DLA LP   +S   A+T A    ++ KEV + + ++   YK   D+ RR   F+ G+LVM+  R++RFP+G   KL  K+ GPF 
Subjt:  CPFQVVYTKRPRLTFDLASLPITVESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAADKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFR

Query:  IIEKYGDNAFKVELPPDMHIHSVFNIADLKPYYAPDDFQ
        ++ K  DNA+ ++LP D+   S FN+AD+ PY+ PD+ Q
Subjt:  IIEKYGDNAFKVELPPDMHIHSVFNIADLKPYYAPDDFQ

PKU85169.1 RNA-directed DNA polymerase [Dendrobium catenatum]0.0e+0048.68Show/hide
Query:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI
        MD P +K+V+ VA +LKGGASAWW Q++  R R  K  VRSW +MK++L+  FLP +YEQ LY +YQ C QG++TV+EY EEF+RLSAR NL E+E   +
Subjt:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI

Query:  ARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEP---TPTKKTSYTSKTNDQPMAPIHGKGKEADSQTATNEKKAEIINKSK
        AR+  GLR  I++K++L  +  LS+AI+ A   E  ++ + ++ N R T       P  +   +      P   I+         +A  + KA +  K  
Subjt:  ARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEP---TPTKKTSYTSKTNDQPMAPIHGKGKEADSQTATNEKKAEIINKSK

Query:  NQNN-YTRPSLGKCFRCGQPGHLSNSCPQRKTIALAEEEGNLPGE-DESEPREETEEIEVDEGDRISCVIHKVLIAPKEEKSPQRHSLFKTRCTINGKVC
         ++N Y+RPS  KCFRC Q GH SN CP R  I L E E    GE    E   E E++  DEGD++ CV+ ++L+AP++    QR+++F+TRCTINGKVC
Subjt:  NQNN-YTRPSLGKCFRCGQPGHLSNSCPQRKTIALAEEEGNLPGE-DESEPREETEEIEVDEGDRISCVIHKVLIAPKEEKSPQRHSLFKTRCTINGKVC

Query:  DVIIDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHW
        D++ID G  EN ++K +V  L LK   +P+PYKI WVKKG E  V E+C +  S+G SY  +++CDV++MDVCHV+LGRPWQ DT  ++ GR NTY F W
Subjt:  DVIIDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHW

Query:  MGKKVALLPLTKKNEENSKTRGQLFTTVSGKTLLKERK-QDILALVVTGSTNGEQAGELEPQLQQLFEEFPHLKKE--PDGLPPLRDIQHHIDLIPGASL
         G+K+ LLP T  +          F  V+G  L+ +RK +++ A+VVT +            + +L  +F  +  +  P GLP  R IQH IDLIPGA+L
Subjt:  MGKKVALLPLTKKNEENSKTRGQLFTTVSGKTLLKERK-QDILALVVTGSTNGEQAGELEPQLQQLFEEFPHLKKE--PDGLPPLRDIQHHIDLIPGASL

Query:  PNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRI
        PNL HY+M+P+E+  L E +E+LL+K  I+PSLSPCAVPALL PKKD  WRMC+DSRAIN+IT K+RFP+PR+ DLLD+L  A+ FSK+D RSGYHQIRI
Subjt:  PNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRI

Query:  RPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLG
        RPGDEWKTAFKT+ GL+EW VMPFGL NAP+TFMR+MN+VL  F+N F VVYFDDILVYS+  ++H  HL  +FQ L   +L++N  +CEF  + + FLG
Subjt:  RPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLG

Query:  FIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVD
        FII    I+ +PRKV AI +W  P ++ +V +F GLA+FYR+FIR FS + APLTDCLK   F W   +Q S+E IK+ L+S+PVL LP+F  PF+   D
Subjt:  FIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVD

Query:  ACGTGIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALK-----------------------------------------QFNFVIKHQAGKE
        A   GI AVLSQ   PIE+ SEKL+P RQ W+ YEQELYA++RALK                                         +F FV+KH++G +
Subjt:  ACGTGIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALK-----------------------------------------QFNFVIKHQAGKE

Query:  NKVADALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKCSNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFGQDKIFETVS
        N+VADALSR+ +LLT L +EI     L +LY  D DF  IW  CS       Y +  G+LFKG+ LCIP +S R  LI EAHS GLA H G+DK F+ + 
Subjt:  NKVADALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKCSNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFGQDKIFETVS

Query:  IRYYWPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVI
         +++WP+L +D    V+RCS+CQ  KG+  N GLYTPLP+P++IWED+SIDFVLGLP+T+R  DS+MVVVDR SKMAHF+ACKKT DA+ +A LFF E++
Subjt:  IRYYWPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVI

Query:  RLHGIPKTIVSDRDVKFLSHFWKTLWKN----------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQVVYTKR
        RLHG+P++I SDRDVKF+SHFW+ LWK                                    S PKQW+  L QAEFA+N+M NRST K PF +VYTK 
Subjt:  RLHGIPKTIVSDRDVKFLSHFWKTLWKN----------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQVVYTKR

Query:  PRLTFDLASLPITVESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAADKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKYGDNAF
        P   FD+A LP    S K A  + E    + ++V + LI S  +YK+AAD  RR  +F+ GDLVMV +RK RFPAGTY+KL  +++GP  I ++  DNA+
Subjt:  PRLTFDLASLPITVESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAADKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKYGDNAF

Query:  KVELPPDMHIHSVFNIADLKPYYAPDD
         VELP +++  S FN+AD+  Y+ PDD
Subjt:  KVELPPDMHIHSVFNIADLKPYYAPDD

PWA81295.1 transposon Ty3-I Gag-Pol polyprotein [Artemisia annua]0.0e+0049.89Show/hide
Query:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI
        MD P+ ++V++VA KL+GGA AWW++ + NR+ + +RPV +W  MK+++KGRFLP + EQ LY QY NC QG RTV EY  EF RL AR NL E ++   
Subjt:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI

Query:  ARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPTKKTSYTSKTNDQPMAPIHGKGKEADSQTATNEK-KAEIINKSK--
        AR+V GL   I+EK+ L  +  + +A +LA   E M   K     RR T E T    +SY +K N    A          + T+TN K     ++KSK  
Subjt:  ARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPTKKTSYTSKTNDQPMAPIHGKGKEADSQTATNEK-KAEIINKSK--

Query:  ---NQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAEEEGN--LPGEDESEPREETEEIEVDEG--DRISCVIHKVLIAPKEEKSPQRHSLFKTRCTI
             N Y +P   KCFRCG+PGH SN CP+R T+    E GN  + G++  +  ++ E  E  +G  ++I+CVI + L +PK   S QR+ +F+T+C +
Subjt:  ---NQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAEEEGN--LPGEDESEPREETEEIEVDEG--DRISCVIHKVLIAPKEEKSPQRHSLFKTRCTI

Query:  NGKVCDVIIDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENT
          K+C +IIDGGS EN ++K +V    L  EPHPNPY+IGW+KKG    V EIC VPL+IG  Y + + CDV+DM+ CHVLLGRPWQHD    H+G+ N 
Subjt:  NGKVCDVIIDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENT

Query:  YEFHWMGKKVALLPLTKKNEENSKTRGQLFTTVSG-KTLLKERKQD--ILALVVTGSTNGEQAGELEPQLQQLFEEFPHLKKE--PDGLPPLRDIQHHID
        Y F W GK +A+LPL   +         L T VS  K    ERK+     ALVV G  +G     +   ++ + EEF  +  +  PD LPPLR+IQH ID
Subjt:  YEFHWMGKKVALLPLTKKNEENSKTRGQLFTTVSG-KTLLKERKQD--ILALVVTGSTNGEQAGELEPQLQQLFEEFPHLKKE--PDGLPPLRDIQHHID

Query:  LIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRS
        L+PGASLPNL HYRM+P+E   L E +E+LL+KGHI+ S+SPCAVPALLTPKKDGSWRMCVDSRAIN+ITV+YRFPIPR+ DLLDQL  A +FSKID RS
Subjt:  LIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRS

Query:  GYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLK
        GYHQIRI+PGDEWKTAFKT +GL+EW+VMPFGLSNAPSTFMR+M QVL PF+ KF+VVYFDDILVYS    EHL HL+K+ + LTE EL++N K+C FL 
Subjt:  GYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLK

Query:  AEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSS
         ++ FLG+I+    I ++  KV+A+++W  P T+ EV +F GLA+FYR+F+RNFSSI AP+T+C+KKG FKWT   +ESF+ IK+RL ++PVL LP+F +
Subjt:  AEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSS

Query:  PFEAAVDACGTGIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALK-----------------------------------------QFNFVI
         FE   DACGTGI AVLSQ G P+ + SEKL+  RQ WSTYEQELYA+V+A+K                                         +FN+VI
Subjt:  PFEAAVDACGTGIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALK-----------------------------------------QFNFVI

Query:  KHQAGKENKVADALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKCSNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFGQD
        KH++G  NKVADALSRK +LL  +S++++ F+ +  LYE D DF   W +         + +L+G+LFKG++LCIP TSLR  LIKE H+ GL+ H G+D
Subjt:  KHQAGKENKVADALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKCSNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFGQD

Query:  KIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIAN
        K   ++  R+YWPQL++D  +FV+RC +CQ  KG   N GLY PLP+P+S W D+S+DFVLGLP+TQR  DSV VVVDRFSKMAHFI CKKT+DA +IA 
Subjt:  KIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIAN

Query:  LFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWKN----------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPF
        LFF+EV+RLHG+PK+I SDRD KFL+HFW TLW+                                   G KPK WD+SLAQAEFA+N+  + ST   PF
Subjt:  LFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWKN----------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPF

Query:  QVVYTKRPRLTFDLASLPITVESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAADKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIE
         VVY   PR   DL  LP   + + +A  M E ++  H+ V   + +S   YK AADK RR  +F  GD VMV LRK RFP GTY+KL+ K+ GP++I+ 
Subjt:  QVVYTKRPRLTFDLASLPITVESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAADKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIE

Query:  KYGDNAFKVELPPDMHIHSVFNIADLKPYYAPD
        K  DNA+ V+LP  M I   FN++D+  ++  D
Subjt:  KYGDNAFKVELPPDMHIHSVFNIADLKPYYAPD

TYK30863.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]0.0e+0061.87Show/hide
Query:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTL---YNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQ
        M  P++KKV LVALKLKGGASAW                               P++Y Q +   Y+QYQNCRQG++ V EYIEEFHRL AR NLSENEQ
Subjt:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTL---YNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQ

Query:  HQIARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPTKKTSYTSKTNDQPMAPIHGKGKEADSQTATNEKKAEIINKSK
        HQIARF+GGLRFDIKEKVKL   R LSEAISLAETVEEM+ ++ K  NRRT WE  P+KK SY  KT++QP   +  KGK  D Q  TN+KK  ++ + K
Subjt:  HQIARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPTKKTSYTSKTNDQPMAPIHGKGKEADSQTATNEKKAEIINKSK

Query:  NQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAEEEGNLPGEDESEPREETEEIEVDEGDRISCVIHKVLIAPKEEKSPQRHSLFKTRCTINGKVCDV
         QNNYTRPSLGKCFRCG+PGHLSN+C QRKTIALAE+E       + E  EETE IE D+GDRISC++ +VLI PKEE +PQ HSLFKTRCTING     
Subjt:  NQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAEEEGNLPGEDESEPREETEEIEVDEGDRISCVIHKVLIAPKEEKSPQRHSLFKTRCTINGKVCDV

Query:  IIDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMG
                             KV PHP+PYKIGWVKKG E+ +NEICT+PLSIG+SYKDQI+CDVI+MDVCH+LLGRPWQHDTQTLH+GRENTYEF WMG
Subjt:  IIDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMG

Query:  KKVALLPLTKKNEEN--SKTRGQLFTTVSGKTLLKERKQDILALVVTGSTNGEQAGELEPQLQQLFEEFPHLKKEPDGLPPLRDIQHHIDLIPGASLPNL
        KKV LLPL KKN E+   K + QLF TVSGK LLKER+QD+L L+VT  + G  +  +EP+L++LF EFPHLKKEP GLPPLRDIQH IDL+P ASLPNL
Subjt:  KKVALLPLTKKNEEN--SKTRGQLFTTVSGKTLLKERKQDILALVVTGSTNGEQAGELEPQLQQLFEEFPHLKKEPDGLPPLRDIQHHIDLIPGASLPNL

Query:  AHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPG
         HYRM+P+EY  LH+HIEDLLKKGHIKPSLSPCAVPALLTP KDGSWRMCVDSRAINR+T KYRFPIPR+GDLLDQLGKA IFSKID R+GYHQI+IRPG
Subjt:  AHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPG

Query:  DEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFII
        DEWKTAFKTNEGLFE                                          S  ++HL +L+KLF+VLTE ELYIN K+C +L  EI FLGF+I
Subjt:  DEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFII

Query:  KKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACG
        K+GKI MEP+K+EAIQ+   PT++KEV AFLGLASFYR+FIRNFS I APLTD                                  F+SPFE AV+ACG
Subjt:  KKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACG

Query:  TGIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALKQFNFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKC
        TGI AVLSQ+GHPIEY SEKLS +RQ+WSTYEQELYALVRALKQ+                                                       
Subjt:  TGIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALKQFNFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKC

Query:  SNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFGQDKIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSI
         +YL +  +HI+EG LFKG+QLCIPHTSLREAL+KEAHS GLAGHF QDK FE +S RYYWPQLR+D NNFVKRC  CQRAKG+ TNAGLY+PLP   SI
Subjt:  SNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFGQDKIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSI

Query:  WEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWK
        WEDLSIDFVLGLPKTQR HDSVMV+VDRFSKM HFI CKKTNDAIYIANLFF+E++RLHG+PKTIVSDRDVKFLSHFWKTLW+
Subjt:  WEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWK

TrEMBL top hitse value%identityAlignment
A0A2I0VI82 RNA-directed DNA polymerase0.0e+0047.32Show/hide
Query:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI
        M+ P +K+V+ VA +LKGGASAWW QL+ NRQR  K PVR+W +MK++++G FLP +YEQ LY QYQ+C QG R+V+ Y EEF+RLSAR NL E++   +
Subjt:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI

Query:  ARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPTKKTSYTSKTNDQPMAPIHGKGKEA-DSQTATNEKKAEIINKSKNQ
        AR+ GGL+  +++K++L  L  LS+A++ A   E  ++ ++K  + R       T+   Y+      P++  + +   A DSQTA   +           
Subjt:  ARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPTKKTSYTSKTNDQPMAPIHGKGKEA-DSQTATNEKKAEIINKSKNQ

Query:  NNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAEEEGNLPGEDESEPREET-EEIEVDEGDRISCVIHKVLIAPKEEKSPQRHSLFKTRCTINGKVCDVI
        N Y +P+  KCFRC QPGH SN CP R  + + E +     E   +  ++  EE+  DEG+ + C++ ++L+AP++  + QR+++F+TRCTI G+VC+++
Subjt:  NNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAEEEGNLPGEDESEPREET-EEIEVDEGDRISCVIHKVLIAPKEEKSPQRHSLFKTRCTINGKVCDVI

Query:  IDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGK
        ID G  EN +++ +V  L LK   +P PYKI WVKKG E ++ ++C V  SIG SY  +++CDVIDMDVCH++LGRPWQ+D   ++  R NTY F W G+
Subjt:  IDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGK

Query:  KVALLPLTKKNEENSKTRGQLFTTVSGKTLLKERKQDILALVVTGSTNGEQAGELEPQLQQLFEEFPHL--KKEPDGLPPLRDIQHHIDLIPGASLPNLA
        K+ LLP    +  N      +   VSG  LL +    I ALV       +      PQ+ +L  EF  +   + P  LPP+  IQH IDL+PGA+LPNL 
Subjt:  KVALLPLTKKNEENSKTRGQLFTTVSGKTLLKERKQDILALVVTGSTNGEQAGELEPQLQQLFEEFPHL--KKEPDGLPPLRDIQHHIDLIPGASLPNLA

Query:  HYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPGD
        HYRM+P+E+  L E ++DLL++  I+PSLSPCAVPALL PKKD  WRMC+DSRAIN+IT KYRFP+PR+ D+LD+L  + +FSK+D RSGYHQIRIRPGD
Subjt:  HYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPGD

Query:  EWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFIIK
        EWKTAFKT +GLFEW VMPFGL NAP+TFM +M +VL    N+F VVYFDDIL+YSS  ++H+ HL K+ Q L E  LY+N  +CEF  A++ FLGFI+ 
Subjt:  EWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFIIK

Query:  KGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACGT
           ++ +PRK+ AI++W +P T+ +V +F GLA+FYR+FIR FS I AP+ DCLK     W  S Q S++ IK+ L+S+PVL LP+F  PF+   DA   
Subjt:  KGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACGT

Query:  GIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALK-----------------------------------------QFNFVIKHQAGKENKVA
        GI AVLSQ   PIE+ SEKLS  RQ W+ YEQELYA+VRALK                                         +F FV+KH++G +N+VA
Subjt:  GIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALK-----------------------------------------QFNFVIKHQAGKENKVA

Query:  DALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKCSNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFGQDKIFETVSIRYY
        DALSR+ +LLT L +E+   + L ELY  D DFA  W +C+       Y +  G+LFKG+ LCIP +S R+ LIKEAH+ GLA H G++K  + +  R++
Subjt:  DALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKCSNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFGQDKIFETVSIRYY

Query:  WPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHG
        WP+L +D + FV+RC++CQ  KG++ NAGLY PL +P SIWED+SIDFVLGLP+TQR  DS+MVVVDRFSKMAHF+AC+K+ DA+ +A LFF E++RLHG
Subjt:  WPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHG

Query:  IPKTIVSDRDVKFLSHFWKTLWKN----------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQVVYTKRPRLT
        +P++I SDRDVKF+SHFW+ LWK                                     +PKQW+ +L+QAEFA+N+M NRST +CPF +VYTK P   
Subjt:  IPKTIVSDRDVKFLSHFWKTLWKN----------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQVVYTKRPRLT

Query:  FDLASLPITVESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAADKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKYGDNAFKVEL
        FD+A LP    ++K A  + +  + +  EV   LI S  +YKKAAD  RR   F  GDLV++ LRK+RFPAG  +KL  ++ GP  I+++  DNA+ V+L
Subjt:  FDLASLPITVESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAADKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKYGDNAFKVEL

Query:  PPDMHIHSVFNIADLKPYYAPDD
        P  MH    FN+ DL PY+ PDD
Subjt:  PPDMHIHSVFNIADLKPYYAPDD

A0A2U1P6A2 Transposon Ty3-I Gag-Pol polyprotein0.0e+0049.89Show/hide
Query:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI
        MD P+ ++V++VA KL+GGA AWW++ + NR+ + +RPV +W  MK+++KGRFLP + EQ LY QY NC QG RTV EY  EF RL AR NL E ++   
Subjt:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI

Query:  ARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPTKKTSYTSKTNDQPMAPIHGKGKEADSQTATNEK-KAEIINKSK--
        AR+V GL   I+EK+ L  +  + +A +LA   E M   K     RR T E T    +SY +K N    A          + T+TN K     ++KSK  
Subjt:  ARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPTKKTSYTSKTNDQPMAPIHGKGKEADSQTATNEK-KAEIINKSK--

Query:  ---NQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAEEEGN--LPGEDESEPREETEEIEVDEG--DRISCVIHKVLIAPKEEKSPQRHSLFKTRCTI
             N Y +P   KCFRCG+PGH SN CP+R T+    E GN  + G++  +  ++ E  E  +G  ++I+CVI + L +PK   S QR+ +F+T+C +
Subjt:  ---NQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAEEEGN--LPGEDESEPREETEEIEVDEG--DRISCVIHKVLIAPKEEKSPQRHSLFKTRCTI

Query:  NGKVCDVIIDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENT
          K+C +IIDGGS EN ++K +V    L  EPHPNPY+IGW+KKG    V EIC VPL+IG  Y + + CDV+DM+ CHVLLGRPWQHD    H+G+ N 
Subjt:  NGKVCDVIIDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENT

Query:  YEFHWMGKKVALLPLTKKNEENSKTRGQLFTTVSG-KTLLKERKQD--ILALVVTGSTNGEQAGELEPQLQQLFEEFPHLKKE--PDGLPPLRDIQHHID
        Y F W GK +A+LPL   +         L T VS  K    ERK+     ALVV G  +G     +   ++ + EEF  +  +  PD LPPLR+IQH ID
Subjt:  YEFHWMGKKVALLPLTKKNEENSKTRGQLFTTVSG-KTLLKERKQD--ILALVVTGSTNGEQAGELEPQLQQLFEEFPHLKKE--PDGLPPLRDIQHHID

Query:  LIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRS
        L+PGASLPNL HYRM+P+E   L E +E+LL+KGHI+ S+SPCAVPALLTPKKDGSWRMCVDSRAIN+ITV+YRFPIPR+ DLLDQL  A +FSKID RS
Subjt:  LIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRS

Query:  GYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLK
        GYHQIRI+PGDEWKTAFKT +GL+EW+VMPFGLSNAPSTFMR+M QVL PF+ KF+VVYFDDILVYS    EHL HL+K+ + LTE EL++N K+C FL 
Subjt:  GYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLK

Query:  AEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSS
         ++ FLG+I+    I ++  KV+A+++W  P T+ EV +F GLA+FYR+F+RNFSSI AP+T+C+KKG FKWT   +ESF+ IK+RL ++PVL LP+F +
Subjt:  AEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSS

Query:  PFEAAVDACGTGIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALK-----------------------------------------QFNFVI
         FE   DACGTGI AVLSQ G P+ + SEKL+  RQ WSTYEQELYA+V+A+K                                         +FN+VI
Subjt:  PFEAAVDACGTGIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALK-----------------------------------------QFNFVI

Query:  KHQAGKENKVADALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKCSNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFGQD
        KH++G  NKVADALSRK +LL  +S++++ F+ +  LYE D DF   W +         + +L+G+LFKG++LCIP TSLR  LIKE H+ GL+ H G+D
Subjt:  KHQAGKENKVADALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKCSNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFGQD

Query:  KIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIAN
        K   ++  R+YWPQL++D  +FV+RC +CQ  KG   N GLY PLP+P+S W D+S+DFVLGLP+TQR  DSV VVVDRFSKMAHFI CKKT+DA +IA 
Subjt:  KIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIAN

Query:  LFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWKN----------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPF
        LFF+EV+RLHG+PK+I SDRD KFL+HFW TLW+                                   G KPK WD+SLAQAEFA+N+  + ST   PF
Subjt:  LFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWKN----------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPF

Query:  QVVYTKRPRLTFDLASLPITVESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAADKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIE
         VVY   PR   DL  LP   + + +A  M E ++  H+ V   + +S   YK AADK RR  +F  GD VMV LRK RFP GTY+KL+ K+ GP++I+ 
Subjt:  QVVYTKRPRLTFDLASLPITVESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAADKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIE

Query:  KYGDNAFKVELPPDMHIHSVFNIADLKPYYAPD
        K  DNA+ V+LP  M I   FN++D+  ++  D
Subjt:  KYGDNAFKVELPPDMHIHSVFNIADLKPYYAPD

A0A5B7BER3 Uncharacterized protein0.0e+0053Show/hide
Query:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI
        M+  + K+V+LVA KLKGGASAWW+Q++ NR+R  K+PVR+W+KM++LL+ RFLP++YEQ LY QYQNCRQG R+V+EY +EF+ LS+R NL+E E  Q+
Subjt:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI

Query:  ARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPTKKTSYTSKTNDQPMAPIHGKGKEADSQTATNEKKAEIINKSKNQN
        AR+VGGLR  I++++ L+ +  L+EA SLA  VE   + +           P  ++      K   + + P   K    D  +++  +   I    K+ N
Subjt:  ARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPTKKTSYTSKTNDQPMAPIHGKGKEADSQTATNEKKAEIINKSKNQN

Query:  NYTRPSLGKCFRCGQPGHLSNSCPQRKTI-ALAEEEGNLP---GEDESEPREE---TEEIEVDEGDRISCVIHKVLIAPKEEKSPQRHSLFKTRCTINGK
         Y RP  GKCFRC QPGH SN CP R+ +  +   E N P    E+E+E ++E    E  E DEG+ +SCV+ ++L+ PK+E  PQRH++F+TRCTIN K
Subjt:  NYTRPSLGKCFRCGQPGHLSNSCPQRKTI-ALAEEEGNLP---GEDESEPREE---TEEIEVDEGDRISCVIHKVLIAPKEEKSPQRHSLFKTRCTINGK

Query:  VCDVIIDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEF
        VCDVIID GS+EN ++K +V  L LK E HPNPYKIGW+KKG E+ V EIC VP SIG  YKD++ CD++DMD CHVLLGRPWQ D    HKG++NTY F
Subjt:  VCDVIIDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEF

Query:  HWMGKKVALLPLTKKNE--ENSKTRGQLFTTVSGKTLLKERKQ--DILALVVTGSTNGEQAGELEPQLQQLFEEFPHL--KKEPDGLPPLRDIQHHIDLI
         W  KKV L+P  K +   + SK  G+   TV+G   +++ K+   I+ ++V G T G +  ++   LQ L  EF  +   + PD LPP+RDIQHHIDL+
Subjt:  HWMGKKVALLPLTKKNE--ENSKTRGQLFTTVSGKTLLKERKQ--DILALVVTGSTNGEQAGELEPQLQQLFEEFPHL--KKEPDGLPPLRDIQHHIDLI

Query:  PGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGY
        PGASLPNL HYRM+P+E   L + +EDL+ KG I+ S+SPCAVPALLTPKKDGSWRMCVDSRAIN+ITVKYRFPIPR+ D+LD L  + IFSKID RSGY
Subjt:  PGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGY

Query:  HQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAE
        HQIRIRPGDEWKTAFKT EGL+EW+VMPFGLSNAPSTFMR+MNQVL PF+ KF+VVYFDDIL+YS    EHL H++++   L E +LYIN K+C FL   
Subjt:  HQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAE

Query:  ITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPF
        + FLGFII    I ++  KV AI++W  P TV ++ +F GLA+FYR+FIRNFSSI AP+TDC+KKG F+W   Q+ SF  IK++L+++PVL LP F   F
Subjt:  ITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPF

Query:  EAAVDACGTGIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALK-----------------------------------------QFNFVIKH
        +   DA  TGI AVLSQ G P+E+ SEKL+  RQ W+TYE EL+A+VRALK                                         +F FV+KH
Subjt:  EAAVDACGTGIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALK-----------------------------------------QFNFVIKH

Query:  QAGKENKVADALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKCSNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFGQDKI
        +AG++NKVADALSR+ +LL ++SSEI +F+ L ELY+ D DF   W KC     +  +HI +G+LFKG+QLCIP TSLRE ++++ HS GL GH G+DK 
Subjt:  QAGKENKVADALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKCSNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFGQDKI

Query:  FETVSIRYYWPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLF
           V  RYYWPQL++D   FV++C ICQ AKG   N GLYTPLP+P+ IWEDL++DF+LGLP+TQR  DSV VVVDRFSKMAHFI CKKT+DA ++ANLF
Subjt:  FETVSIRYYWPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLF

Query:  FKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWK----------------------------------NGSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQV
        F+E++RLHG+PK+I SDRDVKFLSHFW+TLW+                                  +G +PKQWD+ L Q EFA+N M NRST K PF++
Subjt:  FKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWK----------------------------------NGSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQV

Query:  VYTKRPRLTFDLASLPITVESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAADKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKY
        VYTK P+   DLA LP    S   AE  A+    + +EV  +L ++ + YK AADK RR  VF++GDLVMV LRKNRFP GTYNKLK+++ GPFR+  K 
Subjt:  VYTKRPRLTFDLASLPITVESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAADKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKY

Query:  GDNAFKVELPPDMHIHSVFNIADLKPYYAPDD
         DNA+ VELP DM I S FN+ADL  Y+ PD+
Subjt:  GDNAFKVELPPDMHIHSVFNIADLKPYYAPDD

A0A5D3E417 Transposon Ty3-I Gag-Pol polyprotein isoform X10.0e+0061.87Show/hide
Query:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTL---YNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQ
        M  P++KKV LVALKLKGGASAW                               P++Y Q +   Y+QYQNCRQG++ V EYIEEFHRL AR NLSENEQ
Subjt:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTL---YNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQ

Query:  HQIARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPTKKTSYTSKTNDQPMAPIHGKGKEADSQTATNEKKAEIINKSK
        HQIARF+GGLRFDIKEKVKL   R LSEAISLAETVEEM+ ++ K  NRRT WE  P+KK SY  KT++QP   +  KGK  D Q  TN+KK  ++ + K
Subjt:  HQIARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPTKKTSYTSKTNDQPMAPIHGKGKEADSQTATNEKKAEIINKSK

Query:  NQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAEEEGNLPGEDESEPREETEEIEVDEGDRISCVIHKVLIAPKEEKSPQRHSLFKTRCTINGKVCDV
         QNNYTRPSLGKCFRCG+PGHLSN+C QRKTIALAE+E       + E  EETE IE D+GDRISC++ +VLI PKEE +PQ HSLFKTRCTING     
Subjt:  NQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAEEEGNLPGEDESEPREETEEIEVDEGDRISCVIHKVLIAPKEEKSPQRHSLFKTRCTINGKVCDV

Query:  IIDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMG
                             KV PHP+PYKIGWVKKG E+ +NEICT+PLSIG+SYKDQI+CDVI+MDVCH+LLGRPWQHDTQTLH+GRENTYEF WMG
Subjt:  IIDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMG

Query:  KKVALLPLTKKNEEN--SKTRGQLFTTVSGKTLLKERKQDILALVVTGSTNGEQAGELEPQLQQLFEEFPHLKKEPDGLPPLRDIQHHIDLIPGASLPNL
        KKV LLPL KKN E+   K + QLF TVSGK LLKER+QD+L L+VT  + G  +  +EP+L++LF EFPHLKKEP GLPPLRDIQH IDL+P ASLPNL
Subjt:  KKVALLPLTKKNEEN--SKTRGQLFTTVSGKTLLKERKQDILALVVTGSTNGEQAGELEPQLQQLFEEFPHLKKEPDGLPPLRDIQHHIDLIPGASLPNL

Query:  AHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPG
         HYRM+P+EY  LH+HIEDLLKKGHIKPSLSPCAVPALLTP KDGSWRMCVDSRAINR+T KYRFPIPR+GDLLDQLGKA IFSKID R+GYHQI+IRPG
Subjt:  AHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPG

Query:  DEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFII
        DEWKTAFKTNEGLFE                                          S  ++HL +L+KLF+VLTE ELYIN K+C +L  EI FLGF+I
Subjt:  DEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFII

Query:  KKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACG
        K+GKI MEP+K+EAIQ+   PT++KEV AFLGLASFYR+FIRNFS I APLTD                                  F+SPFE AV+ACG
Subjt:  KKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACG

Query:  TGIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALKQFNFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKC
        TGI AVLSQ+GHPIEY SEKLS +RQ+WSTYEQELYALVRALKQ+                                                       
Subjt:  TGIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALKQFNFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKC

Query:  SNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFGQDKIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSI
         +YL +  +HI+EG LFKG+QLCIPHTSLREAL+KEAHS GLAGHF QDK FE +S RYYWPQLR+D NNFVKRC  CQRAKG+ TNAGLY+PLP   SI
Subjt:  SNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFGQDKIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSI

Query:  WEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWK
        WEDLSIDFVLGLPKTQR HDSVMV+VDRFSKM HFI CKKTNDAIYIANLFF+E++RLHG+PKTIVSDRDVKFLSHFWKTLW+
Subjt:  WEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWK

A0A6N2LVR1 Uncharacterized protein0.0e+0050.67Show/hide
Query:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI
        M+ PE KKV+LVA +L GGASAWWEQL+  R R  K  V+SW KM++LL+ R+LP +YEQ L+ QYQNC+QG R V  Y+EEFHRLS+R NL E +  Q+
Subjt:  MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQI

Query:  ARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPT-KKTSYTSKTNDQPMAPIHG------KGKEADSQTATNEKKAEII
        ARFVGGLR++I+++V +  +  L+EAI+L        A KA+T   RTT  P P    T  ++     P+ P +       KG  +     +       +
Subjt:  ARFVGGLRFDIKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPT-KKTSYTSKTNDQPMAPIHG------KGKEADSQTATNEKKAEII

Query:  NKSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAE--EEGNLPGED-ESEPREETEEIEV---DEGDRIS--CVIHKVLIAPKEEKSPQRHSLFK
             +N Y+RP+  KC+RCGQ GH SN+CP+R  + L E  EE ++ GE  E+E      E EV   DEG+ +S   V+ ++++APK E   QR+++F+
Subjt:  NKSKNQNNYTRPSLGKCFRCGQPGHLSNSCPQRKTIALAE--EEGNLPGED-ESEPREETEEIEV---DEGDRIS--CVIHKVLIAPKEEKSPQRHSLFK

Query:  TRCTINGKVCDVIIDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHK
        TRCT+N KVCDVIID GS+EN I+K +V+ L LK E H  PYKIGW+KKG E+ V E C    SIG +Y D+I+CDV++MD CHV+LGRPWQ+D    +K
Subjt:  TRCTINGKVCDVIIDGGSNENFIAKKIVSNLNLKVEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHK

Query:  GRENTYEFHWMGKKVALLPLTKK-NEENSKTRGQLFTTVSGKTLLKERKQDI-LALVVTGSTNGEQAGELEPQLQQLFEEFPHLKKE--PDGLPPLRDIQ
        G++N Y F   G+KV L PL +       + + +    V G+T L +  +D  +  V+ G      +  +   LQ L  EF  +  E  P+GLPP+RDIQ
Subjt:  GRENTYEFHWMGKKVALLPLTKK-NEENSKTRGQLFTTVSGKTLLKERKQDI-LALVVTGSTNGEQAGELEPQLQQLFEEFPHLKKE--PDGLPPLRDIQ

Query:  HHIDLIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKI
        HHIDLIPGASLPN  HYRM+P+E A L   +E+L+KKG ++ S+SPCAVPALL PKKDGSWRMC+DSRAIN+IT+KYRFPIPR+ D+LD L  + IFSKI
Subjt:  HHIDLIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKI

Query:  DPRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRC
        D RSGYHQIRIRPGDEWKTAFKT EGL+EW+VMPFGLSNAPSTFMR+MNQVL PF   F+VVYFDDIL+YS    +H+ HL+++F VL   +L++N  +C
Subjt:  DPRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRC

Query:  EFLKAEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLP
         F+ + + FLGF++    I ++  KV AI++W  P  + EV +F GLA+FYR+F+R+FS I AP+T+C+KKG F W    + SF  IK++LAS+PVL LP
Subjt:  EFLKAEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLP

Query:  DFSSPFEAAVDACGTGIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALK-----------------------------------------QF
        DF   FE   DA   GI AVLSQ   P+ + SEKLS  R+ WSTYE ELYA+ RA+K                                         +F
Subjt:  DFSSPFEAAVDACGTGIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALK-----------------------------------------QF

Query:  NFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKCSNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGH
        NF +KH++G+ NKVADALSRK SLLT L +E+I F+ + +LY  D DF + W KC   L  EG H  +G+LF+G+QLCIP +SLRE +I E H  GL GH
Subjt:  NFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFKHLPELYERDTDFADIWHKCSNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGH

Query:  FGQDKIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAI
         G+DK       RYYWPQL++D  N VKRC  CQ +KG   N GLY PLPIP   WEDLS+DF+LGLP+TQR  DSV VVVDRFSKMAHFIACKKT+DA+
Subjt:  FGQDKIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAI

Query:  YIANLFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWK----------------------------------NGSKPKQWDLSLAQAEFAFNNMKNRSTD
        ++ANLFFKEV+RLHG+PK+I SDRD KFLSHFW+TLW+                                  +G +PKQWDL+LAQAEFA+N+M NRST 
Subjt:  YIANLFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWK----------------------------------NGSKPKQWDLSLAQAEFAFNNMKNRSTD

Query:  KCPFQVVYTKRPRLTFDLASLPITVESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAADKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPF
        K PFQVVY + P+   DL  LP     +  AE MA+ +  + +EV  +L  S + YK AADKKRR  +F +GDLVMV+LRK R P GT +KL DK+ GP+
Subjt:  KCPFQVVYTKRPRLTFDLASLPITVESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAADKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPF

Query:  RIIEKYGDNAFKVELPPDMHIHSVFNIADLKPYYAPDD
        +I++K  DNA++V+LP DM I   FN+ADL  Y+ PD+
Subjt:  RIIEKYGDNAFKVELPPDMHIHSVFNIADLKPYYAPDD

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.4e-10729.39Show/hide
Query:  EPQLQQLFEEFPHLKKE--PDGLP-PLRDIQHHIDLIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRA
        EP+L  +++EF  +  E   + LP P++ ++  ++L        + +Y + P +  A+++ I   LK G I+ S +  A P +  PKK+G+ RM VD + 
Subjt:  EPQLQQLFEEFPHLKKE--PDGLP-PLRDIQHHIDLIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRA

Query:  INRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILV
        +N+      +P+P +  LL ++  +TIF+K+D +S YH IR+R GDE K AF+   G+FE++VMP+G+S AP+ F   +N +L       +V Y DDIL+
Subjt:  INRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILV

Query:  YSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCL
        +S    EH+ H+K + Q L    L INQ +CEF ++++ F+G+ I +   +     ++ +  W  P   KE+  FLG  ++ RKFI   S +  PL + L
Subjt:  YSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCL

Query:  KKG-NFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACGTGIRAVLSQRG-----HPIEYLSEKLSPTRQTWSTYEQELYALVRALK------
        KK   +KWTP+Q ++ E IK+ L S PVL+  DFS       DA    + AVLSQ+      +P+ Y S K+S  +  +S  ++E+ A++++LK      
Subjt:  KKG-NFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACGTGIRAVLSQRG-----HPIEYLSEKLSPTRQTWSTYEQELYALVRALK------

Query:  ---------------------------------------QFNFVIKHQAGKENKVADALSR------------KGSLLTLLSSEIIA--FKH-LPELYER
                                                FNF I ++ G  N +ADALSR            + + +  ++   I   FK+ +   Y  
Subjt:  ---------------------------------------QFNFVIKHQAGKENKVADALSR------------KGSLLTLLSSEIIA--FKH-LPELYER

Query:  DTDFADIWHKCSNYLRAEGYHILEGFLFKG-DQLCIPH-TSLREALIKEAHSNGLAGHFGQDKIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKG-SRT
        DT   ++ +     +  E   + +G L    DQ+ +P+ T L   +IK+ H  G   H G + +   +  R+ W  +RK    +V+ C  CQ  K  +  
Subjt:  DTDFADIWHKCSNYLRAEGYHILEGFLFKG-DQLCIPH-TSLREALIKEAHSNGLAGHFGQDKIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKG-SRT

Query:  NAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWKN--
          G   P+P  +  WE LS+DF+  LP++   ++++ VVVDRFSKMA  + C K+  A   A +F + VI   G PK I++D D  F S  WK       
Subjt:  NAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWKN--

Query:  --------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQVVYTKRPRLTFDLASLPITVESHKEAETMAENIEKL
                                         + P  W   ++  + ++NN  + +T   PF++V+   P     L+ L +   S K  E   E I+ +
Subjt:  --------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQVVYTKRPRLTFDLASLPITVESHKEAETMAENIEKL

Query:  HKEVHDHLIQSTDSYKKAADKKRRQ-AVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKYGDNAFKVELPPDMH--IHSVFNIADLKPY
         + V +HL  +    KK  D K ++   F  GDLVMV   K  F     NKL     GPF +++K G N ++++LP  +     S F+++ L+ Y
Subjt:  HKEVHDHLIQSTDSYKKAADKKRRQ-AVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKYGDNAFKVELPPDMH--IHSVFNIADLKPY

P0CT35 Transposon Tf2-2 polyprotein1.4e-10729.39Show/hide
Query:  EPQLQQLFEEFPHLKKE--PDGLP-PLRDIQHHIDLIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRA
        EP+L  +++EF  +  E   + LP P++ ++  ++L        + +Y + P +  A+++ I   LK G I+ S +  A P +  PKK+G+ RM VD + 
Subjt:  EPQLQQLFEEFPHLKKE--PDGLP-PLRDIQHHIDLIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRA

Query:  INRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILV
        +N+      +P+P +  LL ++  +TIF+K+D +S YH IR+R GDE K AF+   G+FE++VMP+G+S AP+ F   +N +L       +V Y DDIL+
Subjt:  INRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILV

Query:  YSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCL
        +S    EH+ H+K + Q L    L INQ +CEF ++++ F+G+ I +   +     ++ +  W  P   KE+  FLG  ++ RKFI   S +  PL + L
Subjt:  YSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCL

Query:  KKG-NFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACGTGIRAVLSQRG-----HPIEYLSEKLSPTRQTWSTYEQELYALVRALK------
        KK   +KWTP+Q ++ E IK+ L S PVL+  DFS       DA    + AVLSQ+      +P+ Y S K+S  +  +S  ++E+ A++++LK      
Subjt:  KKG-NFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACGTGIRAVLSQRG-----HPIEYLSEKLSPTRQTWSTYEQELYALVRALK------

Query:  ---------------------------------------QFNFVIKHQAGKENKVADALSR------------KGSLLTLLSSEIIA--FKH-LPELYER
                                                FNF I ++ G  N +ADALSR            + + +  ++   I   FK+ +   Y  
Subjt:  ---------------------------------------QFNFVIKHQAGKENKVADALSR------------KGSLLTLLSSEIIA--FKH-LPELYER

Query:  DTDFADIWHKCSNYLRAEGYHILEGFLFKG-DQLCIPH-TSLREALIKEAHSNGLAGHFGQDKIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKG-SRT
        DT   ++ +     +  E   + +G L    DQ+ +P+ T L   +IK+ H  G   H G + +   +  R+ W  +RK    +V+ C  CQ  K  +  
Subjt:  DTDFADIWHKCSNYLRAEGYHILEGFLFKG-DQLCIPH-TSLREALIKEAHSNGLAGHFGQDKIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKG-SRT

Query:  NAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWKN--
          G   P+P  +  WE LS+DF+  LP++   ++++ VVVDRFSKMA  + C K+  A   A +F + VI   G PK I++D D  F S  WK       
Subjt:  NAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWKN--

Query:  --------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQVVYTKRPRLTFDLASLPITVESHKEAETMAENIEKL
                                         + P  W   ++  + ++NN  + +T   PF++V+   P     L+ L +   S K  E   E I+ +
Subjt:  --------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQVVYTKRPRLTFDLASLPITVESHKEAETMAENIEKL

Query:  HKEVHDHLIQSTDSYKKAADKKRRQ-AVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKYGDNAFKVELPPDMH--IHSVFNIADLKPY
         + V +HL  +    KK  D K ++   F  GDLVMV   K  F     NKL     GPF +++K G N ++++LP  +     S F+++ L+ Y
Subjt:  HKEVHDHLIQSTDSYKKAADKKRRQ-AVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKYGDNAFKVELPPDMH--IHSVFNIADLKPY

P0CT41 Transposon Tf2-12 polyprotein1.4e-10729.39Show/hide
Query:  EPQLQQLFEEFPHLKKE--PDGLP-PLRDIQHHIDLIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRA
        EP+L  +++EF  +  E   + LP P++ ++  ++L        + +Y + P +  A+++ I   LK G I+ S +  A P +  PKK+G+ RM VD + 
Subjt:  EPQLQQLFEEFPHLKKE--PDGLP-PLRDIQHHIDLIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRA

Query:  INRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILV
        +N+      +P+P +  LL ++  +TIF+K+D +S YH IR+R GDE K AF+   G+FE++VMP+G+S AP+ F   +N +L       +V Y DDIL+
Subjt:  INRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILV

Query:  YSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCL
        +S    EH+ H+K + Q L    L INQ +CEF ++++ F+G+ I +   +     ++ +  W  P   KE+  FLG  ++ RKFI   S +  PL + L
Subjt:  YSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCL

Query:  KKG-NFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACGTGIRAVLSQRG-----HPIEYLSEKLSPTRQTWSTYEQELYALVRALK------
        KK   +KWTP+Q ++ E IK+ L S PVL+  DFS       DA    + AVLSQ+      +P+ Y S K+S  +  +S  ++E+ A++++LK      
Subjt:  KKG-NFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACGTGIRAVLSQRG-----HPIEYLSEKLSPTRQTWSTYEQELYALVRALK------

Query:  ---------------------------------------QFNFVIKHQAGKENKVADALSR------------KGSLLTLLSSEIIA--FKH-LPELYER
                                                FNF I ++ G  N +ADALSR            + + +  ++   I   FK+ +   Y  
Subjt:  ---------------------------------------QFNFVIKHQAGKENKVADALSR------------KGSLLTLLSSEIIA--FKH-LPELYER

Query:  DTDFADIWHKCSNYLRAEGYHILEGFLFKG-DQLCIPH-TSLREALIKEAHSNGLAGHFGQDKIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKG-SRT
        DT   ++ +     +  E   + +G L    DQ+ +P+ T L   +IK+ H  G   H G + +   +  R+ W  +RK    +V+ C  CQ  K  +  
Subjt:  DTDFADIWHKCSNYLRAEGYHILEGFLFKG-DQLCIPH-TSLREALIKEAHSNGLAGHFGQDKIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKG-SRT

Query:  NAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWKN--
          G   P+P  +  WE LS+DF+  LP++   ++++ VVVDRFSKMA  + C K+  A   A +F + VI   G PK I++D D  F S  WK       
Subjt:  NAGLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWKN--

Query:  --------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQVVYTKRPRLTFDLASLPITVESHKEAETMAENIEKL
                                         + P  W   ++  + ++NN  + +T   PF++V+   P     L+ L +   S K  E   E I+ +
Subjt:  --------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQVVYTKRPRLTFDLASLPITVESHKEAETMAENIEKL

Query:  HKEVHDHLIQSTDSYKKAADKKRRQ-AVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKYGDNAFKVELPPDMH--IHSVFNIADLKPY
         + V +HL  +    KK  D K ++   F  GDLVMV   K  F     NKL     GPF +++K G N ++++LP  +     S F+++ L+ Y
Subjt:  HKEVHDHLIQSTDSYKKAADKKRRQ-AVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKYGDNAFKVELPPDMH--IHSVFNIADLKPY

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.6e-12231.34Show/hide
Query:  TGSTNGEQAGELEPQLQQLFEE-----FPHLKKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTP
        T  +N +    L   LQQ + E      P    + + +P    ++H I++ PGA LP L  Y +T +    +++ ++ LL    I PS SPC+ P +L P
Subjt:  TGSTNGEQAGELEPQLQQLFEE-----FPHLKKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTP

Query:  KKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPF
        KKDG++R+CVD R +N+ T+   FP+PR+ +LL ++G A IF+ +D  SGYHQI + P D +KTAF T  G +E+ VMPFGL NAPSTF R M       
Subjt:  KKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPF

Query:  LNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFI
          +F+ VY DDIL++S   +EH  HL  + + L  + L + +K+C+F   E  FLG+ I   KI+    K  AI+++  P TVK+   FLG+ ++YR+FI
Subjt:  LNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFI

Query:  RNFSSICAP--LTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACGTGIRAVLSQRGHP------IEYLSEKLSPTRQTWSTYEQ
         N S I  P  L  C K    +WT  Q ++ E++K  L +SPVL   +  + +    DA   GI AVL +  +       + Y S+ L   ++ +   E 
Subjt:  RNFSSICAP--LTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACGTGIRAVLSQRGHP------IEYLSEKLSPTRQTWSTYEQ

Query:  ELYALVRALKQF-----------------------------------------NFVIKHQAGKENKVADALSRKGSLLTLLSSEII--------------
        EL  +++AL  F                                         +F +++ AG +N VADA+SR    +T  +S  I              
Subjt:  ELYALVRALKQF-----------------------------------------NFVIKHQAGKENKVADALSRKGSLLTLLSSEII--------------

Query:  ---AFKHLPELYERDTDFADIWHKCSNYLRAE-------GYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGL-AGHFGQDKIFETVSIRYYWPQLRKD
              H+ EL + +    D+    S   + E        Y + +  ++  D+L +P    + A+++  H + L  GHFG       +S  YYWP+L+  
Subjt:  ---AFKHLPELYERDTDFADIWHKCSNYLRAE-------GYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGL-AGHFGQDKIFETVSIRYYWPQLRKD

Query:  SNNFVKRCSICQRAKGSRTNA-GLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHGIPKTIV
           +++ C  CQ  K  R    GL  PLPI +  W D+S+DFV GLP T  N + ++VVVDRFSK AHFIA +KT DA  + +L F+ +   HG P+TI 
Subjt:  SNNFVKRCSICQRAKGSRTNA-GLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHGIPKTIV

Query:  SDRDVKFLSHFWKTLWKN----------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQVVYTKRPRLTFDLASL
        SDRDV+  +  ++ L K                                    +  + W + L Q EF +N+   R+  K PF++          DL  L
Subjt:  SDRDVKFLSHFWKTLWKN----------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQVVYTKRPRLTFDLASL

Query:  PIT--VESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAA-------DKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKYGDNAFK
        P T  ++S  E    +    +L K +    IQ+ +  + A        +++R+  + + GD V+VH R   F  G Y K++   +GPFR+++K  DNA++
Subjt:  PIT--VESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAA-------DKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKYGDNAFK

Query:  VELPPDMHIHSVFNIADLKPYY
        ++L      H V N+  LK  Y
Subjt:  VELPPDMHIHSVFNIADLKPYY

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.1e-12231.25Show/hide
Query:  TGSTNGEQAGELEPQLQQLFEE-----FPHLKKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTP
        T  +N +    L   LQQ + E      P    + + +P    ++H I++ PGA LP L  Y +T +    +++ ++ LL    I PS SPC+ P +L P
Subjt:  TGSTNGEQAGELEPQLQQLFEE-----FPHLKKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTP

Query:  KKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPF
        KKDG++R+CVD R +N+ T+   FP+PR+ +LL ++G A IF+ +D  SGYHQI + P D +KTAF T  G +E+ VMPFGL NAPSTF R M       
Subjt:  KKDGSWRMCVDSRAINRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPF

Query:  LNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFI
          +F+ VY DDIL++S   +EH  HL  + + L  + L + +K+C+F   E  FLG+ I   KI+    K  AI+++  P TVK+   FLG+ ++YR+FI
Subjt:  LNKFIVVYFDDILVYSSGNDEHLLHLKKLFQVLTEKELYINQKRCEFLKAEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFI

Query:  RNFSSICAP--LTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACGTGIRAVLSQRGHP------IEYLSEKLSPTRQTWSTYEQ
         N S I  P  L  C K    +WT  Q ++ +++K  L +SPVL   +  + +    DA   GI AVL +  +       + Y S+ L   ++ +   E 
Subjt:  RNFSSICAP--LTDCLKKGNFKWTPSQQESFEEIKKRLASSPVLQLPDFSSPFEAAVDACGTGIRAVLSQRGHP------IEYLSEKLSPTRQTWSTYEQ

Query:  ELYALVRALKQF-----------------------------------------NFVIKHQAGKENKVADALSRKGSLLTLLSSEII--------------
        EL  +++AL  F                                         +F +++ AG +N VADA+SR    +T  +S  I              
Subjt:  ELYALVRALKQF-----------------------------------------NFVIKHQAGKENKVADALSRKGSLLTLLSSEII--------------

Query:  ---AFKHLPELYERDTDFADIWHKCSNYLRAE-------GYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGL-AGHFGQDKIFETVSIRYYWPQLRKD
              H+ EL + +    D+    S   + E        Y + +  ++  D+L +P    + A+++  H + L  GHFG       +S  YYWP+L+  
Subjt:  ---AFKHLPELYERDTDFADIWHKCSNYLRAE-------GYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGL-AGHFGQDKIFETVSIRYYWPQLRKD

Query:  SNNFVKRCSICQRAKGSRTNA-GLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHGIPKTIV
           +++ C  CQ  K  R    GL  PLPI +  W D+S+DFV GLP T  N + ++VVVDRFSK AHFIA +KT DA  + +L F+ +   HG P+TI 
Subjt:  SNNFVKRCSICQRAKGSRTNA-GLYTPLPIPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHGIPKTIV

Query:  SDRDVKFLSHFWKTLWKN----------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQVVYTKRPRLTFDLASL
        SDRDV+  +  ++ L K                                    +  + W + L Q EF +N+   R+  K PF++          DL  L
Subjt:  SDRDVKFLSHFWKTLWKN----------------------------------GSKPKQWDLSLAQAEFAFNNMKNRSTDKCPFQVVYTKRPRLTFDLASL

Query:  PIT--VESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAA-------DKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKYGDNAFK
        P T  ++S  E    +    +L K +    IQ+ +  + A        +++R+  + + GD V+VH R   F  G Y K++   +GPFR+++K  DNA++
Subjt:  PIT--VESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAA-------DKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIEKYGDNAFK

Query:  VELPPDMHIHSVFNIADLKPY-YAPDDF
        ++L      H V N+  LK + Y PD +
Subjt:  VELPPDMHIHSVFNIADLKPY-YAPDDF

Arabidopsis top hitse value%identityAlignment
AT4G13320.1 unknown protein4.5e-1331.71Show/hide
Query:  LFKTRCTINGKVCDVIIDGGSNENFIAKKIVSNLNLK-VEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDM--DVCHVLLGRPWQHD
        +F+T+C IN + C +++ GG+  N I+K +V  L LK ++ +P+   +    +  +    E C VP+SIG  YKD++ C V++M  +   +L G PW + 
Subjt:  LFKTRCTINGKVCDVIIDGGSNENFIAKKIVSNLNLK-VEPHPNPYKIGWVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDM--DVCHVLLGRPWQHD

Query:  TQTLHKGRENTYEFHWMGKKVAL
         Q  H GR+++    W    + L
Subjt:  TQTLHKGRENTYEFHWMGKKVAL

ATMG00860.1 DNA/RNA polymerases superfamily protein5.3e-2237.59Show/hide
Query:  HLKKLFQVLTEKELYINQKRCEFLKAEITFLG--FIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWT
        HL  + Q+  + + Y N+K+C F + +I +LG   II    +S +P K+EA+  W  P    E+  FLGL  +YR+F++N+  I  PLT+ LKK + KWT
Subjt:  HLKKLFQVLTEKELYINQKRCEFLKAEITFLG--FIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWT

Query:  PSQQESFEEIKKRLASSPVLQLPDFSSPFEAAV
             +F+ +K  + + PVL LPD   PF   V
Subjt:  PSQQESFEEIKKRLASSPVLQLPDFSSPFEAAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACACCCCTGAACAGAAAAAGGTACGTCTCGTGGCCTTGAAACTCAAAGGGGGCGCATCAGCATGGTGGGAGCAACTGGAAGCCAACAGGCAAAGATACAACAAACG
ACCCGTACGCTCATGGGAAAAGATGAAGAAACTATTAAAAGGACGATTCCTGCCTCTGAATTATGAGCAGACCTTATACAACCAGTATCAGAATTGTCGTCAAGGCACCA
GGACAGTCACCGAATACATAGAAGAGTTCCACCGATTAAGTGCCAGAACGAACCTAAGTGAGAATGAGCAGCACCAAATAGCCAGATTTGTGGGGGGTCTGCGCTTTGAT
ATTAAAGAAAAAGTAAAGTTACAGCCTCTCCGTTTCTTATCTGAAGCAATTTCATTGGCAGAAACAGTAGAAGAAATGATAGCTCTCAAGGCCAAAACCATGAATCGAAG
AACAACATGGGAGCCAACACCAACCAAGAAGACAAGCTATACGAGCAAGACAAATGACCAGCCGATGGCACCAATTCATGGAAAAGGAAAAGAGGCTGACTCCCAGACTG
CAACGAACGAAAAGAAGGCAGAGATAATCAATAAAAGCAAGAACCAAAATAATTACACTCGCCCATCATTGGGTAAATGCTTTCGGTGTGGACAACCAGGCCACCTATCT
AATTCTTGCCCTCAAAGAAAAACAATTGCCTTAGCTGAGGAAGAAGGCAACTTGCCTGGCGAAGATGAGTCAGAACCAAGAGAGGAAACGGAAGAAATTGAGGTAGATGA
GGGAGACAGAATCTCCTGTGTTATCCATAAAGTACTCATTGCTCCCAAGGAAGAAAAGAGCCCACAACGCCACAGTCTTTTCAAAACAAGGTGCACTATTAATGGGAAGG
TATGCGACGTCATTATTGATGGAGGTAGCAATGAAAACTTCATAGCAAAGAAAATAGTCTCCAACCTGAACTTAAAGGTTGAGCCACATCCAAACCCCTACAAGATAGGA
TGGGTAAAGAAAGGGAATGAGTCCACAGTTAATGAGATATGTACGGTTCCCCTTTCTATCGGAAGTAGCTACAAGGACCAGATCATATGCGATGTAATCGACATGGATGT
ATGCCATGTCCTCCTAGGCAGACCGTGGCAACACGACACTCAAACCTTGCATAAGGGGAGAGAAAACACATACGAATTTCACTGGATGGGTAAAAAGGTAGCCCTGCTGC
CTTTGACCAAAAAAAATGAGGAGAACAGTAAGACAAGGGGTCAACTATTCACAACAGTCAGTGGCAAAACCCTACTGAAAGAAAGAAAGCAGGATATTTTAGCCCTTGTG
GTGACAGGCAGCACTAATGGAGAACAGGCTGGGGAGTTGGAACCACAATTACAACAACTTTTTGAGGAATTCCCACACCTCAAGAAGGAACCTGACGGACTGCCACCGCT
TCGTGACATCCAGCACCATATAGATCTAATTCCCGGGGCATCATTGCCAAACCTAGCTCATTACAGGATGACCCCTCAAGAGTATGCAGCACTCCATGAACATATTGAAG
ATCTACTTAAGAAAGGGCATATTAAGCCAAGCCTCAGCCCTTGTGCTGTCCCAGCTCTCCTCACCCCAAAAAAGGATGGAAGTTGGAGGATGTGTGTAGATAGCAGAGCC
ATTAACCGAATCACAGTGAAGTACAGATTCCCTATCCCAAGAGTTGGAGACCTCTTGGATCAACTCGGCAAGGCCACCATCTTTTCGAAGATTGACCCAAGAAGCGGATA
TCATCAAATACGTATCCGACCAGGTGATGAATGGAAGACTGCCTTCAAGACAAATGAAGGGCTGTTCGAATGGATGGTCATGCCCTTCGGGCTATCCAATGCTCCCAGTA
CCTTCATGAGGGTAATGAATCAGGTACTGCACCCTTTCCTCAACAAGTTCATTGTGGTTTATTTTGATGATATATTGGTGTACAGTAGTGGGAACGACGAACACTTGCTC
CACCTTAAAAAGTTGTTTCAAGTATTGACAGAAAAGGAGCTCTACATCAATCAAAAGAGGTGTGAATTTTTGAAGGCTGAAATTACATTTCTTGGTTTTATAATCAAGAA
AGGAAAGATAAGCATGGAGCCAAGAAAGGTTGAAGCAATACAGAATTGGTTGGTTCCAACCACTGTCAAAGAAGTACATGCCTTCCTAGGGCTGGCTTCTTTCTACAGAA
AATTTATAAGAAACTTCAGCTCCATCTGTGCACCACTGACCGACTGCTTGAAGAAGGGAAACTTTAAGTGGACCCCATCCCAACAGGAGAGCTTCGAAGAAATAAAAAAA
AGGTTAGCTTCTAGCCCTGTTCTACAATTACCAGATTTCTCTTCCCCTTTTGAAGCAGCAGTCGACGCCTGTGGCACGGGGATTAGGGCAGTCCTATCCCAACGAGGTCA
CCCAATTGAATACCTCAGTGAGAAGTTGAGCCCGACACGACAAACATGGAGCACGTACGAGCAAGAACTATATGCCTTAGTTCGAGCTCTCAAACAGTTCAATTTTGTTA
TCAAACACCAAGCTGGAAAAGAGAATAAGGTGGCTGATGCACTGAGCAGAAAAGGCTCCCTGCTTACACTCCTCTCCTCAGAAATAATTGCTTTCAAACACCTGCCAGAA
CTATACGAAAGGGATACTGACTTCGCAGACATCTGGCATAAATGCTCCAATTACCTAAGAGCTGAAGGTTATCACATCCTAGAGGGGTTTCTCTTCAAGGGAGACCAGTT
ATGCATACCACACACTTCCCTACGGGAAGCCTTAATAAAAGAAGCTCACTCTAACGGGTTAGCTGGACATTTTGGGCAAGATAAGATCTTTGAAACAGTCTCTATACGGT
ACTACTGGCCACAGTTAAGGAAAGACTCCAATAACTTTGTGAAGAGGTGTTCCATTTGCCAACGGGCCAAGGGCTCTCGAACTAATGCAGGGTTATACACCCCACTACCG
ATTCCACAGTCAATCTGGGAAGATCTCTCAATTGACTTTGTACTCGGGCTTCCTAAGACTCAAAGAAACCATGATTCAGTCATGGTGGTTGTTGACCGATTTAGCAAGAT
GGCTCACTTCATTGCTTGCAAGAAAACGAATGATGCTATATATATAGCTAACCTGTTCTTCAAGGAAGTCATCCGATTACATGGAATACCTAAAACCATAGTCTCTGATA
GGGATGTCAAATTCCTAAGCCATTTTTGGAAGACCCTGTGGAAAAATGGCTCTAAACCTAAACAATGGGATTTGTCCCTCGCACAAGCAGAATTTGCCTTCAATAACATG
AAGAACCGGTCGACTGACAAATGTCCCTTTCAAGTCGTATACACTAAACGACCTAGGTTAACATTTGACCTCGCATCACTCCCTATTACTGTAGAAAGTCATAAAGAAGC
AGAAACCATGGCAGAGAATATTGAAAAACTACATAAGGAAGTTCATGACCACCTCATCCAATCCACTGATTCTTATAAGAAAGCAGCAGACAAAAAGAGGAGACAAGCTG
TTTTCTCCAAAGGGGATTTAGTAATGGTACACCTAAGGAAGAACAGATTCCCCGCTGGAACGTATAACAAGTTGAAGGATAAACAAATCGGCCCATTTCGCATTATAGAA
AAATATGGAGATAATGCTTTTAAGGTCGAACTTCCCCCGGATATGCATATCCATTCAGTATTCAACATCGCAGACTTGAAGCCCTATTATGCTCCAGACGACTTCCAGCT
TGCCGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACACCCCTGAACAGAAAAAGGTACGTCTCGTGGCCTTGAAACTCAAAGGGGGCGCATCAGCATGGTGGGAGCAACTGGAAGCCAACAGGCAAAGATACAACAAACG
ACCCGTACGCTCATGGGAAAAGATGAAGAAACTATTAAAAGGACGATTCCTGCCTCTGAATTATGAGCAGACCTTATACAACCAGTATCAGAATTGTCGTCAAGGCACCA
GGACAGTCACCGAATACATAGAAGAGTTCCACCGATTAAGTGCCAGAACGAACCTAAGTGAGAATGAGCAGCACCAAATAGCCAGATTTGTGGGGGGTCTGCGCTTTGAT
ATTAAAGAAAAAGTAAAGTTACAGCCTCTCCGTTTCTTATCTGAAGCAATTTCATTGGCAGAAACAGTAGAAGAAATGATAGCTCTCAAGGCCAAAACCATGAATCGAAG
AACAACATGGGAGCCAACACCAACCAAGAAGACAAGCTATACGAGCAAGACAAATGACCAGCCGATGGCACCAATTCATGGAAAAGGAAAAGAGGCTGACTCCCAGACTG
CAACGAACGAAAAGAAGGCAGAGATAATCAATAAAAGCAAGAACCAAAATAATTACACTCGCCCATCATTGGGTAAATGCTTTCGGTGTGGACAACCAGGCCACCTATCT
AATTCTTGCCCTCAAAGAAAAACAATTGCCTTAGCTGAGGAAGAAGGCAACTTGCCTGGCGAAGATGAGTCAGAACCAAGAGAGGAAACGGAAGAAATTGAGGTAGATGA
GGGAGACAGAATCTCCTGTGTTATCCATAAAGTACTCATTGCTCCCAAGGAAGAAAAGAGCCCACAACGCCACAGTCTTTTCAAAACAAGGTGCACTATTAATGGGAAGG
TATGCGACGTCATTATTGATGGAGGTAGCAATGAAAACTTCATAGCAAAGAAAATAGTCTCCAACCTGAACTTAAAGGTTGAGCCACATCCAAACCCCTACAAGATAGGA
TGGGTAAAGAAAGGGAATGAGTCCACAGTTAATGAGATATGTACGGTTCCCCTTTCTATCGGAAGTAGCTACAAGGACCAGATCATATGCGATGTAATCGACATGGATGT
ATGCCATGTCCTCCTAGGCAGACCGTGGCAACACGACACTCAAACCTTGCATAAGGGGAGAGAAAACACATACGAATTTCACTGGATGGGTAAAAAGGTAGCCCTGCTGC
CTTTGACCAAAAAAAATGAGGAGAACAGTAAGACAAGGGGTCAACTATTCACAACAGTCAGTGGCAAAACCCTACTGAAAGAAAGAAAGCAGGATATTTTAGCCCTTGTG
GTGACAGGCAGCACTAATGGAGAACAGGCTGGGGAGTTGGAACCACAATTACAACAACTTTTTGAGGAATTCCCACACCTCAAGAAGGAACCTGACGGACTGCCACCGCT
TCGTGACATCCAGCACCATATAGATCTAATTCCCGGGGCATCATTGCCAAACCTAGCTCATTACAGGATGACCCCTCAAGAGTATGCAGCACTCCATGAACATATTGAAG
ATCTACTTAAGAAAGGGCATATTAAGCCAAGCCTCAGCCCTTGTGCTGTCCCAGCTCTCCTCACCCCAAAAAAGGATGGAAGTTGGAGGATGTGTGTAGATAGCAGAGCC
ATTAACCGAATCACAGTGAAGTACAGATTCCCTATCCCAAGAGTTGGAGACCTCTTGGATCAACTCGGCAAGGCCACCATCTTTTCGAAGATTGACCCAAGAAGCGGATA
TCATCAAATACGTATCCGACCAGGTGATGAATGGAAGACTGCCTTCAAGACAAATGAAGGGCTGTTCGAATGGATGGTCATGCCCTTCGGGCTATCCAATGCTCCCAGTA
CCTTCATGAGGGTAATGAATCAGGTACTGCACCCTTTCCTCAACAAGTTCATTGTGGTTTATTTTGATGATATATTGGTGTACAGTAGTGGGAACGACGAACACTTGCTC
CACCTTAAAAAGTTGTTTCAAGTATTGACAGAAAAGGAGCTCTACATCAATCAAAAGAGGTGTGAATTTTTGAAGGCTGAAATTACATTTCTTGGTTTTATAATCAAGAA
AGGAAAGATAAGCATGGAGCCAAGAAAGGTTGAAGCAATACAGAATTGGTTGGTTCCAACCACTGTCAAAGAAGTACATGCCTTCCTAGGGCTGGCTTCTTTCTACAGAA
AATTTATAAGAAACTTCAGCTCCATCTGTGCACCACTGACCGACTGCTTGAAGAAGGGAAACTTTAAGTGGACCCCATCCCAACAGGAGAGCTTCGAAGAAATAAAAAAA
AGGTTAGCTTCTAGCCCTGTTCTACAATTACCAGATTTCTCTTCCCCTTTTGAAGCAGCAGTCGACGCCTGTGGCACGGGGATTAGGGCAGTCCTATCCCAACGAGGTCA
CCCAATTGAATACCTCAGTGAGAAGTTGAGCCCGACACGACAAACATGGAGCACGTACGAGCAAGAACTATATGCCTTAGTTCGAGCTCTCAAACAGTTCAATTTTGTTA
TCAAACACCAAGCTGGAAAAGAGAATAAGGTGGCTGATGCACTGAGCAGAAAAGGCTCCCTGCTTACACTCCTCTCCTCAGAAATAATTGCTTTCAAACACCTGCCAGAA
CTATACGAAAGGGATACTGACTTCGCAGACATCTGGCATAAATGCTCCAATTACCTAAGAGCTGAAGGTTATCACATCCTAGAGGGGTTTCTCTTCAAGGGAGACCAGTT
ATGCATACCACACACTTCCCTACGGGAAGCCTTAATAAAAGAAGCTCACTCTAACGGGTTAGCTGGACATTTTGGGCAAGATAAGATCTTTGAAACAGTCTCTATACGGT
ACTACTGGCCACAGTTAAGGAAAGACTCCAATAACTTTGTGAAGAGGTGTTCCATTTGCCAACGGGCCAAGGGCTCTCGAACTAATGCAGGGTTATACACCCCACTACCG
ATTCCACAGTCAATCTGGGAAGATCTCTCAATTGACTTTGTACTCGGGCTTCCTAAGACTCAAAGAAACCATGATTCAGTCATGGTGGTTGTTGACCGATTTAGCAAGAT
GGCTCACTTCATTGCTTGCAAGAAAACGAATGATGCTATATATATAGCTAACCTGTTCTTCAAGGAAGTCATCCGATTACATGGAATACCTAAAACCATAGTCTCTGATA
GGGATGTCAAATTCCTAAGCCATTTTTGGAAGACCCTGTGGAAAAATGGCTCTAAACCTAAACAATGGGATTTGTCCCTCGCACAAGCAGAATTTGCCTTCAATAACATG
AAGAACCGGTCGACTGACAAATGTCCCTTTCAAGTCGTATACACTAAACGACCTAGGTTAACATTTGACCTCGCATCACTCCCTATTACTGTAGAAAGTCATAAAGAAGC
AGAAACCATGGCAGAGAATATTGAAAAACTACATAAGGAAGTTCATGACCACCTCATCCAATCCACTGATTCTTATAAGAAAGCAGCAGACAAAAAGAGGAGACAAGCTG
TTTTCTCCAAAGGGGATTTAGTAATGGTACACCTAAGGAAGAACAGATTCCCCGCTGGAACGTATAACAAGTTGAAGGATAAACAAATCGGCCCATTTCGCATTATAGAA
AAATATGGAGATAATGCTTTTAAGGTCGAACTTCCCCCGGATATGCATATCCATTCAGTATTCAACATCGCAGACTTGAAGCCCTATTATGCTCCAGACGACTTCCAGCT
TGCCGACTAG
Protein sequenceShow/hide protein sequence
MDTPEQKKVRLVALKLKGGASAWWEQLEANRQRYNKRPVRSWEKMKKLLKGRFLPLNYEQTLYNQYQNCRQGTRTVTEYIEEFHRLSARTNLSENEQHQIARFVGGLRFD
IKEKVKLQPLRFLSEAISLAETVEEMIALKAKTMNRRTTWEPTPTKKTSYTSKTNDQPMAPIHGKGKEADSQTATNEKKAEIINKSKNQNNYTRPSLGKCFRCGQPGHLS
NSCPQRKTIALAEEEGNLPGEDESEPREETEEIEVDEGDRISCVIHKVLIAPKEEKSPQRHSLFKTRCTINGKVCDVIIDGGSNENFIAKKIVSNLNLKVEPHPNPYKIG
WVKKGNESTVNEICTVPLSIGSSYKDQIICDVIDMDVCHVLLGRPWQHDTQTLHKGRENTYEFHWMGKKVALLPLTKKNEENSKTRGQLFTTVSGKTLLKERKQDILALV
VTGSTNGEQAGELEPQLQQLFEEFPHLKKEPDGLPPLRDIQHHIDLIPGASLPNLAHYRMTPQEYAALHEHIEDLLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRA
INRITVKYRFPIPRVGDLLDQLGKATIFSKIDPRSGYHQIRIRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRVMNQVLHPFLNKFIVVYFDDILVYSSGNDEHLL
HLKKLFQVLTEKELYINQKRCEFLKAEITFLGFIIKKGKISMEPRKVEAIQNWLVPTTVKEVHAFLGLASFYRKFIRNFSSICAPLTDCLKKGNFKWTPSQQESFEEIKK
RLASSPVLQLPDFSSPFEAAVDACGTGIRAVLSQRGHPIEYLSEKLSPTRQTWSTYEQELYALVRALKQFNFVIKHQAGKENKVADALSRKGSLLTLLSSEIIAFKHLPE
LYERDTDFADIWHKCSNYLRAEGYHILEGFLFKGDQLCIPHTSLREALIKEAHSNGLAGHFGQDKIFETVSIRYYWPQLRKDSNNFVKRCSICQRAKGSRTNAGLYTPLP
IPQSIWEDLSIDFVLGLPKTQRNHDSVMVVVDRFSKMAHFIACKKTNDAIYIANLFFKEVIRLHGIPKTIVSDRDVKFLSHFWKTLWKNGSKPKQWDLSLAQAEFAFNNM
KNRSTDKCPFQVVYTKRPRLTFDLASLPITVESHKEAETMAENIEKLHKEVHDHLIQSTDSYKKAADKKRRQAVFSKGDLVMVHLRKNRFPAGTYNKLKDKQIGPFRIIE
KYGDNAFKVELPPDMHIHSVFNIADLKPYYAPDDFQLAD