CuGenDBv2

Cucsat.G5089 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G5089
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: RNA-directed DNA polymerase
Genome location: ctg1227:3443413..3481482
RNA-Seq Expression: Cucsat.G5089
Synteny: Cucsat.G5089
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0036092 - phosphatidylinositol-3-phosphate biosynthetic process (biological process)
GO:0046856 - phosphatidylinositol dephosphorylation (biological process)
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0005774 - vacuolar membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016301 - kinase activity (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
GO:0043813 - phosphatidylinositol-3,5-bisphosphate 5-phosphatase activity (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domains: NA
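The genome location above follows a contig:start..end pattern. As a minimal sketch for working with it programmatically, the Python snippet below splits the string into its parts and computes the span length. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced here for illustration, and the snippet assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not stated on the page).
        return self.end - self.start + 1


# seqid, a colon, then start and end separated by two dots
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:\s]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'ctg1227:3443413..3481482'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(match["seqid"], start, end)


loc = parse_location("ctg1227:3443413..3481482")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 38070 bp on contig ctg1227.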


Homology
GenBank top hits | e value | %identity | Alignment
KAG7588770.1 Integrase catalytic core [Arabidopsis suecica] | 0.0 | 52.73
Query:  EKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKNLN
        +++   + L  +YEQ LY  Y +C QG+RTV +Y  EF R S R++L+E E  +VAR++ GL+  ++ K+ LQ    + EA S A   E M        N
Subjt:  EKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKNLN

Query:  -RRSAWETNSTKSKTNDQPSTSTKAKGKEIDNQEVAVERKKEQTFKPSGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAEEGGQTSEDSIEAEEE-
         RR + ++N       D+   +  +              +K++   P     Y+RP+L KCFRC  T H SN CP RKT+A+ EE     +D +E EE  
Subjt:  -RRSAWETNSTKSKTNDQPSTSTKAKGKEIDNQEVAVERKKEQTFKPSGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAEEGGQTSEDSIEAEEE-

Query:  -TELIEADDGERVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKGGEATVSEICTVPL
          E  E +  +++S V+QRLL++ KEE   QR  LF+T C+I  +VC++I+D+GSSEN V++KLV  L +    H  PY +GWV+KG +  VSE C +PL
Subjt:  -TELIEADDGERVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKGGEATVSEICTVPL

Query:  SIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFITVS--GKNMLKE-REQDILGLVVI---
        SIG  YK++I+CDV++MDVCH+LLGRPWQYD    ++GR+N   F W   K+ +  + K  +   + +K  F+ +S   K + K  +E     LVV+   
Subjt:  SIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFITVS--GKNMLKE-REQDILGLVVI---

Query:  ------EKTKEKHVEDIEPKLQQLLHEFPHIKEEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTP
              E T  + V +I    ++L+ E     E P  LPP+RDIQHHIDLIPG+SLPNL HYRMSP+E +I+   IE+LLKKG I+ S+SPCAVP LL P
Subjt:  ------EKTKEKHVEDIEPKLQQLLHEFPHIKEEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTP

Query:  KKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPF
        KK   WRMCVDSRAIN+IT+KYRFPIPR+ D+LD+L    VFSKIDL+SGYHQIR+R GDEWKTAFK+ +GL+EW+VMPFGLSNAPSTFMRLMNQVL PF
Subjt:  KKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPF

Query:  LNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFI
        +  F+VVYFDDIL+YS   D+HL H+R++ QVL E +LY+N KK  F   ++ FLGF++ +  I ++  K+ AI  WP P +  E+++F GLA+FYR+F+
Subjt:  LNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFI

Query:  RNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRA
        R+FS++ AP+T+CLKKG F W P Q ESF  IK+KL ++P+L LPDF   F+V  DA   GIGAVL Q+  PI +FSEKLS +RQ WSTY+QE YA+ RA
Subjt:  RNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRA

Query:  LKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKC
        L+QWEHYL+ +EF+L TDH +LK+L +QK I++MHARW+SFLQ+F F+I+H+SG  NKVADALSR+ SLLT L+ EI+ F+ L +LYE D +FK++W KC
Subjt:  LKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKC

Query:  SNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSI
        +    + D+HI EGYLFKG++LCIP +SLRE LI+E H GGL+GH G++KT+    +RYYWP +RKD+   V+RC +CQ +KG S N GLY PL +P  I
Subjt:  SNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSI

Query:  WEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTD
        W+DLS+DFV+GLP+TQR  DS+ V+VD+FSKMTHF+AC+KT DA  IA LFF+EVVRLHGVPKSIVSDRD KFLSHFW TLW+ F T+LK S+TAHPQ+D
Subjt:  WEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTD

Query:  GQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHDHLIQTTDS
        GQTEVTNRTLGN++R + G KPKQWDLAL Q EFA+N+  + +TGKSPF +VYT +P+   DL  LP    ++  A+ MA++I    + V   L  T   
Subjt:  GQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHDHLIQTTDS

Query:  YKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHA
         K+AADKK+R   F +GD VMV LKK RFP GTY KL+ R+ GPF IL+K  DNA+ +DLP D+ I   FNVAD+  YHA
Subjt:  YKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHA

PKU71894.1 RNA-directed DNA polymerase [Dendrobium catenatum] | 0.0 | 51.12
Query:  EKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKNLN
        +++  G  L  +YEQ LY +YQ+C QG+R+V +Y EEF+RLSAR NL E+E   VAR+VGGL+  I++K+ L     LS+A++FA   E  +   S++ +
Subjt:  EKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKNLN

Query:  RRSAW--ETNSTKSKTNDQP-----STSTKAKGKEIDNQEVAVERKKEQTFKPSGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAE----EGGQTS
        +R +   E NS  SK + QP     + S  +     +N+  A + K      P  +N YSRP+  KCFRC Q  H SN CP R+ I + +    E G  +
Subjt:  RRSAW--ETNSTKSKTNDQP-----STSTKAKGKEIDNQEVAVERKKEQTFKPSGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAE----EGGQTS

Query:  EDSIEAEEETELIEADDGERVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKGGEATV
        +   + E+  E ++ D+GE + C++++LL+ P++ +  QR+ +F+T+CTI G+VC+++IDSG +EN +++ +V  L LK    P PYKI WV++G E TV
Subjt:  EDSIEAEEETELIEADDGERVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKGGEATV

Query:  SEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFIT--------VSGKNMLKER
        +E C V  S+G  Y  +++CDV+EMDVCHL+LGRPWQ+DTQ++H  R N Y F W G+K+ LLP T    +   G  +  ++        VSG  +L+E 
Subjt:  SEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFIT--------VSGKNMLKER

Query:  EQDI--LGLVVIEKTKEKHVEDIEPKLQQLLHEFPHI--KEEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPC
        +  +  L LV ++++   + + +   +QQLL EF  I   E P  LPPLR+IQH IDLIPGA+LPNL +YRMSP+E+ IL + +++LLK+  I+ SLSPC
Subjt:  EQDI--LGLVVIEKTKEKHVEDIEPKLQQLLHEFPHI--KEEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPC

Query:  AVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRL
        AVPALL PKKDG WRMC+DSRAIN+IT K+RFP+PR+SDLLD+L   ++FSK+DL+SGYHQ+R+RPGDEWK+AFKT EGLFEW VMPFGL NAPSTFMRL
Subjt:  AVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRL

Query:  MNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGL
        M++VL PF  KF V YFDDILVYS + ++H+LHL +LFQ L  ++LY+N  K  F   ++ FLGF++ +  I ++P+K+ A+  WP P S+ +I++F GL
Subjt:  MNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGL

Query:  ASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQ
        A+FYR+FIR FS + AP+TD LK  +F W+  QQ+SFE+IKK L+S+PIL LP+F  PF+V  DA   GIGAVL Q+  P+EYFSEKLSTSRQ W+ YEQ
Subjt:  ASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQ

Query:  ELYALVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTD
        ELYA+VRALKQWEHYL+ ++FVL +DH SL+YL +QK I+RMHARW+ FLQRF FVI+H++GK N+VADALSR+ +LL  L +E+   + +  LY+ D D
Subjt:  ELYALVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTD

Query:  FKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYS
        F   W  C+      D+ +  G+LFKG  LC+P +S R  LI+E H  GLA H G++KT++    R++WP +++D    ++RC  CQ  KG++ N GLY 
Subjt:  FKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYS

Query:  PLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFS
        PLP+P SIWEDLS+DFV+GLP+T+R  DSIMV+VDRFSKM HF+ CKKT DA+ IA LFFKE+VRLHG+P+S+ SDRDVKF+SHFWR LWKKF T LK S
Subjt:  PLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFS

Query:  TTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHD
        +T HPQTDGQTEV NRTL  +LRCL    PK W+  L QAEFAFN+M NRSTG+ PF VVYTK+P    DL  LP     +  A   A    ++ KEV +
Subjt:  TTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHD

Query:  HLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPDRFR
         + +T   YK   D+ RR   F  G+LVM+  ++ RFP+G   KL  ++ GPFP+L K  DNA+ IDLP+D+     FNVAD+ PYH PD  +
Subjt:  HLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPDRFR

PKU85169.1 RNA-directed DNA polymerase [Dendrobium catenatum] | 0.0 | 52.47
Query:  EKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKNLN
        +++     L  +YEQ LY +YQ C QG++TV+EY EEF+RLSAR NL E+E   VAR+  GLR  I++K+ L     LS+AI+FA   E  ++ + ++ N
Subjt:  EKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKNLN

Query:  RRSAWETNSTKSKTNDQPSTSTKAKGKEIDNQEVAVERKKEQTFKP---SGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAE-EGGQTSE-DSIEA
         R          K   Q S S +   +   N   +  +  +   KP   +  N YSRPS  KCFRC Q  H SN CP R  I + E E  +T E  ++E 
Subjt:  RRSAWETNSTKSKTNDQPSTSTKAKGKEIDNQEVAVERKKEQTFKP---SGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAE-EGGQTSE-DSIEA

Query:  EEETELIEADDGERVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKGGEATVSEICTV
        E E E + AD+G++V CV++RLL+ P++    QR+ +F+TRCTING+VCD++ID+G +EN V+K LV VL LK   +P+PYKI WV+KG E  V+E+C +
Subjt:  EEETELIEADDGERVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKGGEATVSEICTV

Query:  PLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFITVSGKNMLKERE-QDILGLVVIEKT
          S+G +Y  +++CDV+EMDVCH++LGRPWQ+DT  ++ GR NTY F W GRK+ LLP T   N     +K  F  V+G  ++ +R+ +++  +VV +  
Subjt:  PLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFITVSGKNMLKERE-QDILGLVVIEKT

Query:  KEKHVEDIEPKLQQLLHEFPHIK--EEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWR
           +V      + +LL +F  I   E P GLP  R IQH IDLIPGA+LPNL HY+MSP+E+ IL + +EELL+K  I+PSLSPCAVPALL PKKD  WR
Subjt:  KEKHVEDIEPKLQQLLHEFPHIK--EEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWR

Query:  MCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVV
        MC+DSRAIN+IT K+RFP+PR+ DLLD+L   S FSK+DL+SGYHQIR+RPGDEWKTAFKT+ GL+EW VMPFGL NAP+TFMRLMN+VL  F+N F VV
Subjt:  MCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVV

Query:  YFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLA
        YFDDILVYST+ ++H  HL  +FQ L   +L++N  K  F    + FLGFII    I+ +P+K+ AI  WP P S+ ++++F GLA+FYR+FIR FS L 
Subjt:  YFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLA

Query:  APLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHY
        APLTDCLK   F W   +Q S+E IK+ L+S+P+L LP+F  PF+V  DA   GIGAVL Q   PIE+FSEKL+  RQ W+ YEQELYA++RALK WEHY
Subjt:  APLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHY

Query:  LISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDAD
        L+ K+FVL +DH +L+++  QK ++RMHARW+ FLQ+F FV+KH+SG +N+VADALSR+ +LLT L +EI     L DLY  D DF+ IW  CS      
Subjt:  LISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDAD

Query:  DYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSID
        +Y +  GYLFKG  LCIP +S R  LI EAHSGGLA H G++KT +   ++++WP++ +D    V+RC +CQ  KG+  N GLY+PLP+P +IWED+SID
Subjt:  DYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSID

Query:  FVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTN
        FV+GLP+T+R  DSIMV+VDR SKM HFVACKKT DA+ +A LFF E+VRLHG+P+SI SDRDVKF+SHFWR LWK+  T +  S+  HPQ+DGQTEV N
Subjt:  FVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTN

Query:  RTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHDHLIQTTDSYKKAADK
        RTLGN+LRCL  S PKQW+  L QAEFA+N+M NRSTGKSPF +VYTK P   FD+  LP   D    A  + E    + ++V + LI +  +YK+AAD 
Subjt:  RTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHDHLIQTTDSYKKAADK

Query:  KRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPD
         RR   FN GDLVMV ++K RFP GTY+KL  R++GP PI ++  DNA+ ++LP +I+    FNVAD+  YH PD
Subjt:  KRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPD

PKU87230.1 RNA-directed DNA polymerase [Dendrobium catenatum] | 0.0 | 50.51
Query:  EKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKNLN
        +++     L  +YEQ LY +YQ+C QG R+V++Y EEF+RLSAR NL+E +   VAR++GGL+  I++K+ L     LS+A++FA   E  +   SK+  
Subjt:  EKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKNLN

Query:  RRSAWETNSTKSKTNDQPSTSTKAKGKEIDNQEVAV---ERKKEQTFKPSGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAEEGGQTSEDSIEAEE
         R  +   S++S+    P  S+      + +   +     R  +    P  +N Y++P+  KCFRC Q  H SN CP R  I +AE  G+  +D    ++
Subjt:  RRSAWETNSTKSKTNDQPSTSTKAKGKEIDNQEVAV---ERKKEQTFKPSGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAEEGGQTSEDSIEAEE

Query:  ETELI----EADDGERVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKGGEATVSEIC
         +E I     AD+GE V C+++RLL+ P++    QR+ +F+TRCTING+VC+++ID+G +EN V++ LV  L LK   +P PYKI WV++G +  V+E+C
Subjt:  ETELI----EADDGERVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKGGEATVSEIC

Query:  TVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFITVSGKNMLKER-EQDILGLVVIE
         +  +IG  Y  +++CDVI+MDVCHL+LGRPWQYD+ +++  R+NTY F+W G+K+ LLP     ++    +K +   +    +  E+  + +  +V++E
Subjt:  TVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFITVSGKNMLKER-EQDILGLVVIE

Query:  KTKEKHVEDIEPKLQQLLHEFPHIK--EEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGS
        +  +  ++D  P++  L+ +F  +   E P  LPPLR IQH IDL+PG++LPNL HYRMSP+E++IL + ++EL++K  I+PSLSPCAVPALL PKKD  
Subjt:  KTKEKHVEDIEPKLQQLLHEFPHIK--EEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGS

Query:  WRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFI
        WRMC+DSRAIN+IT K+RFP+PRI DLLD+L     FSK+DL+SGYHQIR+RPGDEWKTAFKT +GLFEW VMPFGL NAP+TFMRLMN+VL  ++N+F 
Subjt:  WRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFI

Query:  VVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSS
        VVYFDDIL+YS    EHL HL ++ QVL E +LY+N  K      ++ FLGFI+    +  +P K+ A+  WPTP S+ ++++F GLA+FYR+FIR FS 
Subjt:  VVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSS

Query:  LAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWE
        + AP+TDCLK   F+WT   Q S++ IK+ L+S+P+L LP+F  PF+V  DA   GIGAVL Q   PIE+FSEKLS +RQ W+ YEQELYA++RALK WE
Subjt:  LAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWE

Query:  HYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLD
        HYL+ ++FVL +DH +L+++ +QK+ISRMHARW+ FLQ+F FV+KH+SG +N+VADALSR+ +L+T L +E+     L +LY  D DF  IW +CS    
Subjt:  HYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLD

Query:  ADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLS
        ADDY +  GYLFK   LCIP +S R+ +IK+AH+GGLA H G+ KT+     R++WP++ +D + FV+RC +CQ  KG+  NAGLY+PLP+P SIWED+S
Subjt:  ADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLS

Query:  IDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEV
        IDF++GLP+TQ+  DSIMV+VDRFSKM HFVACKKT DA++IA LFF E+VRLHG+P+SI SDRDVKF+SHFWR LWK+  T +  S+  HPQ+DGQTEV
Subjt:  IDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEV

Query:  TNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHDHLIQTTDSYKKAA
         NRTLGN+LRCL   +PKQW+ AL QAEFA+N+M NRS GKSPF +VYTK P   FD+  LP     N  A  + +N + +  EV   LI +   YK+AA
Subjt:  TNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHDHLIQTTDSYKKAA

Query:  DKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPD
        D +RR   F  G+LVM+ L+K RFP G  +KL  R+ GP PIL++  DNA+ +DLP        FNV D+ PYH PD
Subjt:  DKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPD

PWA81295.1 transposon Ty3-I Gag-Pol polyprotein [Artemisia annua] | 0.0 | 51.87
Query:  EKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKNLN
        +++ +G+ L P+ EQ LY QY NC QG RTVAEY  EF RL AR NL E ++   AR+V GL   I+EK+ L     + +A + A   E M         
Subjt:  EKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKNLN

Query:  RRSAWETNSTKSKTNDQ----PSTST------KAKGKEIDNQEVAVERKKEQTFKPSGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAEEG--GQT
        R +   T+S  +K N      PST+T      KA G  +D         K +  +P   N Y++P   KCFRCG+  H SN CP+R T+  +E G  G  
Subjt:  RRSAWETNSTKSKTNDQ----PSTST------KAKGKEIDNQEVAVERKKEQTFKPSGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAEEG--GQT

Query:  SEDSIEAEEETELIEADDGE--RVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKGGE
         ++S + E++ E  E  DGE  +++CVIQR L +PK   + QR+ +F+T+C +  ++C +IID GS EN V+K LV    L  E HPNPY+IGW++KG  
Subjt:  SEDSIEAEEETELIEADDGE--RVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKGGE

Query:  ATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFITV--SGKNMLKEREQD
          V+EIC VPL+IG  Y + + CDV++M+ CH+LLGRPWQ+D  + H+G+ N Y F+W G+ + +LP+    + G + + +  +T+  S K    ER++ 
Subjt:  ATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFITV--SGKNMLKEREQD

Query:  ILGLVVIEKTKEKHVEDIEPK-LQQLLHEFPHI--KEEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPA
         +   ++ K  E  ++++ P+ ++ +L EF  +   + P  LPPLR+IQH IDL+PGASLPNL HYRMSP+E  IL + +EELL+KGHI+ S+SPCAVPA
Subjt:  ILGLVVIEKTKEKHVEDIEPK-LQQLLHEFPHI--KEEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPA

Query:  LLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQV
        LLTPKKDGSWRMCVDSRAIN+ITV+YRFPIPR+ DLLDQL    +FSKIDL+SGYHQIR++PGDEWKTAFKT +GL+EW+VMPFGLSNAPSTFMRLM QV
Subjt:  LLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQV

Query:  LHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFY
        L PF+ KF+VVYFDDILVYS    EHL HLRK+ + LTE EL++N KK  F+  ++ FLG+I+    I ++  K++A+  WP+P ++ E+++F GLA+FY
Subjt:  LHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFY

Query:  RKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYA
        R+F+RNFSS+ AP+T+C+KKG FKWT   +ESF+ IK++LT++P+L LP+F + FE+  DAC TGIGAVL Q+G P+ + SEKL+ +RQ WSTYEQELYA
Subjt:  RKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYA

Query:  LVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDI
        +V+A+K+WEHYLI +EFV+ +DH +LKY Q Q++++++HARW SFL++F++VIKH+SG  NKVADALSRK +LL  +S++++ F+ +  LYE D DF+  
Subjt:  LVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDI

Query:  WYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPI
        W +        ++ +++GYLFKG +LCIP TSLR  LIKE H+GGL+ H G++KT+     R+YWPQ+++D  +FV+RC +CQ  KG + N GLY PLP+
Subjt:  WYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPI

Query:  PTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAH
        P S W D+S+DFV+GLP+TQR  DS+ V+VDRFSKM HF+ CKKT+DA +IA LFF+EVVRLHGVPKSI SDRD KFL+HFW TLW++  T+L FS+TAH
Subjt:  PTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAH

Query:  PQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHDHLIQ
        PQTDGQTEV NRTLGN++RCL G KPK WD++LAQAEFA+N+  + STG SPF+VVY   PR   DL  LP     N++A  M E ++  H+ V   + +
Subjt:  PQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHDHLIQ

Query:  TTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPD
        +   YK AADK RR   F  GD VMV L+K RFP GTY+KL+ ++ GP+ IL K  DNA+ +DLP  + I   FNV+D+  +H  D
Subjt:  TTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPD
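In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, "+" for a conservative substitution, and a space for a mismatch or gap. The Python sketch below shows one way to derive a percent identity from such a midline, using the usual BLAST convention of identical columns divided by total alignment columns. Whether the %identity column on this page was computed exactly this way is an assumption, and `percent_identity` is a hypothetical helper name. The input strings are a toy example, not data from this page, and must have any "Query:" label and leading indentation removed first.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline shows an identical residue.

    Both strings must cover the same alignment columns (labels and
    indentation already stripped), so they must have equal length.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("alignment is empty")
    # A letter in the midline means the two residues are identical;
    # '+' (conservative substitution) and ' ' (mismatch/gap) do not count.
    identical = sum(1 for mark in midline if mark.isalpha())
    return 100.0 * identical / len(query)


# Toy 9-column alignment (illustrative only)
toy_query = "MKT-AYIAK"
toy_midline = "MK  A+IA "
print(round(percent_identity(toy_query, toy_midline), 2))  # 55.56
```

For a multi-block alignment like those listed here, the blocks would be concatenated before calling the function so that the ratio is taken over the full alignment length.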

TrEMBL top hits | e value | %identity | Alignment
A0A2I0W8A8 RNA-directed DNA polymerase | 0.0 | 51.12
Query:  EKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKNLN
        +++  G  L  +YEQ LY +YQ+C QG+R+V +Y EEF+RLSAR NL E+E   VAR+VGGL+  I++K+ L     LS+A++FA   E  +   S++ +
Subjt:  EKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKNLN

Query:  RRSAW--ETNSTKSKTNDQP-----STSTKAKGKEIDNQEVAVERKKEQTFKPSGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAE----EGGQTS
        +R +   E NS  SK + QP     + S  +     +N+  A + K      P  +N YSRP+  KCFRC Q  H SN CP R+ I + +    E G  +
Subjt:  RRSAW--ETNSTKSKTNDQP-----STSTKAKGKEIDNQEVAVERKKEQTFKPSGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAE----EGGQTS

Query:  EDSIEAEEETELIEADDGERVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKGGEATV
        +   + E+  E ++ D+GE + C++++LL+ P++ +  QR+ +F+T+CTI G+VC+++IDSG +EN +++ +V  L LK    P PYKI WV++G E TV
Subjt:  EDSIEAEEETELIEADDGERVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKGGEATV

Query:  SEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFIT--------VSGKNMLKER
        +E C V  S+G  Y  +++CDV+EMDVCHL+LGRPWQ+DTQ++H  R N Y F W G+K+ LLP T    +   G  +  ++        VSG  +L+E 
Subjt:  SEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFIT--------VSGKNMLKER

Query:  EQDI--LGLVVIEKTKEKHVEDIEPKLQQLLHEFPHI--KEEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPC
        +  +  L LV ++++   + + +   +QQLL EF  I   E P  LPPLR+IQH IDLIPGA+LPNL +YRMSP+E+ IL + +++LLK+  I+ SLSPC
Subjt:  EQDI--LGLVVIEKTKEKHVEDIEPKLQQLLHEFPHI--KEEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPC

Query:  AVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRL
        AVPALL PKKDG WRMC+DSRAIN+IT K+RFP+PR+SDLLD+L   ++FSK+DL+SGYHQ+R+RPGDEWK+AFKT EGLFEW VMPFGL NAPSTFMRL
Subjt:  AVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRL

Query:  MNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGL
        M++VL PF  KF V YFDDILVYS + ++H+LHL +LFQ L  ++LY+N  K  F   ++ FLGF++ +  I ++P+K+ A+  WP P S+ +I++F GL
Subjt:  MNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGL

Query:  ASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQ
        A+FYR+FIR FS + AP+TD LK  +F W+  QQ+SFE+IKK L+S+PIL LP+F  PF+V  DA   GIGAVL Q+  P+EYFSEKLSTSRQ W+ YEQ
Subjt:  ASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQ

Query:  ELYALVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTD
        ELYA+VRALKQWEHYL+ ++FVL +DH SL+YL +QK I+RMHARW+ FLQRF FVI+H++GK N+VADALSR+ +LL  L +E+   + +  LY+ D D
Subjt:  ELYALVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTD

Query:  FKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYS
        F   W  C+      D+ +  G+LFKG  LC+P +S R  LI+E H  GLA H G++KT++    R++WP +++D    ++RC  CQ  KG++ N GLY 
Subjt:  FKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYS

Query:  PLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFS
        PLP+P SIWEDLS+DFV+GLP+T+R  DSIMV+VDRFSKM HF+ CKKT DA+ IA LFFKE+VRLHG+P+S+ SDRDVKF+SHFWR LWKKF T LK S
Subjt:  PLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFS

Query:  TTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHD
        +T HPQTDGQTEV NRTL  +LRCL    PK W+  L QAEFAFN+M NRSTG+ PF VVYTK+P    DL  LP     +  A   A    ++ KEV +
Subjt:  TTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHD

Query:  HLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPDRFR
         + +T   YK   D+ RR   F  G+LVM+  ++ RFP+G   KL  ++ GPFP+L K  DNA+ IDLP+D+     FNVAD+ PYH PD  +
Subjt:  HLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPDRFR

A0A2U1P6A2 Transposon Ty3-I Gag-Pol polyprotein | 0.0 | 51.87
Query:  EKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKNLN
        +++ +G+ L P+ EQ LY QY NC QG RTVAEY  EF RL AR NL E ++   AR+V GL   I+EK+ L     + +A + A   E M         
Subjt:  EKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKNLN

Query:  RRSAWETNSTKSKTNDQ----PSTST------KAKGKEIDNQEVAVERKKEQTFKPSGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAEEG--GQT
        R +   T+S  +K N      PST+T      KA G  +D         K +  +P   N Y++P   KCFRCG+  H SN CP+R T+  +E G  G  
Subjt:  RRSAWETNSTKSKTNDQ----PSTST------KAKGKEIDNQEVAVERKKEQTFKPSGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAEEG--GQT

Query:  SEDSIEAEEETELIEADDGE--RVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKGGE
         ++S + E++ E  E  DGE  +++CVIQR L +PK   + QR+ +F+T+C +  ++C +IID GS EN V+K LV    L  E HPNPY+IGW++KG  
Subjt:  SEDSIEAEEETELIEADDGE--RVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKGGE

Query:  ATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFITV--SGKNMLKEREQD
          V+EIC VPL+IG  Y + + CDV++M+ CH+LLGRPWQ+D  + H+G+ N Y F+W G+ + +LP+    + G + + +  +T+  S K    ER++ 
Subjt:  ATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFITV--SGKNMLKEREQD

Query:  ILGLVVIEKTKEKHVEDIEPK-LQQLLHEFPHI--KEEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPA
         +   ++ K  E  ++++ P+ ++ +L EF  +   + P  LPPLR+IQH IDL+PGASLPNL HYRMSP+E  IL + +EELL+KGHI+ S+SPCAVPA
Subjt:  ILGLVVIEKTKEKHVEDIEPK-LQQLLHEFPHI--KEEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPA

Query:  LLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQV
        LLTPKKDGSWRMCVDSRAIN+ITV+YRFPIPR+ DLLDQL    +FSKIDL+SGYHQIR++PGDEWKTAFKT +GL+EW+VMPFGLSNAPSTFMRLM QV
Subjt:  LLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQV

Query:  LHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFY
        L PF+ KF+VVYFDDILVYS    EHL HLRK+ + LTE EL++N KK  F+  ++ FLG+I+    I ++  K++A+  WP+P ++ E+++F GLA+FY
Subjt:  LHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFY

Query:  RKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYA
        R+F+RNFSS+ AP+T+C+KKG FKWT   +ESF+ IK++LT++P+L LP+F + FE+  DAC TGIGAVL Q+G P+ + SEKL+ +RQ WSTYEQELYA
Subjt:  RKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQELYA

Query:  LVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDI
        +V+A+K+WEHYLI +EFV+ +DH +LKY Q Q++++++HARW SFL++F++VIKH+SG  NKVADALSRK +LL  +S++++ F+ +  LYE D DF+  
Subjt:  LVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDI

Query:  WYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPI
        W +        ++ +++GYLFKG +LCIP TSLR  LIKE H+GGL+ H G++KT+     R+YWPQ+++D  +FV+RC +CQ  KG + N GLY PLP+
Subjt:  WYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPI

Query:  PTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAH
        P S W D+S+DFV+GLP+TQR  DS+ V+VDRFSKM HF+ CKKT+DA +IA LFF+EVVRLHGVPKSI SDRD KFL+HFW TLW++  T+L FS+TAH
Subjt:  PTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAH

Query:  PQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHDHLIQ
        PQTDGQTEV NRTLGN++RCL G KPK WD++LAQAEFA+N+  + STG SPF+VVY   PR   DL  LP     N++A  M E ++  H+ V   + +
Subjt:  PQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHDHLIQ

Query:  TTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPD
        +   YK AADK RR   F  GD VMV L+K RFP GTY+KL+ ++ GP+ IL K  DNA+ +DLP  + I   FNV+D+  +H  D
Subjt:  TTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPD

A0A5B7BER3 Uncharacterized protein | 0.0 | 56.21
Query:  KDEKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIA---IR
        K  ++   + L  +YEQ LY QYQNCRQG R+V+EY +EF+ LS+R NL+E E  QVAR+VGGLR  I++++ L+    L+EA S A  VE   +   +R
Subjt:  KDEKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIA---IR

Query:  SKNLNRRSAWETNSTKSKTNDQPSTSTKAKGKEIDNQEVAVERKKEQT-FKPSGQ--NSYSRPSLGKCFRCGQTAHLSNNCPQRKTI---AIAEEGGQTS
        S+N  R  ++  +S   +  D+       + ++I  ++ A   K + T   PS +  N Y+RP  GKCFRC Q  H SN CP R+ +    + E+     
Subjt:  SKNLNRRSAWETNSTKSKTNDQPSTSTKAKGKEIDNQEVAVERKKEQT-FKPSGQ--NSYSRPSLGKCFRCGQTAHLSNNCPQRKTI---AIAEEGGQTS

Query:  EDSIEAEEE-----TELIEADDGERVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKG
        E+  EAE +      E+ E D+GE VSCV+QRLL+ PK+E + QRH +F+TRCTIN +VCDVIIDSGSSEN V+K LV  L LK E HPNPYKIGW++KG
Subjt:  EDSIEAEEE-----TELIEADDGERVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKG

Query:  GEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKIN--EGLRGEKQLFITVSGKNMLKERE
         E  V+EIC VP SIG  YKD++ CD+++MD CH+LLGRPWQ+D  + HKG++NTY F W  +KVVL+P  K  N  +  + E +  +TV+G   +++ +
Subjt:  GEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKIN--EGLRGEKQLFITVSGKNMLKERE

Query:  QDILGLVVIEKTKE-KHVEDIEPKLQQLLHEFPHI--KEEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAV
        +    +V+I K K      D+   LQ LL EF  I   E P  LPP+RDIQHHIDL+PGASLPNL HYRMSP+E +IL   +E+L+ KG I+ S+SPCAV
Subjt:  QDILGLVVIEKTKE-KHVEDIEPKLQQLLHEFPHI--KEEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAV

Query:  PALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMN
        PALLTPKKDGSWRMCVDSRAIN+ITVKYRFPIPR++D+LD L    +FSKIDL+SGYHQIR+RPGDEWKTAFKT EGL+EW+VMPFGLSNAPSTFMR+MN
Subjt:  PALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMN

Query:  QVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLAS
        QVL PF+ KF+VVYFDDIL+YS +  EHL H+R++   L E++LYIN KK  F+   + FLGFII    I ++ +K+ AI  WPTP ++ +I++F GLA+
Subjt:  QVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLAS

Query:  FYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQEL
        FYR+FIRNFSS+ AP+TDC+KKG F+W   Q+ SF  IK+KL+++P+L LP F   F+V  DA  TGIGAVL Q+G P+E+FSEKL+ +RQ W+TYE EL
Subjt:  FYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQEL

Query:  YALVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFK
        +A+VRALK WEHYLI +EFV+ +DH +LK++  Q ++SRMH RWI+FLQRF FV+KH++G++NKVADALSR+ +LL ++SSEI +F+ L +LY+ D DF+
Subjt:  YALVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFK

Query:  DIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPL
          W KC     + ++HI +GYLFKG QLCIP TSLRE ++++ HSGGL GH G++KT+ +  +RYYWPQ+++D   FV++CPICQ  KG + N GLY+PL
Subjt:  DIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPL

Query:  PIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTT
        P+P  IWEDL++DF++GLP+TQR  DS+ V+VDRFSKM HF+ CKKT+DA ++ANLFF+E+VRLHGVPKSI SDRDVKFLSHFWRTLW+KFDT+L++S+T
Subjt:  PIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTT

Query:  AHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHDHL
        AHPQTDGQTEVTNRTLGNL+RC SG +PKQWD+ L Q EFA+N M NRST K+PFE+VYTK P+   DL  LP     +I AE  A+    + +EV  +L
Subjt:  AHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHDHL

Query:  IQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPD
         +  + YK AADK RR   F +GDLVMV L+K+RFP GTYNKLK+R+ GPF +  K  DNA+ ++LP D+ I   FNVADL  YH PD
Subjt:  IQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPD

A0A6N2LVR1 Uncharacterized protein | e value: 0.0 | %identity: 54.71
Query:  RKTTGSLMGKDEKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVE
        RK       K  ++   + L P+YEQ L+ QYQNC+QG R V  Y+EEFHRLS+R NL E +  QVARFVGGLR++I+++V +     L+EAI+ A   E
Subjt:  RKTTGSLMGKDEKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVE

Query:  EMIAIRSKNLNRRSAWETNST--KSKTNDQP---STSTKAKGKEIDNQEVAVERKKEQTFKPSGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAEE
          +  R+ N  R     T ++  K K    P   S S   KG      +V+           + +N YSRP+  KC+RCGQ  H SN CP+R  + + E+
Subjt:  EMIAIRSKNLNRRSAWETNST--KSKTNDQP---STSTKAKGKEIDNQEVAVERKKEQTFKPSGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAEE

Query:  GGQTSEDS--IEAEE-----ETELIEADDGERVS--CVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNP
        G ++  +    EAE+     E E+   D+GE +S   V++R+++ PK E   QR+ +F+TRCT+N +VCDVIIDSGSSEN ++K +V  L LK E H  P
Subjt:  GGQTSEDS--IEAEE-----ETELIEADDGERVS--CVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNP

Query:  YKIGWVRKGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINE-GLRGEKQLFITVSG
        YKIGW++KG E  V+E C    SIG  Y D+IVCDV+EMD CH++LGRPWQYD    +KG++N Y F   G+KV+L P+ +     G R +++  + V G
Subjt:  YKIGWVRKGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINE-GLRGEKQLFITVSG

Query:  KNMLKEREQDILGLVVIEKTK-EKHVEDIEPKLQQLLHEFPHI--KEEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIK
        +  L + ++D     VI   + E +  +I   LQ LL EF  I  +E P+GLPP+RDIQHHIDLIPGASLPN  HYRMSP+E  IL   +EEL+KKG ++
Subjt:  KNMLKEREQDILGLVVIEKTK-EKHVEDIEPKLQQLLHEFPHI--KEEPKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIK

Query:  PSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAP
         S+SPCAVPALL PKKDGSWRMC+DSRAIN+IT+KYRFPIPR+ D+LD L    +FSKIDL+SGYHQIR+RPGDEWKTAFKT EGL+EW+VMPFGLSNAP
Subjt:  PSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAP

Query:  STFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEI
        STFMRLMNQVL PF   F+VVYFDDIL+YS +  +H+ HLR++F VL   +L++N  K  FM   + FLGF++    I ++ +K+ AI  WPTP +I E+
Subjt:  STFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEI

Query:  QAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQT
        ++F GLA+FYR+F+R+FS + AP+T+C+KKG F W    + SF  IK+KL S+P+L LPDF   FEV  DA   GIGAVL Q+  P+ ++SEKLS +R+ 
Subjt:  QAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQT

Query:  WSTYEQELYALVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDL
        WSTYE ELYA+ RA+K WEHYL+ +EF+L +DH +LK++  Q N++RMHARW++F+QRF+F +KH+SG+ NKVADALSRK SLLT L +E+I F+ + DL
Subjt:  WSTYEQELYALVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDL

Query:  YEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSST
        Y GD DF + W KC   L  +  H  +GYLF+G QLCIP +SLRE +I E H GGL GH G++KT+ +  +RYYWPQ+++D  N VKRCP CQ +KG + 
Subjt:  YEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSST

Query:  NAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFD
        N GLY PLPIP   WEDLS+DF++GLP+TQR  DS+ V+VDRFSKM HF+ACKKT+DA+++ANLFFKEVVRLHGVPKSI SDRD KFLSHFWRTLW++FD
Subjt:  NAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFD

Query:  TTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKL
        TTL FS+T+HPQTDGQTEV NRTLGNL+RCLSG +PKQWDL LAQAEFA+N+M NRSTGK+PF+VVY + P+   DL  LP    +NI AE MA+ ++ +
Subjt:  TTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKL

Query:  HKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPD
         +EV  +L  + + YK AADKKRR   F +GDLVMV+L+K R P GT +KL D++ GP+ IL+K  DNA+++DLP D+ I P FNVADL  YH PD
Subjt:  HKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPD

M5W531 Reverse transcriptase | e value: 0.0 | %identity: 51.9
Query:  KDEKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKN
        K + +   + L  +YEQ LY  Y  C QG R+V+EY EEF RL+ R +L+E +  +VAR+  GL+  I+EK+ +Q    L EAI+ A   E +   + + 
Subjt:  KDEKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKN

Query:  LNRRSAWETNS----------TKSKTNDQPS---TSTKAKGKEIDNQEVAVERKKEQTFKPSGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAEEG
          RR+  E +            K K   Q S   T     G+  +  E +         +   QN Y++P    C+RC +  H SN CP+RK     EE 
Subjt:  LNRRSAWETNS----------TKSKTNDQPS---TSTKAKGKEIDNQEVAVERKKEQTFKPSGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAEEG

Query:  GQTSEDSIEAEEE---TELIEADDGERVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVR
         +  E     E +    E    +  E+++ V+QR+L+ PKEE   QRH +F++ C+I  +VCDVI+D+GS ENFV+KKLV  L L  E H +PY +GWV+
Subjt:  GQTSEDSIEAEEE---TELIEADDGERVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVR

Query:  KGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFITVSGKNMLKERE
        KG    V+E C VPLSIG  Y+D ++CDVI+MD CH+LLGRPWQ+D  +  KGR+N   F W  RK+ +        + LR    L + +S +  L E  
Subjt:  KGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFITVSGKNMLKERE

Query:  QDILGLVVIEKTKEKHVEDIEPKLQQLLHEFPHIKEE--PKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVP
        ++  G             DI   +QQ+L +F  +  E  P  LPP+RDIQH IDL+ GASLPNL HYRMSP+E  IL + IEELL+KG I+ SLSPCAVP
Subjt:  QDILGLVVIEKTKEKHVEDIEPKLQQLLHEFPHIKEE--PKGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVP

Query:  ALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQ
         LL PKKD +WRMCVDSRA+N+I VKYRF IPR+ D+LD L    VFSKIDL+SGYHQIR+RPGDEWKTAFK+ +GLFEW+VMPFGLSNAPSTFMRLMNQ
Subjt:  ALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQ

Query:  VLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASF
        VL PF+  F+VVYFDDIL+YST  +EHL+HLR++  VL E +LY+N KK  F   ++ FLGF++ +  I ++ +KI+AI  WP P ++ E+++F GLA+F
Subjt:  VLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASF

Query:  YRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQELY
        Y +F+R+FSS+AAP+T+CLKKG F W   Q+ SF DIK+KL ++P+L LP+F   FEV  DA   G+GAVL+Q   P+ +FSEKLS +RQ WSTY+QE Y
Subjt:  YRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPIEYFSEKLSTSRQTWSTYEQELY

Query:  ALVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKD
        A+VRALKQWEHYLI KEFVL TDH +LKY+ +QKNI +MHARW++FLQ+F FVIKH SGK N+VADALSR+ SLL  L+ E++ F+ L +LYEGD DF++
Subjt:  ALVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKD

Query:  IWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLP
        IW KC+N     DY + EGYLFKG QLCIP +SLRE LI++ H GGL+GH G++KT+    +R+YWPQ+++D    V++C  CQ +KG   N GLY PLP
Subjt:  IWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLP

Query:  IPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTA
        +P  IW+DL++DFV+G P+TQR+ DS+ V+ DRFSKM HF+ACKKT DA  IA LFF+EVVRLHGVP SI SDRD KFLSHFW TLW+ F TTL  S+TA
Subjt:  IPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTA

Query:  HPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHDHLI
        HPQTDGQTEVTNRTLGN++R + G KPKQWD AL Q EFA+N+  + +TGKSPF +VYT  P    DL  LP     ++ A+ +AE +  +  EV   L 
Subjt:  HPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHDHLI

Query:  QTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPD
        QT   YK AAD+ RR   F +GD VMV L+K RFP GTY+KLK ++ GP+ +L++  DNA+ I+LP  + I  +FNVADL  +   +
Subjt:  QTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPD

SwissProt top hits | e value | %identity | Alignment
P0CT34 Transposon Tf2-1 polyprotein | e value: 1.0e-134 | %identity: 34.07
Query:  EPKLQQLLHEFPHIKEE--PKGLP-PLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRA
        EP+L  +  EF  I  E   + LP P++ ++  ++L        + +Y + P + + ++D I + LK G I+ S +  A P +  PKK+G+ RM VD + 
Subjt:  EPKLQQLLHEFPHIKEE--PKGLP-PLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRA

Query:  INRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILV
        +N+      +P+P I  LL ++   ++F+K+DLKS YH IRVR GDE K AF+   G+FE++VMP+G+S AP+ F   +N +L       +V Y DDIL+
Subjt:  INRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILV

Query:  YSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCL
        +S +  EH+ H++ + Q L  A L IN  K  F + ++ F+G+ I +   +   + I+ +  W  P + KE++ FLG  ++ RKFI   S L  PL + L
Subjt:  YSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCL

Query:  KKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQG-----HPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYL
        KK   +KWTP Q ++ E+IK+ L S P+L+  DFS    +  DA    +GAVL Q+      +P+ Y+S K+S ++  +S  ++E+ A++++LK W HYL
Subjt:  KKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQG-----HPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYL

Query:  IS--KEFVLLTDHFSL--KYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSR------------KGSLLTILSSEIIA--FKH-LPDLYEG
         S  + F +LTDH +L  +     +  ++  ARW  FLQ F+F I ++ G  N +ADALSR            + + +  ++   I   FK+ +   Y  
Subjt:  IS--KEFVLLTDHFSL--KYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSR------------KGSLLTILSSEIIA--FKH-LPDLYEG

Query:  DTDFKDIWYKCSNFLDADDYHIVEGYLFKG-------EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRT
        DT       K  N L+ +D  + E    K        +Q+ +P+ T L   +IK+ H  G   H G      I  +R+ W  IRK    +V+ C  CQ  
Subjt:  DTDFKDIWYKCSNFLDADDYHIVEGYLFKG-------EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRT

Query:  KGSSTNAGLYSPL-PIPTS--IWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFW
        K  S N   Y PL PIP S   WE LS+DF+  LP++   ++++ V+VDRFSKM   V C K+  A   A +F + V+   G PK I++D D  F S  W
Subjt:  KGSSTNAGLYSPL-PIPTS--IWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFW

Query:  RTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLT-FDLTTLPTTVDLNIEAE
        +    K++  +KFS    PQTDGQTE TN+T+  LLRC+  + P  W   ++  + ++NN  + +T  +PFE+V+   P L+  +L +     D N    
Subjt:  RTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLT-FDLTTLPTTVDLNIEAE

Query:  CMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQ-AHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIH--IHPVFNVAD
          ++   ++ + V +HL       KK  D K ++   F  GDLVMV   K+ F     NKL     GPF +L+K G N +++DLP  I       F+V+ 
Subjt:  CMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQ-AHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIH--IHPVFNVAD

Query:  LKPY
        L+ Y
Subjt:  LKPY

P0CT35 Transposon Tf2-2 polyprotein | e value: 1.0e-134 | %identity: 34.07
Query:  EPKLQQLLHEFPHIKEE--PKGLP-PLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRA
        EP+L  +  EF  I  E   + LP P++ ++  ++L        + +Y + P + + ++D I + LK G I+ S +  A P +  PKK+G+ RM VD + 
Subjt:  EPKLQQLLHEFPHIKEE--PKGLP-PLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRA

Query:  INRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILV
        +N+      +P+P I  LL ++   ++F+K+DLKS YH IRVR GDE K AF+   G+FE++VMP+G+S AP+ F   +N +L       +V Y DDIL+
Subjt:  INRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILV

Query:  YSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCL
        +S +  EH+ H++ + Q L  A L IN  K  F + ++ F+G+ I +   +   + I+ +  W  P + KE++ FLG  ++ RKFI   S L  PL + L
Subjt:  YSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCL

Query:  KKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQG-----HPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYL
        KK   +KWTP Q ++ E+IK+ L S P+L+  DFS    +  DA    +GAVL Q+      +P+ Y+S K+S ++  +S  ++E+ A++++LK W HYL
Subjt:  KKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQG-----HPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYL

Query:  IS--KEFVLLTDHFSL--KYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSR------------KGSLLTILSSEIIA--FKH-LPDLYEG
         S  + F +LTDH +L  +     +  ++  ARW  FLQ F+F I ++ G  N +ADALSR            + + +  ++   I   FK+ +   Y  
Subjt:  IS--KEFVLLTDHFSL--KYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSR------------KGSLLTILSSEIIA--FKH-LPDLYEG

Query:  DTDFKDIWYKCSNFLDADDYHIVEGYLFKG-------EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRT
        DT       K  N L+ +D  + E    K        +Q+ +P+ T L   +IK+ H  G   H G      I  +R+ W  IRK    +V+ C  CQ  
Subjt:  DTDFKDIWYKCSNFLDADDYHIVEGYLFKG-------EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRT

Query:  KGSSTNAGLYSPL-PIPTS--IWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFW
        K  S N   Y PL PIP S   WE LS+DF+  LP++   ++++ V+VDRFSKM   V C K+  A   A +F + V+   G PK I++D D  F S  W
Subjt:  KGSSTNAGLYSPL-PIPTS--IWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFW

Query:  RTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLT-FDLTTLPTTVDLNIEAE
        +    K++  +KFS    PQTDGQTE TN+T+  LLRC+  + P  W   ++  + ++NN  + +T  +PFE+V+   P L+  +L +     D N    
Subjt:  RTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLT-FDLTTLPTTVDLNIEAE

Query:  CMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQ-AHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIH--IHPVFNVAD
          ++   ++ + V +HL       KK  D K ++   F  GDLVMV   K+ F     NKL     GPF +L+K G N +++DLP  I       F+V+ 
Subjt:  CMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQ-AHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIH--IHPVFNVAD

Query:  LKPY
        L+ Y
Subjt:  LKPY

P0CT41 Transposon Tf2-12 polyprotein | e value: 1.0e-134 | %identity: 34.07
Query:  EPKLQQLLHEFPHIKEE--PKGLP-PLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRA
        EP+L  +  EF  I  E   + LP P++ ++  ++L        + +Y + P + + ++D I + LK G I+ S +  A P +  PKK+G+ RM VD + 
Subjt:  EPKLQQLLHEFPHIKEE--PKGLP-PLRDIQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRA

Query:  INRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILV
        +N+      +P+P I  LL ++   ++F+K+DLKS YH IRVR GDE K AF+   G+FE++VMP+G+S AP+ F   +N +L       +V Y DDIL+
Subjt:  INRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILV

Query:  YSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCL
        +S +  EH+ H++ + Q L  A L IN  K  F + ++ F+G+ I +   +   + I+ +  W  P + KE++ FLG  ++ RKFI   S L  PL + L
Subjt:  YSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCL

Query:  KKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQG-----HPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYL
        KK   +KWTP Q ++ E+IK+ L S P+L+  DFS    +  DA    +GAVL Q+      +P+ Y+S K+S ++  +S  ++E+ A++++LK W HYL
Subjt:  KKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQG-----HPIEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYL

Query:  IS--KEFVLLTDHFSL--KYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSR------------KGSLLTILSSEIIA--FKH-LPDLYEG
         S  + F +LTDH +L  +     +  ++  ARW  FLQ F+F I ++ G  N +ADALSR            + + +  ++   I   FK+ +   Y  
Subjt:  IS--KEFVLLTDHFSL--KYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSR------------KGSLLTILSSEIIA--FKH-LPDLYEG

Query:  DTDFKDIWYKCSNFLDADDYHIVEGYLFKG-------EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRT
        DT       K  N L+ +D  + E    K        +Q+ +P+ T L   +IK+ H  G   H G      I  +R+ W  IRK    +V+ C  CQ  
Subjt:  DTDFKDIWYKCSNFLDADDYHIVEGYLFKG-------EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRT

Query:  KGSSTNAGLYSPL-PIPTS--IWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFW
        K  S N   Y PL PIP S   WE LS+DF+  LP++   ++++ V+VDRFSKM   V C K+  A   A +F + V+   G PK I++D D  F S  W
Subjt:  KGSSTNAGLYSPL-PIPTS--IWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFW

Query:  RTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLT-FDLTTLPTTVDLNIEAE
        +    K++  +KFS    PQTDGQTE TN+T+  LLRC+  + P  W   ++  + ++NN  + +T  +PFE+V+   P L+  +L +     D N    
Subjt:  RTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLT-FDLTTLPTTVDLNIEAE

Query:  CMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQ-AHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIH--IHPVFNVAD
          ++   ++ + V +HL       KK  D K ++   F  GDLVMV   K+ F     NKL     GPF +L+K G N +++DLP  I       F+V+ 
Subjt:  CMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQ-AHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIH--IHPVFNVAD

Query:  LKPY
        L+ Y
Subjt:  LKPY

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | e value: 2.4e-144 | %identity: 34.93
Query:  IQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFS
        ++H I++ PGA LP L  Y ++ +  + ++  +++LL    I PS SPC+ P +L PKKDG++R+CVD R +N+ T+   FP+PRI +LL ++G   +F+
Subjt:  IQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFS

Query:  KIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTK
         +DL SGYHQI + P D +KTAF T  G +E+ VMPFGL NAPSTF R M         +F+ VY DDIL++S + +EH  HL  + + L    L +  K
Subjt:  KIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTK

Query:  KSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAP--LTDCLKKGNFKWTPLQQESFEDIKKKLTSSPI
        K  F  +E  FLG+ I    I+    K  AI  +PTP ++K+ Q FLG+ ++YR+FI N S +A P  L  C K    +WT  Q ++ E +K  L +SP+
Subjt:  KSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAP--LTDCLKKGNFKWTPLQQESFEDIKKKLTSSPI

Query:  LKLPDFSSPFEVAVDACCTGIGAVLVQQGHP------IEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMH
        L   +  + + +  DA   GIGAVL +  +       + YFS+ L ++++ +   E EL  +++AL  + + L  K F L TDH SL  LQ +   +R  
Subjt:  LKLPDFSSPFEVAVDACCTGIGAVLVQQGHP------IEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMH

Query:  ARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGD-----------------------TDFKDIWYKCS-NFLDADDYHI
         RW+  L  +DF +++ +G +N VADA+SR    +T  +S  I  +     Y+ D                       + F+    K   +     +Y +
Subjt:  ARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGD-----------------------TDFKDIWYKCS-NFLDADDYHI

Query:  VEGYLFKGEQLCIPHTSLREALIKEAHSGGL-AGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNA-GLYSPLPIPTSIWEDLSIDFV
         +  ++  ++L +P    + A+++  H   L  GHFG   TL   S  YYWP+++     +++ C  CQ  K       GL  PLPI    W D+S+DFV
Subjt:  VEGYLFKGEQLCIPHTSLREALIKEAHSGGL-AGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNA-GLYSPLPIPTSIWEDLSIDFV

Query:  IGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRT
         GLP T    + I+V+VDRFSK  HF+A +KT DA  + +L F+ +   HG P++I SDRDV+  +  ++ L K+       S+  HPQTDGQ+E T +T
Subjt:  IGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRT

Query:  LGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIK--KLHKEVHDHLIQTTDSYKKAA--
        L  LLR    +  + W + L Q EF +N+   R+ GKSPFE+          DL  LP T  +  + E  A +    +L K +    IQT +  + A   
Subjt:  LGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIK--KLHKEVHDHLIQTTDSYKKAA--

Query:  -----DKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLK
             +++R+    N GD V+VH + + F  G Y K++   +GPF +++K  DNA+++DL      H V NV  LK
Subjt:  -----DKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLK

Q99315 Transposon Ty3-G Gag-Pol polyprotein | e value: 3.8e-145 | %identity: 34.69
Query:  IQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFS
        ++H I++ PGA LP L  Y ++ +  + ++  +++LL    I PS SPC+ P +L PKKDG++R+CVD R +N+ T+   FP+PRI +LL ++G   +F+
Subjt:  IQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFS

Query:  KIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTK
         +DL SGYHQI + P D +KTAF T  G +E+ VMPFGL NAPSTF R M         +F+ VY DDIL++S + +EH  HL  + + L    L +  K
Subjt:  KIDLKSGYHQIRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTK

Query:  KSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAP--LTDCLKKGNFKWTPLQQESFEDIKKKLTSSPI
        K  F  +E  FLG+ I    I+    K  AI  +PTP ++K+ Q FLG+ ++YR+FI N S +A P  L  C K    +WT  Q ++ + +K  L +SP+
Subjt:  KSMFMKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAP--LTDCLKKGNFKWTPLQQESFEDIKKKLTSSPI

Query:  LKLPDFSSPFEVAVDACCTGIGAVLVQQGHP------IEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMH
        L   +  + + +  DA   GIGAVL +  +       + YFS+ L ++++ +   E EL  +++AL  + + L  K F L TDH SL  LQ +   +R  
Subjt:  LKLPDFSSPFEVAVDACCTGIGAVLVQQGHP------IEYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMH

Query:  ARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGD-----------------------TDFKDIWYKCS-NFLDADDYHI
         RW+  L  +DF +++ +G +N VADA+SR    +T  +S  I  +     Y+ D                       + F+    K   +     +Y +
Subjt:  ARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGD-----------------------TDFKDIWYKCS-NFLDADDYHI

Query:  VEGYLFKGEQLCIPHTSLREALIKEAHSGGL-AGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNA-GLYSPLPIPTSIWEDLSIDFV
         +  ++  ++L +P    + A+++  H   L  GHFG   TL   S  YYWP+++     +++ C  CQ  K       GL  PLPI    W D+S+DFV
Subjt:  VEGYLFKGEQLCIPHTSLREALIKEAHSGGL-AGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNA-GLYSPLPIPTSIWEDLSIDFV

Query:  IGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRT
         GLP T    + I+V+VDRFSK  HF+A +KT DA  + +L F+ +   HG P++I SDRDV+  +  ++ L K+       S+  HPQTDGQ+E T +T
Subjt:  IGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRT

Query:  LGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIK--KLHKEVHDHLIQTTDSYKKAA--
        L  LLR  + +  + W + L Q EF +N+   R+ GKSPFE+          DL  LP T  +  + E  A +    +L K +    IQT +  + A   
Subjt:  LGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIK--KLHKEVHDHLIQTTDSYKKAA--

Query:  -----DKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPY-HAPDRF
             +++R+    N GD V+VH + + F  G Y K++   +GPF +++K  DNA+++DL      H V NV  LK + + PD +
Subjt:  -----DKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPY-HAPDRF

Arabidopsis top hits | e value | %identity | Alignment
AT4G13320.1 unknown protein | e value: 9.3e-14 | %identity: 33.33
Query:  LFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKA-EAHPNPYKIGWVRKGGEATVSEICTVPLSIGNAYKDQIVCDVIEM--DVCHLLLGRPWQYD
        +F+T+C IN   C +++  G+  N ++K LV  L LK  + +P+   +   R+  +    E C VP+SIG+ YKD++ C V+ M  +   LL G PW Y 
Subjt:  LFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKA-EAHPNPYKIGWVRKGGEATVSEICTVPLSIGNAYKDQIVCDVIEM--DVCHLLLGRPWQYD

Query:  TQSLHKGRENTYEFQWMGRKVVL
         Q+ H GR+++    W    ++L
Subjt:  TQSLHKGRENTYEFQWMGRKVVL

ATMG00860.1 DNA/RNA polymerases superfamily protein | e value: 2.9e-23 | %identity: 38.35
Query:  HLRKLFQVLTEAELYINTKKSMFMKKEIAFLG--FIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWT
        HL  + Q+  + + Y N KK  F + +IA+LG   II    +S +P K+EA+  WP P +  E++ FLGL  +YR+F++N+  +  PLT+ LKK + KWT
Subjt:  HLRKLFQVLTEAELYINTKKSMFMKKEIAFLG--FIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWT

Query:  PLQQESFEDIKKKLTSSPILKLPDFSSPFEVAV
         +   +F+ +K  +T+ P+L LPD   PF   V
Subjt:  PLQQESFEDIKKKLTSSPILKLPDFSSPFEVAV


Sequences
CDS sequence
ATGCGGAAAACAACCGGTTCGCTCATGGGAAAAGATGAAAAAATTGCTGAAGGCAAGATTCTTACACCAAACTACGAACAAACTTTATACAATCAGTACCAAAACTGCCG
CCAAGGTGCCCGGACAGTAGCTGAGTACATTGAAGAATTCCACCGCCTGAGTGCAAGAACAAACCTAAGCGAAAATGAACAGCACCAGGTTGCAAGATTTGTGGGAGGCC
TTCGATTCGACATCAAGGAAAAGGTCAGACTACAACCATTCCGTTTTCTGTCTGAAGCAATATCATTTGCAGAAACGGTAGAGGAGATGATTGCAATTCGATCCAAGAAT
CTGAATAGAAGATCAGCATGGGAAACAAATTCGACAAAAAGCAAAACAAACGACCAACCTTCAACCTCAACAAAAGCAAAAGGGAAAGAGATTGATAATCAAGAAGTAGC
CGTTGAAAGAAAGAAAGAACAGACGTTCAAGCCTAGTGGTCAGAACAGCTACTCCCGCCCGTCATTAGGAAAATGCTTCCGATGTGGCCAAACTGCCCACCTGTCCAACA
ACTGTCCGCAAAGAAAAACCATTGCAATAGCCGAAGAAGGAGGACAGACAAGTGAAGACAGTATAGAAGCAGAGGAAGAAACTGAACTGATTGAAGCAGATGATGGGGAA
AGGGTCTCATGTGTTATCCAACGGTTACTAATTACGCCCAAGGAAGAAAAGAACCTGCAACGCCACTGTCTTTTCAAGACAAGATGCACCATAAACGGAAGAGTATGTGA
TGTAATCATAGACAGCGGCAGCAGTGAAAACTTCGTAGCAAAGAAATTAGTAACAGTCTTGAATCTAAAGGCTGAAGCACATCCAAACCCTTACAAGATAGGTTGGGTGA
GAAAAGGAGGAGAAGCCACGGTCAGCGAAATCTGTACAGTCCCTCTCTCCATTGGAAACGCCTACAAAGATCAAATTGTTTGTGATGTCATTGAGATGGATGTATGCCAT
CTCCTACTAGGAAGACCTTGGCAGTATGATACTCAATCATTACACAAAGGGAGGGAAAATACATATGAGTTTCAATGGATGGGAAGAAAGGTAGTCTTACTCCCTATAAC
AAAAAAGATTAACGAAGGGTTAAGAGGTGAGAAACAGTTATTCATCACTGTTAGTGGGAAAAACATGCTTAAAGAAAGGGAACAAGACATCCTAGGGCTGGTTGTTATTG
AAAAAACCAAAGAAAAACATGTTGAAGACATAGAACCCAAATTACAGCAGCTCCTTCATGAGTTCCCTCATATAAAGGAAGAGCCAAAGGGACTCCCACCCCTTCGAGAT
ATACAGCACCACATAGACTTGATTCCGGGAGCATCACTACCAAATTTGGCTCACTATAGGATGAGCCCCCAGGAGTACAAAATACTTCATGATCACATTGAGGAATTACT
AAAGAAAGGGCACATCAAACCTAGCCTCAGCCCGTGTGCAGTACCAGCACTTCTCACGCCAAAGAAAGATGGAAGCTGGAGAATGTGCGTTGACAGCAGAGCCATCAACC
GCATCACAGTAAAGTATAGATTTCCCATTCCAAGGATTAGTGACCTGCTAGATCAACTCGGTAAAGTCAGTGTCTTTTCGAAAATTGACTTGAAAAGTGGCTATCATCAA
ATACGGGTAAGACCTGGCGACGAATGGAAGACAGCCTTCAAGACAAACGAAGGATTATTTGAATGGATGGTCATGCCATTCGGCCTTTCTAATGCACCCAGCACCTTCAT
GAGATTGATGAACCAGGTACTTCACCCATTTCTCAACAAATTCATAGTAGTTTACTTCGATGACATACTTGTTTACAGCACAAACAATGATGAGCATTTATTACACCTAA
GGAAGCTGTTCCAAGTCTTAACAGAAGCGGAACTCTACATAAATACTAAGAAAAGCATGTTTATGAAAAAAGAAATTGCATTCCTCGGCTTTATAATCAAACAAGGAAGC
ATAAGCATGGAACCAAAGAAAATCGAAGCCATCCATACATGGCCGACTCCTGCCTCCATTAAAGAAATACAAGCCTTCCTCGGCCTGGCTTCGTTTTACAGGAAATTCAT
CAGAAATTTCAGCTCTTTAGCCGCACCACTAACTGACTGTCTAAAGAAAGGAAACTTCAAATGGACCCCATTACAACAAGAAAGCTTTGAAGATATCAAAAAGAAGCTCA
CATCCAGCCCTATCCTTAAATTACCAGATTTCTCTTCACCTTTTGAAGTAGCAGTTGATGCATGCTGTACAGGGATTGGAGCTGTCCTAGTACAACAAGGACATCCTATC
GAATACTTCAGTGAAAAACTCAGCACATCAAGACAGACCTGGAGCACATACGAACAAGAGCTGTATGCCCTCGTCCGAGCACTAAAACAATGGGAACACTACCTGATCTC
TAAAGAATTTGTACTCCTAACTGACCATTTCTCCCTAAAATACCTTCAAGCCCAAAAGAATATCAGCAGGATGCACGCACGCTGGATATCCTTCCTCCAAAGGTTTGACT
TCGTGATCAAACACCAATCAGGCAAAGAAAACAAGGTGGCCGATGCGCTAAGCCGAAAAGGCTCCCTACTCACAATACTGTCCTCGGAAATCATAGCATTCAAACATTTA
CCTGACTTATACGAAGGTGATACTGACTTCAAGGATATCTGGTACAAATGCTCCAACTTCTTAGACGCTGATGACTACCACATTGTTGAAGGATATCTATTTAAAGGAGA
ACAATTATGCATCCCGCACACCTCACTACGTGAAGCCTTAATAAAGGAAGCACATTCTGGAGGGCTAGCTGGACATTTCGGACAGAATAAGACATTGGAGATCACTTCCA
AACGATACTACTGGCCGCAAATAAGAAAAGACTCCAATAATTTCGTAAAAAGATGCCCCATCTGCCAAAGAACCAAAGGCTCAAGCACGAATGCAGGATTATACTCGCCA
CTACCCATCCCAACCTCAATATGGGAAGATTTATCAATTGACTTCGTGATTGGATTACCAAAAACACAAAGACAATTTGACTCAATAATGGTTATAGTGGACAGATTCAG
CAAAATGACACATTTCGTAGCATGCAAAAAGACAAATGATGCAATCTACATAGCCAACCTCTTCTTTAAAGAAGTAGTACGACTACATGGAGTACCTAAAAGCATAGTAT
CAGACAGAGATGTCAAGTTCCTGAGTCACTTTTGGCGAACACTGTGGAAGAAGTTTGACACAACACTGAAATTCAGCACCACAGCCCACCCACAGACAGATGGACAAACT
GAAGTAACAAACAGGACTCTCGGTAATCTGCTACGCTGCCTTAGCGGGTCAAAACCAAAACAATGGGATCTAGCATTGGCTCAAGCTGAATTCGCCTTCAATAATATGAA
GAACAGATCAACAGGAAAGTCCCCCTTCGAAGTAGTTTATACTAAACTACCACGATTAACCTTTGATCTCACTACACTCCCCACAACCGTGGATCTCAACATCGAAGCAG
AATGCATGGCAGAAAATATCAAAAAACTACACAAGGAAGTCCATGATCATCTTATACAGACAACAGACTCCTACAAAAAGGCAGCAGATAAAAAAAGAAGACAAGCCCAC
TTCAATAAAGGAGACCTAGTAATGGTACACCTGAAAAAGAGCAGATTTCCTACCGGCACATACAACAAGCTGAAAGACAGACAAATTGGGCCATTCCCTATATTAGAGAA
ATACGGAGATAATGCCTTCAAGATCGATCTACCACAAGACATACACATACACCCAGTCTTCAATGTTGCTGACCTAAAGCCATACCATGCACCAGATCGTTTCAGGCTTG
CTGACTGA
mRNA sequence
ATGCGGAAAACAACCGGTTCGCTCATGGGAAAAGATGAAAAAATTGCTGAAGGCAAGATTCTTACACCAAACTACGAACAAACTTTATACAATCAGTACCAAAACTGCCG
CCAAGGTGCCCGGACAGTAGCTGAGTACATTGAAGAATTCCACCGCCTGAGTGCAAGAACAAACCTAAGCGAAAATGAACAGCACCAGGTTGCAAGATTTGTGGGAGGCC
TTCGATTCGACATCAAGGAAAAGGTCAGACTACAACCATTCCGTTTTCTGTCTGAAGCAATATCATTTGCAGAAACGGTAGAGGAGATGATTGCAATTCGATCCAAGAAT
CTGAATAGAAGATCAGCATGGGAAACAAATTCGACAAAAAGCAAAACAAACGACCAACCTTCAACCTCAACAAAAGCAAAAGGGAAAGAGATTGATAATCAAGAAGTAGC
CGTTGAAAGAAAGAAAGAACAGACGTTCAAGCCTAGTGGTCAGAACAGCTACTCCCGCCCGTCATTAGGAAAATGCTTCCGATGTGGCCAAACTGCCCACCTGTCCAACA
ACTGTCCGCAAAGAAAAACCATTGCAATAGCCGAAGAAGGAGGACAGACAAGTGAAGACAGTATAGAAGCAGAGGAAGAAACTGAACTGATTGAAGCAGATGATGGGGAA
AGGGTCTCATGTGTTATCCAACGGTTACTAATTACGCCCAAGGAAGAAAAGAACCTGCAACGCCACTGTCTTTTCAAGACAAGATGCACCATAAACGGAAGAGTATGTGA
TGTAATCATAGACAGCGGCAGCAGTGAAAACTTCGTAGCAAAGAAATTAGTAACAGTCTTGAATCTAAAGGCTGAAGCACATCCAAACCCTTACAAGATAGGTTGGGTGA
GAAAAGGAGGAGAAGCCACGGTCAGCGAAATCTGTACAGTCCCTCTCTCCATTGGAAACGCCTACAAAGATCAAATTGTTTGTGATGTCATTGAGATGGATGTATGCCAT
CTCCTACTAGGAAGACCTTGGCAGTATGATACTCAATCATTACACAAAGGGAGGGAAAATACATATGAGTTTCAATGGATGGGAAGAAAGGTAGTCTTACTCCCTATAAC
AAAAAAGATTAACGAAGGGTTAAGAGGTGAGAAACAGTTATTCATCACTGTTAGTGGGAAAAACATGCTTAAAGAAAGGGAACAAGACATCCTAGGGCTGGTTGTTATTG
AAAAAACCAAAGAAAAACATGTTGAAGACATAGAACCCAAATTACAGCAGCTCCTTCATGAGTTCCCTCATATAAAGGAAGAGCCAAAGGGACTCCCACCCCTTCGAGAT
ATACAGCACCACATAGACTTGATTCCGGGAGCATCACTACCAAATTTGGCTCACTATAGGATGAGCCCCCAGGAGTACAAAATACTTCATGATCACATTGAGGAATTACT
AAAGAAAGGGCACATCAAACCTAGCCTCAGCCCGTGTGCAGTACCAGCACTTCTCACGCCAAAGAAAGATGGAAGCTGGAGAATGTGCGTTGACAGCAGAGCCATCAACC
GCATCACAGTAAAGTATAGATTTCCCATTCCAAGGATTAGTGACCTGCTAGATCAACTCGGTAAAGTCAGTGTCTTTTCGAAAATTGACTTGAAAAGTGGCTATCATCAA
ATACGGGTAAGACCTGGCGACGAATGGAAGACAGCCTTCAAGACAAACGAAGGATTATTTGAATGGATGGTCATGCCATTCGGCCTTTCTAATGCACCCAGCACCTTCAT
GAGATTGATGAACCAGGTACTTCACCCATTTCTCAACAAATTCATAGTAGTTTACTTCGATGACATACTTGTTTACAGCACAAACAATGATGAGCATTTATTACACCTAA
GGAAGCTGTTCCAAGTCTTAACAGAAGCGGAACTCTACATAAATACTAAGAAAAGCATGTTTATGAAAAAAGAAATTGCATTCCTCGGCTTTATAATCAAACAAGGAAGC
ATAAGCATGGAACCAAAGAAAATCGAAGCCATCCATACATGGCCGACTCCTGCCTCCATTAAAGAAATACAAGCCTTCCTCGGCCTGGCTTCGTTTTACAGGAAATTCAT
CAGAAATTTCAGCTCTTTAGCCGCACCACTAACTGACTGTCTAAAGAAAGGAAACTTCAAATGGACCCCATTACAACAAGAAAGCTTTGAAGATATCAAAAAGAAGCTCA
CATCCAGCCCTATCCTTAAATTACCAGATTTCTCTTCACCTTTTGAAGTAGCAGTTGATGCATGCTGTACAGGGATTGGAGCTGTCCTAGTACAACAAGGACATCCTATC
GAATACTTCAGTGAAAAACTCAGCACATCAAGACAGACCTGGAGCACATACGAACAAGAGCTGTATGCCCTCGTCCGAGCACTAAAACAATGGGAACACTACCTGATCTC
TAAAGAATTTGTACTCCTAACTGACCATTTCTCCCTAAAATACCTTCAAGCCCAAAAGAATATCAGCAGGATGCACGCACGCTGGATATCCTTCCTCCAAAGGTTTGACT
TCGTGATCAAACACCAATCAGGCAAAGAAAACAAGGTGGCCGATGCGCTAAGCCGAAAAGGCTCCCTACTCACAATACTGTCCTCGGAAATCATAGCATTCAAACATTTA
CCTGACTTATACGAAGGTGATACTGACTTCAAGGATATCTGGTACAAATGCTCCAACTTCTTAGACGCTGATGACTACCACATTGTTGAAGGATATCTATTTAAAGGAGA
ACAATTATGCATCCCGCACACCTCACTACGTGAAGCCTTAATAAAGGAAGCACATTCTGGAGGGCTAGCTGGACATTTCGGACAGAATAAGACATTGGAGATCACTTCCA
AACGATACTACTGGCCGCAAATAAGAAAAGACTCCAATAATTTCGTAAAAAGATGCCCCATCTGCCAAAGAACCAAAGGCTCAAGCACGAATGCAGGATTATACTCGCCA
CTACCCATCCCAACCTCAATATGGGAAGATTTATCAATTGACTTCGTGATTGGATTACCAAAAACACAAAGACAATTTGACTCAATAATGGTTATAGTGGACAGATTCAG
CAAAATGACACATTTCGTAGCATGCAAAAAGACAAATGATGCAATCTACATAGCCAACCTCTTCTTTAAAGAAGTAGTACGACTACATGGAGTACCTAAAAGCATAGTAT
CAGACAGAGATGTCAAGTTCCTGAGTCACTTTTGGCGAACACTGTGGAAGAAGTTTGACACAACACTGAAATTCAGCACCACAGCCCACCCACAGACAGATGGACAAACT
GAAGTAACAAACAGGACTCTCGGTAATCTGCTACGCTGCCTTAGCGGGTCAAAACCAAAACAATGGGATCTAGCATTGGCTCAAGCTGAATTCGCCTTCAATAATATGAA
GAACAGATCAACAGGAAAGTCCCCCTTCGAAGTAGTTTATACTAAACTACCACGATTAACCTTTGATCTCACTACACTCCCCACAACCGTGGATCTCAACATCGAAGCAG
AATGCATGGCAGAAAATATCAAAAAACTACACAAGGAAGTCCATGATCATCTTATACAGACAACAGACTCCTACAAAAAGGCAGCAGATAAAAAAAGAAGACAAGCCCAC
TTCAATAAAGGAGACCTAGTAATGGTACACCTGAAAAAGAGCAGATTTCCTACCGGCACATACAACAAGCTGAAAGACAGACAAATTGGGCCATTCCCTATATTAGAGAA
ATACGGAGATAATGCCTTCAAGATCGATCTACCACAAGACATACACATACACCCAGTCTTCAATGTTGCTGACCTAAAGCCATACCATGCACCAGATCGTTTCAGGCTTG
CTGACTGA
Protein sequence
MRKTTGSLMGKDEKIAEGKILTPNYEQTLYNQYQNCRQGARTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKN
LNRRSAWETNSTKSKTNDQPSTSTKAKGKEIDNQEVAVERKKEQTFKPSGQNSYSRPSLGKCFRCGQTAHLSNNCPQRKTIAIAEEGGQTSEDSIEAEEETELIEADDGE
RVSCVIQRLLITPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCH
LLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFITVSGKNMLKEREQDILGLVVIEKTKEKHVEDIEPKLQQLLHEFPHIKEEPKGLPPLRD
IQHHIDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRISDLLDQLGKVSVFSKIDLKSGYHQ
IRVRPGDEWKTAFKTNEGLFEWMVMPFGLSNAPSTFMRLMNQVLHPFLNKFIVVYFDDILVYSTNNDEHLLHLRKLFQVLTEAELYINTKKSMFMKKEIAFLGFIIKQGS
ISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDACCTGIGAVLVQQGHPI
EYFSEKLSTSRQTWSTYEQELYALVRALKQWEHYLISKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHL
PDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSP
LPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQT
EVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNIEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAH
FNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPQDIHIHPVFNVADLKPYHAPDRFRLAD