; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G09885 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G09885
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRNA-directed DNA polymerase
Genome locationClcChr02:14805414..14809487
RNA-Seq ExpressionClc02G09885
SyntenyClc02G09885
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7014963.1 unnamed protein product [Microthlaspi erraticum]1.1e-26444.76Show/hide
Query:  CFRCGQQGHLSNECPQRRTLA-VQEEEDGADGCNSEDN-EELAYLQPDDGDQLSCVLQQVLLTPKSESHPQRHSLFRTRCTINGRVCQIIIDSGSSKNIV
        C+RC  QGH SN CP RRT+A + EEED     N +D  E + + + +  ++++ VLQ++LL+ K E   QR +LFRT C+I  +VC +I+D+GS++N+V
Subjt:  CFRCGQQGHLSNECPQRRTLA-VQEEEDGADGCNSEDN-EELAYLQPDDGDQLSCVLQQVLLTPKSESHPQRHSLFRTRCTINGRVCQIIIDSGSSKNIV

Query:  SKKLVTALNLKAKAHPIRIGLAGSKKGE-------------------------------------KQWLKTKLETL--TNTKLHAIWMGKKVVLLPL---
        S+KLV  L L  K H     L   KKG                                      + W   K  T    +  +   W   K+ + P+   
Subjt:  SKKLVTALNLKAKAHPIRIGLAGSKKGE-------------------------------------KQWLKTKLETL--TNTKLHAIWMGKKVVLLPL---

Query:  ----GK-------------------NDSKLLSP----------NSNILDP-HISNLLNEFPTLI--EEPNSLPPLWNIQHHIDQILGSSLPHLPHYRTSP
            GK                    ++K L P          N     P  +  +L +F  L   E PN LPP+ +IQHHID I GSSLP+LPHYR SP
Subjt:  ----GK-------------------NDSKLLSP----------NSNILDP-HISNLLNEFPTLI--EEPNSLPPLWNIQHHIDQILGSSLPHLPHYRTSP

Query:  KEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKV-----------------
        KE E L  +I  LL KG ++ S+SPCAVP LL PKK   WR+CVDSRAINKITIKYRFPIPRL D+LD+L G+  FSK+ L+                  
Subjt:  KEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKV-----------------

Query:  ---------VITKLG-SNRGMSGERLLKQM------------------------------------------------------KAFLNGF---PKGISV
                 ++   G SN   +  RL+ Q+                                                      K    GF    +GI V
Subjt:  ---------VITKLG-SNRGMSGERLLKQM------------------------------------------------------KAFLNGF---PKGISV

Query:  DPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDASGIGIGVV
        D  K+ AI +W  PK+  ++RSF  L +FYR+F+  FSTI A +T CLK    F WG +Q +SF+ +K  L + PVL  PDF+K F+V  DASG+GIG V
Subjt:  DPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDASGIGIGVV

Query:  LSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSR
        LSQ+K P+ FFSEKLS +RQ+WSTY+QE Y++ ++L+QWEHYL+ +EF+L  DH +LKFL  QK I+ MHARW+ F+Q+F F+I+H  G  NK AD LSR
Subjt:  LSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSR

Query:  KGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGLAGHFGRDKTLALLSSKFFWPQLK
        + +LL  L  +I+ F+ L EL  +D +FK +W   +   P+ DFHI  GFLFK D LCIP +SL E L+R+ H GGL+GH GRDKT+A L  +++WP L+
Subjt:  KGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGLAGHFGRDKTLALLSSKFFWPQLK

Query:  KDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSI
        +D    VKRCYICQ+SKG S N GLY  LP+P++IW+DLSMDFVLGLPRTQRG +S+ VVVDRFSKM HF+ACKKT DA N+A LF RE+VRLHG+PK+I
Subjt:  KDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSI

Query:  VSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLTLDLAK
        +SDRD KF+ HFW +LW+ FGT L  S+  HPQTDGQTEVTNRTLGN++RS+ GDRP+QWDL L Q EFA+N+  + +TGK PF +VYT VP+  +DL K
Subjt:  VSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLTLDLAK

Query:  LPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVK
        LP    +S  A  MA+ I    E V+  LEAT    K  ADK +R   F+ GD VM+ L+K R P G + KL   K GP  +L +I   AY +DLP  + 
Subjt:  LPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVK

Query:  ISNIFNVSDIFPYHAQNQFTLSQSSR
        ISN FNV+DI  YHA       ++SR
Subjt:  ISNIFNVSDIFPYHAQNQFTLSQSSR

CAA7028195.1 unnamed protein product [Microthlaspi erraticum]1.1e-26444.76Show/hide
Query:  CFRCGQQGHLSNECPQRRTLA-VQEEEDGADGCNSEDN-EELAYLQPDDGDQLSCVLQQVLLTPKSESHPQRHSLFRTRCTINGRVCQIIIDSGSSKNIV
        C+RC  QGH SN CP RRT+A + EEED     N +D  E + + + +  ++++ VLQ++LL+ K E   QR +LFRT C+I  +VC +I+D+GS++N+V
Subjt:  CFRCGQQGHLSNECPQRRTLA-VQEEEDGADGCNSEDN-EELAYLQPDDGDQLSCVLQQVLLTPKSESHPQRHSLFRTRCTINGRVCQIIIDSGSSKNIV

Query:  SKKLVTALNLKAKAHPIRIGLAGSKKGE-------------------------------------KQWLKTKLETL--TNTKLHAIWMGKKVVLLPL---
        S+KLV  L L  K H     L   KKG                                      + W   K  T    +  +   W   K+ + P+   
Subjt:  SKKLVTALNLKAKAHPIRIGLAGSKKGE-------------------------------------KQWLKTKLETL--TNTKLHAIWMGKKVVLLPL---

Query:  ----GK-------------------NDSKLLSP----------NSNILDP-HISNLLNEFPTLI--EEPNSLPPLWNIQHHIDQILGSSLPHLPHYRTSP
            GK                    ++K L P          N     P  +  +L +F  L   E PN LPP+ +IQHHID I GSSLP+LPHYR SP
Subjt:  ----GK-------------------NDSKLLSP----------NSNILDP-HISNLLNEFPTLI--EEPNSLPPLWNIQHHIDQILGSSLPHLPHYRTSP

Query:  KEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKV-----------------
        KE E L  +I  LL KG ++ S+SPCAVP LL PKK   WR+CVDSRAINKITIKYRFPIPRL D+LD+L G+  FSK+ L+                  
Subjt:  KEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKV-----------------

Query:  ---------VITKLG-SNRGMSGERLLKQM------------------------------------------------------KAFLNGF---PKGISV
                 ++   G SN   +  RL+ Q+                                                      K    GF    +GI V
Subjt:  ---------VITKLG-SNRGMSGERLLKQM------------------------------------------------------KAFLNGF---PKGISV

Query:  DPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDASGIGIGVV
        D  K+ AI +W  PK+  ++RSF  L +FYR+F+  FSTI A +T CLK    F WG +Q +SF+ +K  L + PVL  PDF+K F+V  DASG+GIG V
Subjt:  DPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDASGIGIGVV

Query:  LSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSR
        LSQ+K P+ FFSEKLS +RQ+WSTY+QE Y++ ++L+QWEHYL+ +EF+L  DH +LKFL  QK I+ MHARW+ F+Q+F F+I+H  G  NK AD LSR
Subjt:  LSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSR

Query:  KGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGLAGHFGRDKTLALLSSKFFWPQLK
        + +LL  L  +I+ F+ L EL  +D +FK +W   +   P+ DFHI  GFLFK D LCIP +SL E L+R+ H GGL+GH GRDKT+A L  +++WP L+
Subjt:  KGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGLAGHFGRDKTLALLSSKFFWPQLK

Query:  KDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSI
        +D    VKRCYICQ+SKG S N GLY  LP+P++IW+DLSMDFVLGLPRTQRG +S+ VVVDRFSKM HF+ACKKT DA N+A LF RE+VRLHG+PK+I
Subjt:  KDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSI

Query:  VSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLTLDLAK
        +SDRD KF+ HFW +LW+ FGT L  S+  HPQTDGQTEVTNRTLGN++RS+ GDRP+QWDL L Q EFA+N+  + +TGK PF +VYT VP+  +DL K
Subjt:  VSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLTLDLAK

Query:  LPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVK
        LP    +S  A  MA+ I    E V+  LEAT    K  ADK +R   F+ GD VM+ L+K R P G + KL   K GP  +L +I   AY +DLP  + 
Subjt:  LPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVK

Query:  ISNIFNVSDIFPYHAQNQFTLSQSSR
        ISN FNV+DI  YHA       ++SR
Subjt:  ISNIFNVSDIFPYHAQNQFTLSQSSR

KAG7588770.1 Integrase catalytic core [Arabidopsis suecica]1.1e-26744.26Show/hide
Query:  DTSVARRNNNPYNPPTLAKCFRCGQQGHLSNECPQRRTLAVQEEEDGADGCNSEDNEELAYLQPDDGDQLSCVLQQVLLTPKSESHPQRHSLFRTRCTIN
        D + A + N PY  PTL KCFRC   GH SN CP R+T+A+ EE++  D   +E+ E + + + +  D++S VLQ++LL+ K E   QR +LFRT C+I 
Subjt:  DTSVARRNNNPYNPPTLAKCFRCGQQGHLSNECPQRRTLAVQEEEDGADGCNSEDNEELAYLQPDDGDQLSCVLQQVLLTPKSESHPQRHSLFRTRCTIN

Query:  GRVCQIIIDSGSSKNIVSKKLVTALNLKAKAHPIRIGLAGSKKGEK------------------------------------------------------
         +VC +I+D+GSS+N+VS+KLV  L +    H    GL   KKG K                                                      
Subjt:  GRVCQIIIDSGSSKNIVSKKLVTALNLKAKAHPIRIGLAGSKKGEK------------------------------------------------------

Query:  --QWLKTKLETLTNTKLHAIWMGKKVVLLPLGKNDSKL--------------------LSPNSNILDPHISNLLNEFPTLIEE--PNSLPPLWNIQHHID
           W   K+         ++   KK   L +  +D +L                               +  +L ++  L+ E  P++LPP+ +IQHHID
Subjt:  --QWLKTKLETLTNTKLHAIWMGKKVVLLPLGKNDSKL--------------------LSPNSNILDPHISNLLNEFPTLIEE--PNSLPPLWNIQHHID

Query:  QILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKV
         I GSSLP+LPHYR SPKE E +  +I  LL KG ++ S+SPCAVP LL PKK   WR+CVDSRAINKITIKYRFPIPRL D+LD+L G+  FSK+ L+ 
Subjt:  QILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKV

Query:  --------------------------VITKLG-SNRGMSGERLLKQ-MKAFLNGF---------------------------------------------
                                  ++   G SN   +  RL+ Q ++ F+  F                                             
Subjt:  --------------------------VITKLG-SNRGMSGERLLKQ-MKAFLNGF---------------------------------------------

Query:  -----------PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFN
                     GI VD  K+ AI +W  PKT  ++RSF  LA+FYR+F+  FSTI A +T CLK    F WG +Q +SF+ +K  L + PVL  PDF+
Subjt:  -----------PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFN

Query:  KPFEVAVDASGIGIGVVLSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHARWIQFIQRFDFV
        K F+V  DASG+GIG VLSQ+K PI FFSEKLS +RQKWSTY+QE Y++ ++L+QWEHYL+ +EF+L  DH +LKFL  QK I+ MHARW+ F+Q+F F+
Subjt:  KPFEVAVDASGIGIGVVLSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHARWIQFIQRFDFV

Query:  IKHTPGKANKFADTLSRKGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGLAGHFGR
        I+H  G  NK AD LSR+ +LLT L  +I+ F+ L EL   D +FK +W   +   P+ DFHI  G+LFK D LCIP +SL E L+RE H GGL+GH GR
Subjt:  IKHTPGKANKFADTLSRKGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGLAGHFGR

Query:  DKTLALLSSKFFWPQLKKDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVA
        DKT+A L  +++WP L+KD    V+RC++CQ SKG S N GLY  L +P++IW+DLSMDFVLGLPRTQRG +S+ VVVD+FSKM HF+AC+KT DA N+A
Subjt:  DKTLALLSSKFFWPQLKKDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVA

Query:  NLFLREIVRLHGIPKSIVSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCP
         LF RE+VRLHG+PKSIVSDRD KF+ HFW +LW+ FGT+L  S+  HPQ+DGQTEVTNRTLGN++RS+ GD+P+QWDL L Q EFA+N+  + +TGK P
Subjt:  NLFLREIVRLHGIPKSIVSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCP

Query:  FEVVYTHVPRLTLDLAKLPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPIL
        F +VYT VP+  +DL  LP    +SA A  MA  I    E V+  LEAT    K  ADK +R   F+ GD VM+ LKK R P G + KL  +K GP  IL
Subjt:  FEVVYTHVPRLTLDLAKLPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPIL

Query:  ERIGPYAYTIDLPPTVKISNIFNVSDIFPYHA
        +++   AY +DLP  + IS+ FNV+DI+ YHA
Subjt:  ERIGPYAYTIDLPPTVKISNIFNVSDIFPYHA

PKU87230.1 RNA-directed DNA polymerase [Dendrobium catenatum]1.2e-26344.07Show/hide
Query:  NPYNPPTLAKCFRCGQQGHLSNECPQRRTLAVQEEEDGADGCNSEDNEE--LAYLQPDDGDQLSCVLQQVLLTPKSESHPQRHSLFRTRCTINGRVCQII
        NPY  PT  KCFRC Q GH SNECP R  + + E E   D    +DN E  +  +  D+G+ + C+L+++LL P+  +  QR+++FRTRCTING+VC+++
Subjt:  NPYNPPTLAKCFRCGQQGHLSNECPQRRTLAVQEEEDGADGCNSEDNEE--LAYLQPDDGDQLSCVLQQVLLTPKSESHPQRHSLFRTRCTINGRVCQII

Query:  IDSGSSKNIVSKKLVTALNLKAKAHPIRIGLAGSKKG---------------EKQW-LKTKLETLTNTKLHAI-----------------------WMGK
        ID+G ++N+VS+ LV +L LK   +P+   ++  K+G                KQ+  +   + +     H I                       W G+
Subjt:  IDSGSSKNIVSKKLVTALNLKAKAHPIRIGLAGSKKG---------------EKQW-LKTKLETLTNTKLHAI-----------------------WMGK

Query:  KVVLLPLGKNDSKLLSPNSNIL----------------------------------DPHISNLLNEFPTLI--EEPNSLPPLWNIQHHIDQILGSSLPHL
        K+ LLP   N S   S +  IL                                  +P +S L+ +F  L   E P  LPPL +IQH ID + GS+LP+L
Subjt:  KVVLLPLGKNDSKLLSPNSNIL----------------------------------DPHISNLLNEFPTLI--EEPNSLPPLWNIQHHIDQILGSSLPHL

Query:  PHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKVVI--------
        PHYR SPKE+E L   +  L+ K  +QPSLSPCAVPALL PKKD  WR+C+DSRAINKIT K+RFP+PR+ DLLD+L G+  FSKL L+           
Subjt:  PHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKVVI--------

Query:  ----TKLGSNRGMSGER-------------------------------------------------------LLKQMKAFLN--------------GF--
            T   + +G+   R                                                       +L++ K +LN              GF  
Subjt:  ----TKLGSNRGMSGER-------------------------------------------------------LLKQMKAFLN--------------GF--

Query:  -PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDAS
          +G+  DP K+ A+ +W  P++  D+RSF  LA+FYR+FI  FS I A +T CLK+   F+W E+  +S+  +K+ LSS PVL  P+F  PF+V  DAS
Subjt:  -PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDAS

Query:  GIGIGVVLSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANK
         +GIG VLSQD  PIEFFSEKLSP+RQKW+ YEQELY+++++LK WEHYLL ++FVL  DH +L+F+  QK IS MHARW+ F+Q+F FV+KH  G  N+
Subjt:  GIGIGVVLSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANK

Query:  FADTLSRKGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGLAGHFGRDKTLALLSSK
         AD LSR+  L+T L+ ++   D L EL   D DF  IW+  S   PA D+ + +G+LFK + LCIP++S  + ++++AH+GGLA H GR+KT+  + S+
Subjt:  FADTLSRKGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGLAGHFGRDKTLALLSSK

Query:  FFWPQLKKDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRL
        FFWP+L +DV+ FV+RC +CQ  KG   N GLY+ LP+P++IWED+S+DF+LGLPRTQ+G +SI+VVVDRFSKMAHF+ACKKT DA+++A LF  EIVRL
Subjt:  FFWPQLKKDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRL

Query:  HGIPKSIVSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPR
        HG+P+SI SDRDVKF+ HFWR LWK+  T++  S+  HPQ+DGQTEV NRTLGN+LR L   RP+QW+  L QAEFA+N+M+N S GK PF +VYT  P 
Subjt:  HGIPKSIVSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPR

Query:  LTLDLAKLPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTI
           D+A LP   + SA A  + D  + I +EV+  L +++  YK  AD  +R V F+ G+LVMI L+K R P G  SKL  +K GP+PIL+RI   AYT+
Subjt:  LTLDLAKLPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTI

Query:  DLPPTVKISNIFNVSDIFPYH
        DLP   + S  FNV DIFPYH
Subjt:  DLPPTVKISNIFNVSDIFPYH

PWA81295.1 transposon Ty3-I Gag-Pol polyprotein [Artemisia annua]1.9e-26442.94Show/hide
Query:  DSILKQQPIRSNDTSVARRNNNPYNPPTLAKCFRCGQQGHLSNECPQRRTLAVQE-EEDGADGCNS-EDNEELAYLQPDDG--DQLSCVLQQVLLTPKSE
        D   + QP++S          NPY  P   KCFRCG+ GH SN CP+R TL   E   DG  G  S ++ +++ Y +P DG  +Q++CV+Q+ L +PK  
Subjt:  DSILKQQPIRSNDTSVARRNNNPYNPPTLAKCFRCGQQGHLSNECPQRRTLAVQE-EEDGADGCNS-EDNEELAYLQPDDG--DQLSCVLQQVLLTPKSE

Query:  SHPQRHSLFRTRCTINGRVCQIIIDSGSSKNIVSKKLVTALNLKAKAHPIRIGLAGSKKGE----KQWLKTKL------------ETLTNTKLHAI----
           QR+ +F+T+C +  ++C IIID GS +N+VSK LV A  L  + HP    +   KKG      +  K  L            + +     H +    
Subjt:  SHPQRHSLFRTRCTINGRVCQIIIDSGSSKNIVSKKLVTALNLKAKAHPIRIGLAGSKKGE----KQWLKTKL------------ETLTNTKLHAI----

Query:  -------------------WMGKKVVLLPLG------KNDSKLL-----SPN------------------------SNILDPHISNLLNEFPTLI--EEP
                           W GK + +LPLG      K D+K L     SP                          N++   I  +L EF  ++  + P
Subjt:  -------------------WMGKKVVLLPLG------KNDSKLL-----SPN------------------------SNILDPHISNLLNEFPTLI--EEP

Query:  NSLPPLWNIQHHIDQILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQ
        +SLPPL NIQH ID + G+SLP+LPHYR SPKE + L  ++  LL KGH+Q S+SPCAVPALL PKKDGSWR+CVDSRAINKIT++YRFPIPRL DLLDQ
Subjt:  NSLPPLWNIQHHIDQILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQ

Query:  LGGASSFSKLTLK----------------VVITKLG-----------SNRGMSGERLLKQ-MKAFLNGF-------------------------------
        L GA  FSK+ L+                   TK G           SN   +  RL+ Q ++ F+  F                               
Subjt:  LGGASSFSKLTLK----------------VVITKLG-----------SNRGMSGERLLKQ-MKAFLNGF-------------------------------

Query:  -------------------------PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKR
                                   GI VD  K++A+ +W  PKT  ++RSF  LA+FYR+F+ +FS+I A +T C+K   PFKW ++  +SF  +K 
Subjt:  -------------------------PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKR

Query:  MLSSTPVLGFPDFNKPFEVAVDASGIGIGVVLSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCM
         L++ PVL  P+F+  FE+  DA G GIG VLSQ+  P+ F SEKL+ +RQKWSTYEQELY++VQ++K+WEHYL+ +EFV+  DH +LK+ + Q+ ++ +
Subjt:  MLSSTPVLGFPDFNKPFEVAVDASGIGIGVVLSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCM

Query:  HARWIQFIQRFDFVIKHTPGKANKFADTLSRKGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLL
        HARW  F+++F++VIKH  G +NK AD LSRK TLL  +   ++ F+ +  L  ND DF++ W+         +F +  G+LFK + LCIP+TSL   L+
Subjt:  HARWIQFIQRFDFVIKHTPGKANKFADTLSRKGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLL

Query:  REAHSGGLAGHFGRDKTLALLSSKFFWPQLKKDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAH
        +E H+GGL+ H GRDKT+A + S+F+WPQLK+DV +FV+RC +CQ  KG + N GLY  LP+P + W D+SMDFVLGLPRTQRG +S+ VVVDRFSKMAH
Subjt:  REAHSGGLAGHFGRDKTLALLSSKFFWPQLKKDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAH

Query:  FLACKKTNDALNVANLFLREIVRLHGIPKSIVSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEF
        F+ CKKT+DA ++A LF +E+VRLHG+PKSI SDRD KF+ HFW +LW++ GT+L +S+  HPQTDGQTEV NRTLGN++R L G++P+ WD+ LAQAEF
Subjt:  FLACKKTNDALNVANLFLREIVRLHGIPKSIVSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEF

Query:  AFNNMTNGSTGKCPFEVVYTHVPRLTLDLAKLPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQH
        A+N+  + STG  PF+VVY   PR  +DL  LP     + +A  M + ++  HE V+  +  ++  YK+ ADKH+R   F++GD VM+ L+K R P G +
Subjt:  AFNNMTNGSTGKCPFEVVYTHVPRLTLDLAKLPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQH

Query:  SKLINKKIGPLPILERIGPYAYTIDLPPTVKISNIFNVSDIFPYHAQN
        SKL  KK GP  IL +I   AY +DLP T+ IS  FNVSDI+ +H ++
Subjt:  SKLINKKIGPLPILERIGPYAYTIDLPPTVKISNIFNVSDIFPYHAQN

TrEMBL top hitse value%identityAlignment
A0A5B7BER3 Uncharacterized protein4.1e-28145.86Show/hide
Query:  RSNDTSVA--RRNNNPYNPPTLAKCFRCGQQGHLSNECPQRRTL-AVQEEEDGADGCNSEDNEE-------LAYLQPDDGDQLSCVLQQVLLTPKSESHP
        ++ +T +A  +++ NPY  P   KCFRC Q GH SNECP RR +  V   ED +    +E+  E           + D+G+ +SCV+Q++LL PK E  P
Subjt:  RSNDTSVA--RRNNNPYNPPTLAKCFRCGQQGHLSNECPQRRTL-AVQEEEDGADGCNSEDNEE-------LAYLQPDDGDQLSCVLQQVLLTPKSESHP

Query:  QRHSLFRTRCTINGRVCQIIIDSGSSKNIVSKKLVTALNLKAKAHPIRIGLAGSKKGEKQWL----------------KTKLETLTNTKLHAI-------
        QRH++FRTRCTIN +VC +IIDSGSS+NIVSK LV AL LK + HP    +   KKG +  +                +   + +     H +       
Subjt:  QRHSLFRTRCTINGRVCQIIIDSGSSKNIVSKKLVTALNLKAKAHPIRIGLAGSKKGEKQWL----------------KTKLETLTNTKLHAI-------

Query:  ----------------WMGKKVVLLPLGKND-----------SKLLSPNSNILD------------------------PHI-SNLLNEFP--TLIEEPNS
                        W  KKVVL+P  K             S L    S  ++                        P I   LL EF   T  E P+ 
Subjt:  ----------------WMGKKVVLLPLGKND-----------SKLLSPNSNILD------------------------PHI-SNLLNEFP--TLIEEPNS

Query:  LPPLWNIQHHIDQILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLG
        LPP+ +IQHHID + G+SLP+LPHYR SPKE E L  ++  L++KG +Q S+SPCAVPALL PKKDGSWR+CVDSRAINKIT+KYRFPIPRL+D+LD L 
Subjt:  LPPLWNIQHHIDQILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLG

Query:  GASSFSKLTLK----------------VVITKLG-----------SNRGMSGERLLKQ-MKAFLNGF---------------------------------
        G+  FSK+ L+                   TK G           SN   +  R++ Q +K F+  F                                 
Subjt:  GASSFSKLTLK----------------VVITKLG-----------SNRGMSGERLLKQ-MKAFLNGF---------------------------------

Query:  -----------------------PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRML
                                +GI VD  K+ AI +W  PKT  DIRSF  LA+FYR+FI +FS+I A +T C+K    F+W + Q  SF+ +K  L
Subjt:  -----------------------PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRML

Query:  SSTPVLGFPDFNKPFEVAVDASGIGIGVVLSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHA
        S+ PVL  P F K F+V  DAS  GIG VLSQ+  P+EFFSEKL+ +RQKW+TYE EL+++V++LK WEHYL+ +EFV+  DH +LKF+  Q  +S MH 
Subjt:  SSTPVLGFPDFNKPFEVAVDASGIGIGVVLSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHA

Query:  RWIQFIQRFDFVIKHTPGKANKFADTLSRKGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLRE
        RWI F+QRF FV+KH  G+ NK AD LSR+  LL ++  +I +F+ L EL   D DF+  W        + +FHI  G+LFK + LCIP+TSL E +LR+
Subjt:  RWIQFIQRFDFVIKHTPGKANKFADTLSRKGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLRE

Query:  AHSGGLAGHFGRDKTLALLSSKFFWPQLKKDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFL
         HSGGL GH GRDKT+AL+  +++WPQLK+DV  FV++C ICQ++KG + N GLY+ LP+P +IWEDL+MDF+LGLPRTQRG +S+ VVVDRFSKMAHF+
Subjt:  AHSGGLAGHFGRDKTLALLSSKFFWPQLKKDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFL

Query:  ACKKTNDALNVANLFLREIVRLHGIPKSIVSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAF
         CKKT+DA +VANLF REIVRLHG+PKSI SDRDVKF+ HFWR+LW+KF T+L YS+  HPQTDGQTEVTNRTLGNL+R  SGDRP+QWD+ L Q EFA+
Subjt:  ACKKTNDALNVANLFLREIVRLHGIPKSIVSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAF

Query:  NNMTNGSTGKCPFEVVYTHVPRLTLDLAKLPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSK
        N MTN ST K PFE+VYT  P+  LDLA LP     S  A   ADR  TI EEV+Q+LE  +  YK+ ADKH+R   F  GDLVM+ L+K+R P G ++K
Subjt:  NNMTNGSTGKCPFEVVYTHVPRLTLDLAKLPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSK

Query:  LINKKIGPLPILERIGPYAYTIDLPPTVKISNIFNVSDIFPYHAQNQFTLSQSSRDEFIFWEGKNCCRYFK
        L N+K GP  +  +I   AY ++LP  + IS+ FNV+D+F YH  ++   + +SR  F   EG +     K
Subjt:  LINKKIGPLPILERIGPYAYTIDLPPTVKISNIFNVSDIFPYHAQNQFTLSQSSRDEFIFWEGKNCCRYFK

A0A6D2HLB5 Reverse transcriptase5.4e-26544.76Show/hide
Query:  CFRCGQQGHLSNECPQRRTLA-VQEEEDGADGCNSEDN-EELAYLQPDDGDQLSCVLQQVLLTPKSESHPQRHSLFRTRCTINGRVCQIIIDSGSSKNIV
        C+RC  QGH SN CP RRT+A + EEED     N +D  E + + + +  ++++ VLQ++LL+ K E   QR +LFRT C+I  +VC +I+D+GS++N+V
Subjt:  CFRCGQQGHLSNECPQRRTLA-VQEEEDGADGCNSEDN-EELAYLQPDDGDQLSCVLQQVLLTPKSESHPQRHSLFRTRCTINGRVCQIIIDSGSSKNIV

Query:  SKKLVTALNLKAKAHPIRIGLAGSKKGE-------------------------------------KQWLKTKLETL--TNTKLHAIWMGKKVVLLPL---
        S+KLV  L L  K H     L   KKG                                      + W   K  T    +  +   W   K+ + P+   
Subjt:  SKKLVTALNLKAKAHPIRIGLAGSKKGE-------------------------------------KQWLKTKLETL--TNTKLHAIWMGKKVVLLPL---

Query:  ----GK-------------------NDSKLLSP----------NSNILDP-HISNLLNEFPTLI--EEPNSLPPLWNIQHHIDQILGSSLPHLPHYRTSP
            GK                    ++K L P          N     P  +  +L +F  L   E PN LPP+ +IQHHID I GSSLP+LPHYR SP
Subjt:  ----GK-------------------NDSKLLSP----------NSNILDP-HISNLLNEFPTLI--EEPNSLPPLWNIQHHIDQILGSSLPHLPHYRTSP

Query:  KEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKV-----------------
        KE E L  +I  LL KG ++ S+SPCAVP LL PKK   WR+CVDSRAINKITIKYRFPIPRL D+LD+L G+  FSK+ L+                  
Subjt:  KEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKV-----------------

Query:  ---------VITKLG-SNRGMSGERLLKQM------------------------------------------------------KAFLNGF---PKGISV
                 ++   G SN   +  RL+ Q+                                                      K    GF    +GI V
Subjt:  ---------VITKLG-SNRGMSGERLLKQM------------------------------------------------------KAFLNGF---PKGISV

Query:  DPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDASGIGIGVV
        D  K+ AI +W  PK+  ++RSF  L +FYR+F+  FSTI A +T CLK    F WG +Q +SF+ +K  L + PVL  PDF+K F+V  DASG+GIG V
Subjt:  DPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDASGIGIGVV

Query:  LSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSR
        LSQ+K P+ FFSEKLS +RQ+WSTY+QE Y++ ++L+QWEHYL+ +EF+L  DH +LKFL  QK I+ MHARW+ F+Q+F F+I+H  G  NK AD LSR
Subjt:  LSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSR

Query:  KGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGLAGHFGRDKTLALLSSKFFWPQLK
        + +LL  L  +I+ F+ L EL  +D +FK +W   +   P+ DFHI  GFLFK D LCIP +SL E L+R+ H GGL+GH GRDKT+A L  +++WP L+
Subjt:  KGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGLAGHFGRDKTLALLSSKFFWPQLK

Query:  KDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSI
        +D    VKRCYICQ+SKG S N GLY  LP+P++IW+DLSMDFVLGLPRTQRG +S+ VVVDRFSKM HF+ACKKT DA N+A LF RE+VRLHG+PK+I
Subjt:  KDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSI

Query:  VSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLTLDLAK
        +SDRD KF+ HFW +LW+ FGT L  S+  HPQTDGQTEVTNRTLGN++RS+ GDRP+QWDL L Q EFA+N+  + +TGK PF +VYT VP+  +DL K
Subjt:  VSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLTLDLAK

Query:  LPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVK
        LP    +S  A  MA+ I    E V+  LEAT    K  ADK +R   F+ GD VM+ L+K R P G + KL   K GP  +L +I   AY +DLP  + 
Subjt:  LPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVK

Query:  ISNIFNVSDIFPYHAQNQFTLSQSSR
        ISN FNV+DI  YHA       ++SR
Subjt:  ISNIFNVSDIFPYHAQNQFTLSQSSR

A0A6D2IKM3 Reverse transcriptase5.4e-26544.76Show/hide
Query:  CFRCGQQGHLSNECPQRRTLA-VQEEEDGADGCNSEDN-EELAYLQPDDGDQLSCVLQQVLLTPKSESHPQRHSLFRTRCTINGRVCQIIIDSGSSKNIV
        C+RC  QGH SN CP RRT+A + EEED     N +D  E + + + +  ++++ VLQ++LL+ K E   QR +LFRT C+I  +VC +I+D+GS++N+V
Subjt:  CFRCGQQGHLSNECPQRRTLA-VQEEEDGADGCNSEDN-EELAYLQPDDGDQLSCVLQQVLLTPKSESHPQRHSLFRTRCTINGRVCQIIIDSGSSKNIV

Query:  SKKLVTALNLKAKAHPIRIGLAGSKKGE-------------------------------------KQWLKTKLETL--TNTKLHAIWMGKKVVLLPL---
        S+KLV  L L  K H     L   KKG                                      + W   K  T    +  +   W   K+ + P+   
Subjt:  SKKLVTALNLKAKAHPIRIGLAGSKKGE-------------------------------------KQWLKTKLETL--TNTKLHAIWMGKKVVLLPL---

Query:  ----GK-------------------NDSKLLSP----------NSNILDP-HISNLLNEFPTLI--EEPNSLPPLWNIQHHIDQILGSSLPHLPHYRTSP
            GK                    ++K L P          N     P  +  +L +F  L   E PN LPP+ +IQHHID I GSSLP+LPHYR SP
Subjt:  ----GK-------------------NDSKLLSP----------NSNILDP-HISNLLNEFPTLI--EEPNSLPPLWNIQHHIDQILGSSLPHLPHYRTSP

Query:  KEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKV-----------------
        KE E L  +I  LL KG ++ S+SPCAVP LL PKK   WR+CVDSRAINKITIKYRFPIPRL D+LD+L G+  FSK+ L+                  
Subjt:  KEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKV-----------------

Query:  ---------VITKLG-SNRGMSGERLLKQM------------------------------------------------------KAFLNGF---PKGISV
                 ++   G SN   +  RL+ Q+                                                      K    GF    +GI V
Subjt:  ---------VITKLG-SNRGMSGERLLKQM------------------------------------------------------KAFLNGF---PKGISV

Query:  DPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDASGIGIGVV
        D  K+ AI +W  PK+  ++RSF  L +FYR+F+  FSTI A +T CLK    F WG +Q +SF+ +K  L + PVL  PDF+K F+V  DASG+GIG V
Subjt:  DPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDASGIGIGVV

Query:  LSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSR
        LSQ+K P+ FFSEKLS +RQ+WSTY+QE Y++ ++L+QWEHYL+ +EF+L  DH +LKFL  QK I+ MHARW+ F+Q+F F+I+H  G  NK AD LSR
Subjt:  LSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSR

Query:  KGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGLAGHFGRDKTLALLSSKFFWPQLK
        + +LL  L  +I+ F+ L EL  +D +FK +W   +   P+ DFHI  GFLFK D LCIP +SL E L+R+ H GGL+GH GRDKT+A L  +++WP L+
Subjt:  KGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGLAGHFGRDKTLALLSSKFFWPQLK

Query:  KDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSI
        +D    VKRCYICQ+SKG S N GLY  LP+P++IW+DLSMDFVLGLPRTQRG +S+ VVVDRFSKM HF+ACKKT DA N+A LF RE+VRLHG+PK+I
Subjt:  KDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSI

Query:  VSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLTLDLAK
        +SDRD KF+ HFW +LW+ FGT L  S+  HPQTDGQTEVTNRTLGN++RS+ GDRP+QWDL L Q EFA+N+  + +TGK PF +VYT VP+  +DL K
Subjt:  VSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLTLDLAK

Query:  LPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVK
        LP    +S  A  MA+ I    E V+  LEAT    K  ADK +R   F+ GD VM+ L+K R P G + KL   K GP  +L +I   AY +DLP  + 
Subjt:  LPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVK

Query:  ISNIFNVSDIFPYHAQNQFTLSQSSR
        ISN FNV+DI  YHA       ++SR
Subjt:  ISNIFNVSDIFPYHAQNQFTLSQSSR

A0A6N2LVR1 Uncharacterized protein6.6e-27145.13Show/hide
Query:  NPYNPPTLAKCFRCGQQGHLSNECPQRR--TLAVQEEEDGADGCNSEDNEELAYLQP-----DDGDQLS--CVLQQVLLTPKSESHPQRHSLFRTRCTIN
        N Y+ PT  KC+RCGQQGH SN CP+R    L  Q EE   +G  +E  ++  Y +      D+G+ LS   V+++++L PK E+  QR+++FRTRCT+N
Subjt:  NPYNPPTLAKCFRCGQQGHLSNECPQRR--TLAVQEEEDGADGCNSEDNEELAYLQP-----DDGDQLS--CVLQQVLLTPKSESHPQRHSLFRTRCTIN

Query:  GRVCQIIIDSGSSKNIVSKKLVTALNLKAKAH--PIRIGLAGSKKGEKQWLKTKLET-LTNT-----------------------KLHAI----WM----
         +VC +IIDSGSS+NI+SK +V  L LK + H  P +IG          W+K   ET +T T                         H I    W     
Subjt:  GRVCQIIIDSGSSKNIVSKKLVTALNLKAKAH--PIRIGLAGSKKGEKQWLKTKLET-LTNT-----------------------KLHAI----WM----

Query:  ---------------GKKVVLLPLGKN---------------------------DSKL--------LSPNSNILDPHISNLLNEFPTLIEE--PNSLPPL
                       G+KV+L PL ++                           DS++        +  NS  +   +  LL EF T+I E  P  LPP+
Subjt:  ---------------GKKVVLLPLGKN---------------------------DSKL--------LSPNSNILDPHISNLLNEFPTLIEE--PNSLPPL

Query:  WNIQHHIDQILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLGGASS
         +IQHHID I G+SLP+ PHYR SP+E   L  ++  L+ KG +Q S+SPCAVPALL PKKDGSWR+C+DSRAINKITIKYRFPIPRL D+LD L G+  
Subjt:  WNIQHHIDQILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLGGASS

Query:  FSKLTLK----------------VVITKLG-----------SNRGMSGERLLKQM----------------------------------------KAFLN
        FSK+ L+                   TK G           SN   +  RL+ Q+                                        K F+N
Subjt:  FSKLTLK----------------VVITKLG-----------SNRGMSGERLLKQM----------------------------------------KAFLN

Query:  --------------GF---PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTP
                      GF     GI VD  K+ AI +W  PK   ++RSF  LA+FYR+F+  FS I A +T C+K    F WG +   SF+ +K  L+S P
Subjt:  --------------GF---PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTP

Query:  VLGFPDFNKPFEVAVDASGIGIGVVLSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHARWIQ
        VL  PDF+K FEV  DAS IGIG VLSQ+  P+ F+SEKLS +R+KWSTYE ELY++ +++K WEHYL+ +EF+L  DH +LKF+  Q  ++ MHARW+ 
Subjt:  VLGFPDFNKPFEVAVDASGIGIGVVLSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHARWIQ

Query:  FIQRFDFVIKHTPGKANKFADTLSRKGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSG
        FIQRF+F +KH  G+ NK AD LSRK +LLT L+ ++I F+ + +L A D DF N W      +  +  H   G+LF+ + LCIP++SL E ++ E H G
Subjt:  FIQRFDFVIKHTPGKANKFADTLSRKGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSG

Query:  GLAGHFGRDKTLALLSSKFFWPQLKKDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKK
        GL GH GRDKT+AL   +++WPQLK+D+ N VKRC  CQ+SKG + N GLY  LPIP   WEDLSMDF+LGLPRTQRG +S+ VVVDRFSKMAHF+ACKK
Subjt:  GLAGHFGRDKTLALLSSKFFWPQLKKDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKK

Query:  TNDALNVANLFLREIVRLHGIPKSIVSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMT
        T+DA++VANLF +E+VRLHG+PKSI SDRD KF+ HFWR+LW++F T L +S+ +HPQTDGQTEV NRTLGNL+R LSG+RP+QWDL LAQAEFA+N+M 
Subjt:  TNDALNVANLFLREIVRLHGIPKSIVSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMT

Query:  NGSTGKCPFEVVYTHVPRLTLDLAKLPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINK
        N STGK PF+VVY   P+  LDL  LP    ++  A  MADR++ I EEV+++LEA++  YK+ ADK +R   F+ GDLVM++L+K R+P G   KL +K
Subjt:  NGSTGKCPFEVVYTHVPRLTLDLAKLPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINK

Query:  KIGPLPILERIGPYAYTIDLPPTVKISNIFNVSDIFPYHAQNQFTLSQSSRDEFIFWEG
        K GP  IL++I   AY +DLP  + IS  FNV+D+F YH  ++   + S+     F +G
Subjt:  KIGPLPILERIGPYAYTIDLPPTVKISNIFNVSDIFPYHAQNQFTLSQSSRDEFIFWEG

M5WCC7 Reverse transcriptase6.4e-26644.55Show/hide
Query:  RNNNPYNPPTLAKCFRCGQQGHLSNECPQRRTLAVQEEEDGADGCNSEDNEEL-------AYLQPDDG-DQLSCVLQQVLLTPKSESHPQRHSLFRTRCT
        ++ N Y  P    C+RC + GH SN CP+ +     EE D       E+N+E+       A    ++G ++++ VLQ+VLL P+ E   QRHS+FR+ C+
Subjt:  RNNNPYNPPTLAKCFRCGQQGHLSNECPQRRTLAVQEEEDGADGCNSEDNEEL-------AYLQPDDG-DQLSCVLQQVLLTPKSESHPQRHSLFRTRCT

Query:  INGRVCQIIIDSGSSKNIVSKKLVTALNLKAKAHPIRIGLAGSKKGE-------------------------------------KQW---LKTKLETLTN
        I  +VC +I+D+GS +N VSKKLV  L L  + H     L   KKG                                      + W   +    +   N
Subjt:  INGRVCQIIIDSGSSKNIVSKKLVTALNLKAKAHPIRIGLAGSKKGE-------------------------------------KQW---LKTKLETLTN

Query:  TKLHAIWMGKKVVL-------------------LPLGKNDSKL-----LSPNSNILDPHISNLLNEFPTLIEE--PNSLPPLWNIQHHIDQILGSSLPHL
          L + W  +K+ +                   L L  N+ +L      +     +   +  +L++F  L  E  PN LPP+ +IQH ID + G+SL +L
Subjt:  TKLHAIWMGKKVVL-------------------LPLGKNDSKL-----LSPNSNILDPHISNLLNEFPTLIEE--PNSLPPLWNIQHHIDQILGSSLPHL

Query:  PHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKV----------
        PHYR SPKE + L  +I  LL KG ++ SLSPCAVP LL PKKD +WR+CVDSRAINKIT+KYRFPIPRL D+LD L G+  FSK+ L+           
Subjt:  PHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKV----------

Query:  ----------------VITKLG-SNRGMSGERLLKQM----------------------------------------KAFLN--------------GF--
                        ++   G SN   +  RL+ Q+                                        K F+N              GF  
Subjt:  ----------------VITKLG-SNRGMSGERLLKQM----------------------------------------KAFLN--------------GF--

Query:  -PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDAS
           GI VD  KI+AI++W  PKT  ++RSF  LA+FYR+F+ HFS+I A +T CLK    F WGE+Q +SF+++K  L + PVL  P+F K FEV  DAS
Subjt:  -PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDAS

Query:  GIGIGVVLSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANK
        G+G+G VLSQDK P+ FFSEKLS +RQKWSTY+QE Y++V++LKQWEHYL+ KEFVL  DH +LK++  QK I  MHARW+ F+Q+F FVIKHT GK N+
Subjt:  GIGIGVVLSQDKHPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANK

Query:  FADTLSRKGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGLAGHFGRDKTLALLSSK
         AD LSR+ +LL  L  +++ F+ L EL   D DF  IW   +N  P  D+ +  G+LFK + LCIP +SL E L+R+ H GGL+GH GRDKT+A +  +
Subjt:  FADTLSRKGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGLAGHFGRDKTLALLSSK

Query:  FFWPQLKKDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRL
        F+WPQLK+DV   V++CY CQ+SKG   N GLY  LP+PN+IW+DL+MDFVLGLPRTQRG +S+ VVVDRFSKMAHF+AC+KT DA N+A LF RE+VRL
Subjt:  FFWPQLKKDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRL

Query:  HGIPKSIVSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPR
        HG+P SI SDRD KF+ HFW +LW+ FGT L  S+  HPQTDGQTEVTNRTLGN++RS+ G++P+QWD  L Q EFA+N+  + +TGK PF +VYT +P 
Subjt:  HGIPKSIVSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPR

Query:  LTLDLAKLPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTI
          +DL KLP     S  A  +A+ +  + +EV+Q LE T+  YK+ ADKH+R   FQ GD VMI L+K R P G +SKL  KK GP  +L+RI   AY I
Subjt:  LTLDLAKLPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTI

Query:  DLPPTVKISNIFNVSDIFPY
        +LP ++ ISNIFNV+D++ +
Subjt:  DLPPTVKISNIFNVSDIFPY

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.4e-9227.56Show/hide
Query:  SNIL-DPHISNLLNEFPTLIEEPNS--LP-PLWNIQHHIDQILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLC
        SNI+ +P + ++  EF  +  E N+  LP P+  ++  ++    +    + +Y   P + +A+++EI+  L  G ++ S +  A P +  PKK+G+ R+ 
Subjt:  SNIL-DPHISNLLNEFPTLIEEPNS--LP-PLWNIQHHIDQILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLC

Query:  VDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKVVITKLGSNRG------------------------------------MSGE----------
        VD + +NK      +P+P +  LL ++ G++ F+KL LK     +   +G                                    + GE          
Subjt:  VDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKVVITKLGSNRG------------------------------------MSGE----------

Query:  ---------------------RLLKQMKAFLN--------------GF---PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAAL
                             + LK     +N              G+    KG +     I+ +++W +PK  K++R FL   ++ RKFIP  S +   
Subjt:  ---------------------RLLKQMKAFLN--------------GF---PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAAL

Query:  MTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDASGIGIGVVLSQ----DK-HPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQ
        +   LK ++ +KW   Q+Q+   +K+ L S PVL   DF+K   +  DAS + +G VLSQ    DK +P+ ++S K+S ++  +S  ++E+ ++++SLK 
Subjt:  MTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDASGIGIGVVLSQ----DK-HPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQ

Query:  WEHYLLG--KEFVLLIDHFSL--KFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSR------------KGTLLTLLRGKIIAFDHLPELC
        W HYL    + F +L DH +L  +     +  +   ARW  F+Q F+F I + PG AN  AD LSR            +   +  +    I  D   ++ 
Subjt:  WEHYLLG--KEFVLLIDHFSL--KFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSR------------KGTLLTLLRGKIIAFDHLPELC

Query:  ---ANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKN-DTLCIP-QTSLHESLLREAHSGGLAGHFGRDKTLALLSSKFFWPQLKKDVANFVKRCYICQSSK
            ND    N+       +  ++  +  G L  + D + +P  T L  +++++ H  G   H G +    ++  +F W  ++K +  +V+ C+ CQ +K
Subjt:  ---ANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKN-DTLCIP-QTSLHESLLREAHSGGLAGHFGRDKTLALLSSKFFWPQLKKDVANFVKRCYICQSSK

Query:  GASLNQGLYSTL-PIP--NNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSIVSDRDVKFMGHFWR
          S N   Y  L PIP     WE LSMDF+  LP +  G+ ++ VVVDRFSKMA  + C K+  A   A +F + ++   G PK I++D D  F    W+
Subjt:  GASLNQGLYSTL-PIP--NNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSIVSDRDVKFMGHFWR

Query:  SLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLT-LDLAKLPIYVDLSAEATI
            K+   + +S    PQTDGQTE TN+T+  LLR +    P  W  +++  + ++NN  + +T   PFE+V+ + P L+ L+L       D +++ TI
Subjt:  SLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLT-LDLAKLPIYVDLSAEATI

Query:  MADRIKTIHEEVQQHLEATDPSYKSKAD-KHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVK--ISNIFNVSDI
               + + V++HL   +   K   D K +   EFQ GDLVM+   K+     + +KL     GP  +L++ GP  Y +DLP ++K   S+ F+VS +
Subjt:  MADRIKTIHEEVQQHLEATDPSYKSKAD-KHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVK--ISNIFNVSDI

Query:  FPYHAQNQFTLSQSSRDE
          Y   ++  L+ ++ DE
Subjt:  FPYHAQNQFTLSQSSRDE

P0CT41 Transposon Tf2-12 polyprotein2.4e-9227.56Show/hide
Query:  SNIL-DPHISNLLNEFPTLIEEPNS--LP-PLWNIQHHIDQILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLC
        SNI+ +P + ++  EF  +  E N+  LP P+  ++  ++    +    + +Y   P + +A+++EI+  L  G ++ S +  A P +  PKK+G+ R+ 
Subjt:  SNIL-DPHISNLLNEFPTLIEEPNS--LP-PLWNIQHHIDQILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLC

Query:  VDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKVVITKLGSNRG------------------------------------MSGE----------
        VD + +NK      +P+P +  LL ++ G++ F+KL LK     +   +G                                    + GE          
Subjt:  VDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKVVITKLGSNRG------------------------------------MSGE----------

Query:  ---------------------RLLKQMKAFLN--------------GF---PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAAL
                             + LK     +N              G+    KG +     I+ +++W +PK  K++R FL   ++ RKFIP  S +   
Subjt:  ---------------------RLLKQMKAFLN--------------GF---PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAAL

Query:  MTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDASGIGIGVVLSQ----DK-HPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQ
        +   LK ++ +KW   Q+Q+   +K+ L S PVL   DF+K   +  DAS + +G VLSQ    DK +P+ ++S K+S ++  +S  ++E+ ++++SLK 
Subjt:  MTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDASGIGIGVVLSQ----DK-HPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQ

Query:  WEHYLLG--KEFVLLIDHFSL--KFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSR------------KGTLLTLLRGKIIAFDHLPELC
        W HYL    + F +L DH +L  +     +  +   ARW  F+Q F+F I + PG AN  AD LSR            +   +  +    I  D   ++ 
Subjt:  WEHYLLG--KEFVLLIDHFSL--KFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSR------------KGTLLTLLRGKIIAFDHLPELC

Query:  ---ANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKN-DTLCIP-QTSLHESLLREAHSGGLAGHFGRDKTLALLSSKFFWPQLKKDVANFVKRCYICQSSK
            ND    N+       +  ++  +  G L  + D + +P  T L  +++++ H  G   H G +    ++  +F W  ++K +  +V+ C+ CQ +K
Subjt:  ---ANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKN-DTLCIP-QTSLHESLLREAHSGGLAGHFGRDKTLALLSSKFFWPQLKKDVANFVKRCYICQSSK

Query:  GASLNQGLYSTL-PIP--NNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSIVSDRDVKFMGHFWR
          S N   Y  L PIP     WE LSMDF+  LP +  G+ ++ VVVDRFSKMA  + C K+  A   A +F + ++   G PK I++D D  F    W+
Subjt:  GASLNQGLYSTL-PIP--NNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSIVSDRDVKFMGHFWR

Query:  SLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLT-LDLAKLPIYVDLSAEATI
            K+   + +S    PQTDGQTE TN+T+  LLR +    P  W  +++  + ++NN  + +T   PFE+V+ + P L+ L+L       D +++ TI
Subjt:  SLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLT-LDLAKLPIYVDLSAEATI

Query:  MADRIKTIHEEVQQHLEATDPSYKSKAD-KHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVK--ISNIFNVSDI
               + + V++HL   +   K   D K +   EFQ GDLVM+   K+     + +KL     GP  +L++ GP  Y +DLP ++K   S+ F+VS +
Subjt:  MADRIKTIHEEVQQHLEATDPSYKSKAD-KHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVK--ISNIFNVSDI

Query:  FPYHAQNQFTLSQSSRDE
          Y   ++  L+ ++ DE
Subjt:  FPYHAQNQFTLSQSSRDE

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein6.5e-9828.91Show/hide
Query:  NEFPTLIEEPNSLPPLWNIQHHIDQILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFP
        N+ P    + N++P    ++H I+   G+ LP L  Y  + K  + ++  +  LLD   + PS SPC+ P +L PKKDG++RLCVD R +NK TI   FP
Subjt:  NEFPTLIEEPNSLPPLWNIQHHIDQILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFP

Query:  IPRLSDLLDQLGGASSFSKLTL----------------KVVITKLGSN----------------------------------------------------
        +PR+ +LL ++G A  F+ L L                   +T  G                                                      
Subjt:  IPRLSDLLDQLGGASSFSKLTL----------------KVVITKLGSN----------------------------------------------------

Query:  ---RGMSGERLLKQMK---------AFLNGFPKGI-SVDP--RKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIA--ALMTACLKANIPFKWGE
             +  E L+ + K          FL G+  GI  + P   K  AI ++  PKT K  + FL + ++YR+FIP+ S IA    +  C K+    +W E
Subjt:  ---RGMSGERLLKQMK---------AFLNGFPKGI-SVDP--RKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIA--ALMTACLKANIPFKWGE

Query:  KQSQSFSELKRMLSSTPVLGFPDFNKP-FEVAVDASGIGIGVVLSQDKHP------IEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLL
        KQ ++  +LK  L ++PVL  P  NK  + +  DAS  GIG VL +  +       + +FS+ L  +++ +   E EL  ++++L  + + L GK F L 
Subjt:  KQSQSFSELKRMLSSTPVLGFPDFNKP-FEVAVDASGIGIGVVLSQDKHP------IEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLL

Query:  IDHFSLKFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSRKGTLLTLLRGKIIAFD-----------------HLPELCANDL--DFKNIW
         DH SL  L+ + E +    RW+  +  +DF +++  G  N  AD +SR    +T    + I  +                 H+ EL  +++  +  + +
Subjt:  IDHFSLKFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSRKGTLLTLLRGKIIAFD-----------------HLPELCANDL--DFKNIW

Query:  QAWSNHIP-----AKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGL-AGHFGRDKTLALLSSKFFWPQLKKDVANFVKRCYICQSSKGASLN-QGL
        +++   +       K++ +    ++  D L +P      +++R  H   L  GHFG   TLA +S  ++WP+L+  +  +++ C  CQ  K       GL
Subjt:  QAWSNHIP-----AKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGL-AGHFGRDKTLALLSSKFFWPQLKKDVANFVKRCYICQSSKGASLN-QGL

Query:  YSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSIVSDRDVKFMGHFWRSLWKKFGTNLL
           LPI    W D+SMDFV GLP T      ILVVVDRFSK AHF+A +KT DA  + +L  R I   HG P++I SDRDV+     ++ L K+ G    
Subjt:  YSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSIVSDRDVKFMGHFWRSLWKKFGTNLL

Query:  YSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLTLDLAKLPIYVDLSAEATIMADRIKTIHEEV
         S+ NHPQTDGQ+E T +TL  LLR+      + W +YL Q EF +N+    + GK PFE+   ++P      +   +    S  A  +A  +K +  + 
Subjt:  YSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLTLDLAKLPIYVDLSAEATIMADRIKTIHEEV

Query:  QQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVKISNIFNV
        ++ LE      ++  ++ ++ +   IGD V++H + +    G + K+    +GP  ++++I   AY +DL    K   + NV
Subjt:  QQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVKISNIFNV

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.9e-9828.91Show/hide
Query:  NEFPTLIEEPNSLPPLWNIQHHIDQILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFP
        N+ P    + N++P    ++H I+   G+ LP L  Y  + K  + ++  +  LLD   + PS SPC+ P +L PKKDG++RLCVD R +NK TI   FP
Subjt:  NEFPTLIEEPNSLPPLWNIQHHIDQILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFP

Query:  IPRLSDLLDQLGGASSFSKLTL----------------KVVITKLGSN----------------------------------------------------
        +PR+ +LL ++G A  F+ L L                   +T  G                                                      
Subjt:  IPRLSDLLDQLGGASSFSKLTL----------------KVVITKLGSN----------------------------------------------------

Query:  ---RGMSGERLLKQMK---------AFLNGFPKGI-SVDP--RKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIA--ALMTACLKANIPFKWGE
             +  E L+ + K          FL G+  GI  + P   K  AI ++  PKT K  + FL + ++YR+FIP+ S IA    +  C K+    +W E
Subjt:  ---RGMSGERLLKQMK---------AFLNGFPKGI-SVDP--RKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIA--ALMTACLKANIPFKWGE

Query:  KQSQSFSELKRMLSSTPVLGFPDFNKP-FEVAVDASGIGIGVVLSQDKHP------IEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLL
        KQ ++  +LK  L ++PVL  P  NK  + +  DAS  GIG VL +  +       + +FS+ L  +++ +   E EL  ++++L  + + L GK F L 
Subjt:  KQSQSFSELKRMLSSTPVLGFPDFNKP-FEVAVDASGIGIGVVLSQDKHP------IEFFSEKLSPSRQKWSTYEQELYSLVQSLKQWEHYLLGKEFVLL

Query:  IDHFSLKFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSRKGTLLTLLRGKIIAFD-----------------HLPELCANDL--DFKNIW
         DH SL  L+ + E +    RW+  +  +DF +++  G  N  AD +SR    +T    + I  +                 H+ EL  +++  +  + +
Subjt:  IDHFSLKFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSRKGTLLTLLRGKIIAFD-----------------HLPELCANDL--DFKNIW

Query:  QAWSNHIP-----AKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGL-AGHFGRDKTLALLSSKFFWPQLKKDVANFVKRCYICQSSKGASLN-QGL
        +++   +       K++ +    ++  D L +P      +++R  H   L  GHFG   TLA +S  ++WP+L+  +  +++ C  CQ  K       GL
Subjt:  QAWSNHIP-----AKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGL-AGHFGRDKTLALLSSKFFWPQLKKDVANFVKRCYICQSSKGASLN-QGL

Query:  YSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSIVSDRDVKFMGHFWRSLWKKFGTNLL
           LPI    W D+SMDFV GLP T      ILVVVDRFSK AHF+A +KT DA  + +L  R I   HG P++I SDRDV+     ++ L K+ G    
Subjt:  YSTLPIPNNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSIVSDRDVKFMGHFWRSLWKKFGTNLL

Query:  YSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLTLDLAKLPIYVDLSAEATIMADRIKTIHEEV
         S+ NHPQTDGQ+E T +TL  LLR+ +    + W +YL Q EF +N+    + GK PFE+   ++P      +   +    S  A  +A  +K +  + 
Subjt:  YSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLTLDLAKLPIYVDLSAEATIMADRIKTIHEEV

Query:  QQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVKISNIFNV
        ++ LE      ++  ++ ++ +   IGD V++H + +    G + K+    +GP  ++++I   AY +DL    K   + NV
Subjt:  QQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVKISNIFNV

Q9UR07 Transposon Tf2-11 polyprotein2.4e-9227.56Show/hide
Query:  SNIL-DPHISNLLNEFPTLIEEPNS--LP-PLWNIQHHIDQILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLC
        SNI+ +P + ++  EF  +  E N+  LP P+  ++  ++    +    + +Y   P + +A+++EI+  L  G ++ S +  A P +  PKK+G+ R+ 
Subjt:  SNIL-DPHISNLLNEFPTLIEEPNS--LP-PLWNIQHHIDQILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHLQPSLSPCAVPALLAPKKDGSWRLC

Query:  VDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKVVITKLGSNRG------------------------------------MSGE----------
        VD + +NK      +P+P +  LL ++ G++ F+KL LK     +   +G                                    + GE          
Subjt:  VDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKVVITKLGSNRG------------------------------------MSGE----------

Query:  ---------------------RLLKQMKAFLN--------------GF---PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAAL
                             + LK     +N              G+    KG +     I+ +++W +PK  K++R FL   ++ RKFIP  S +   
Subjt:  ---------------------RLLKQMKAFLN--------------GF---PKGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAAL

Query:  MTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDASGIGIGVVLSQ----DK-HPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQ
        +   LK ++ +KW   Q+Q+   +K+ L S PVL   DF+K   +  DAS + +G VLSQ    DK +P+ ++S K+S ++  +S  ++E+ ++++SLK 
Subjt:  MTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDASGIGIGVVLSQ----DK-HPIEFFSEKLSPSRQKWSTYEQELYSLVQSLKQ

Query:  WEHYLLG--KEFVLLIDHFSL--KFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSR------------KGTLLTLLRGKIIAFDHLPELC
        W HYL    + F +L DH +L  +     +  +   ARW  F+Q F+F I + PG AN  AD LSR            +   +  +    I  D   ++ 
Subjt:  WEHYLLG--KEFVLLIDHFSL--KFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSR------------KGTLLTLLRGKIIAFDHLPELC

Query:  ---ANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKN-DTLCIP-QTSLHESLLREAHSGGLAGHFGRDKTLALLSSKFFWPQLKKDVANFVKRCYICQSSK
            ND    N+       +  ++  +  G L  + D + +P  T L  +++++ H  G   H G +    ++  +F W  ++K +  +V+ C+ CQ +K
Subjt:  ---ANDLDFKNIWQAWSNHIPAKDFHIFYGFLFKN-DTLCIP-QTSLHESLLREAHSGGLAGHFGRDKTLALLSSKFFWPQLKKDVANFVKRCYICQSSK

Query:  GASLNQGLYSTL-PIP--NNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSIVSDRDVKFMGHFWR
          S N   Y  L PIP     WE LSMDF+  LP +  G+ ++ VVVDRFSKMA  + C K+  A   A +F + ++   G PK I++D D  F    W+
Subjt:  GASLNQGLYSTL-PIP--NNIWEDLSMDFVLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSIVSDRDVKFMGHFWR

Query:  SLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLT-LDLAKLPIYVDLSAEATI
            K+   + +S    PQTDGQTE TN+T+  LLR +    P  W  +++  + ++NN  + +T   PFE+V+ + P L+ L+L       D +++ TI
Subjt:  SLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLSGDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLT-LDLAKLPIYVDLSAEATI

Query:  MADRIKTIHEEVQQHLEATDPSYKSKAD-KHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVK--ISNIFNVSDI
               + + V++HL   +   K   D K +   EFQ GDLVM+   K+     + +KL     GP  +L++ GP  Y +DLP ++K   S+ F+VS +
Subjt:  MADRIKTIHEEVQQHLEATDPSYKSKAD-KHKRAVEFQIGDLVMIHLKKSRLPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVK--ISNIFNVSDI

Query:  FPYHAQNQFTLSQSSRDE
          Y   ++  L+ ++ DE
Subjt:  FPYHAQNQFTLSQSSRDE

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.1e-1541.05Show/hide
Query:  KGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAV
        +G+S DP K+EA+V W EPK T ++R FL L  +YR+F+ ++  I   +T  LK N   KW E  + +F  LK  +++ PVL  PD   PF   V
Subjt:  KGISVDPRKIEAIVEWSEPKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGATTTCTTCTAATCATGTCGACATTCTTTCTAGTAACCTAGGAATACCACCAATGGAATCTTCAATAGACAGCAAGCATCCGGTTAAAGTTCTTGGTGATATGGC
AGTGGTAGACTCCGCTTCTTTACTAAACCATATCTCTGGAAATACATTGAACCAACATTCCAAGAAAGATGATTCAATTCTAAAACAGCAACCTATTCGAAGCAATGATA
CAAGCGTTGCCCGAAGAAACAATAATCCCTACAATCCACCAACATTAGCAAAGTGCTTTCGTTGTGGACAGCAAGGCCACCTTTCTAATGAATGTCCCCAACGCAGGACG
TTGGCTGTACAAGAAGAAGAGGATGGAGCAGACGGCTGCAATTCAGAGGATAATGAAGAATTGGCTTACTTACAACCTGACGATGGTGATCAATTGTCTTGTGTGCTTCA
ACAAGTGCTCCTCACACCTAAATCTGAAAGCCACCCTCAGAGACACTCCTTATTCCGGACAAGATGTACTATAAATGGTCGGGTGTGTCAAATAATTATAGACAGTGGCA
GCAGCAAGAATATTGTGTCAAAAAAGTTGGTTACTGCACTCAATTTGAAGGCTAAAGCACACCCAATCCGTATAGGGTTAGCTGGATCAAAAAAGGGGGAGAAACAATGG
TTAAAGACCAAATTGGAAACACTTACCAATACCAAATTACATGCGATATGGATGGGCAAGAAGGTGGTTCTCCTTCCCCTCGGTAAGAATGACAGCAAGCTCTTAAGTCC
AAATTCTAACATCTTAGATCCTCACATATCCAACCTATTAAATGAATTCCCTACTCTAATAGAAGAACCCAATTCTCTGCCACCACTTTGGAACATTCAACATCATATAG
ATCAAATACTAGGGAGCTCCTTACCTCATTTGCCACATTATCGCACGAGTCCAAAAGAATATGAAGCTTTGCACAATGAAATCCACACCTTATTAGATAAAGGACATTTA
CAACCAAGCCTTAGCCCATGTGCTGTCCCTGCTCTACTGGCACCAAAAAAAGATGGATCTTGGAGGCTTTGTGTGGATAGTAGAGCCATTAACAAAATAACTATAAAATA
TAGGTTCCCAATACCTCGACTCTCTGATTTGCTAGACCAACTAGGAGGAGCCTCCAGTTTTTCTAAATTGACCTTAAAAGTGGTTATCACCAAATTAGGATCCAATCGGG
GGATGAGTGGAGAACGACTTTTAAAACAAATGAAGGCCTTTTTGAATGGCTTCCCAAAGGGAATTAGTGTAGACCCGAGAAAAATAGAGGCGATAGTTGAATGGTCAGAA
CCAAAAACTACCAAAGACATCCGAAGCTTTCTTTGTTTAGCTTCATTTTATAGAAAATTCATCCCACATTTTAGCACCATTGCAGCACTAATGACAGCTTGTTTAAAGGC
CAATATTCCATTCAAATGGGGGGAGAAACAATCACAAAGTTTTTCTGAATTAAAGAGAATGCTTAGCTCAACTCCAGTCCTAGGATTTCCAGACTTTAATAAACCATTTG
AGGTCGCAGTAGATGCTTCTGGAATCGGTATTGGAGTTGTGCTAAGTCAAGACAAGCACCCTATAGAATTTTTTAGTGAAAAATTGAGCCCTTCCAGACAAAAATGGTCA
ACCTATGAACAAGAACTCTACTCTCTTGTTCAATCTTTAAAACAGTGGGAACATTACTTGCTTGGGAAAGAATTTGTGTTGCTAATAGATCATTTTTCTTTGAAATTTTT
GGAATTTCAAAAGGAAATTAGCTGTATGCATGCTAGGTGGATCCAATTCATTCAAAGGTTTGACTTCGTCATTAAACACACTCCAGGTAAAGCTAATAAATTTGCAGACA
CATTAAGCAGAAAAGGTACACTTCTAACTTTACTTCGTGGAAAAATTATAGCTTTTGATCATCTACCTGAATTATGTGCTAATGATTTGGATTTCAAAAATATTTGGCAA
GCATGGTCAAATCATATTCCAGCGAAAGATTTCCATATTTTTTATGGGTTTTTATTTAAAAATGACACCTTATGTATCCCTCAAACATCTCTTCATGAATCATTGTTACG
AGAAGCCCATTCGGGAGGTCTTGCTGGCCACTTTGGTCGAGATAAAACTTTGGCTCTACTTTCCTCTAAATTTTTCTGGCCACAACTTAAAAAGGATGTTGCTAATTTTG
TTAAACGTTGCTATATTTGTCAATCTTCCAAGGGTGCTAGTTTAAACCAAGGTCTGTACTCCACGTTACCAATACCAAACAATATTTGGGAGGATCTCTCTATGGATTTC
GTCCTTGGTTTACCTAGGACCCAAAGAGGCTTTGAATCCATTTTAGTGGTTGTTGACCGCTTTAGCAAAATGGCACATTTTTTAGCTTGCAAAAAGACTAATGATGCTTT
AAATGTTGCCAATCTTTTTTTAAGGGAAATTGTCCGTCTTCATGGTATACCTAAAAGCATAGTATCAGACAGGGATGTCAAATTCATGGGTCATTTTTGGCGATCATTAT
GGAAAAAATTTGGAACAAACCTCCTATATAGCACAATAAACCATCCTCAAACTGATGGACAAACAGAGGTTACAAACCGCACACTCGGCAATCTTTTAAGAAGTCTAAGT
GGAGATAGACCTCGCCAATGGGATCTTTACCTTGCTCAAGCAGAGTTTGCATTCAACAATATGACTAATGGCTCAACAGGGAAGTGCCCATTTGAAGTTGTATATACTCA
TGTTCCTAGATTAACATTAGATCTTGCTAAACTACCCATCTACGTTGATCTTAGTGCCGAGGCTACCATTATGGCTGATCGAATAAAAACAATTCATGAAGAAGTTCAAC
AACACCTAGAAGCAACCGACCCTTCTTATAAGTCCAAAGCAGATAAGCACAAACGAGCTGTGGAATTCCAGATTGGAGATTTAGTAATGATCCACTTGAAGAAAAGTAGA
CTACCCACTGGCCAACACTCCAAACTCATAAATAAGAAGATAGGACCACTTCCTATTCTAGAAAGAATTGGCCCTTATGCATACACGATTGATCTTCCTCCAACAGTGAA
GATCAGCAACATTTTCAATGTCTCAGACATATTCCCATATCATGCTCAAAACCAATTCACTTTATCACAATCGTCCCGGGACGAGTTTATTTTCTGGGAAGGGAAGAACT
GTTGTAGATATTTTAAGTATTATTTTACAATTATTTTCAGTTTCCAAACTTATGTATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGATTTCTTCTAATCATGTCGACATTCTTTCTAGTAACCTAGGAATACCACCAATGGAATCTTCAATAGACAGCAAGCATCCGGTTAAAGTTCTTGGTGATATGGC
AGTGGTAGACTCCGCTTCTTTACTAAACCATATCTCTGGAAATACATTGAACCAACATTCCAAGAAAGATGATTCAATTCTAAAACAGCAACCTATTCGAAGCAATGATA
CAAGCGTTGCCCGAAGAAACAATAATCCCTACAATCCACCAACATTAGCAAAGTGCTTTCGTTGTGGACAGCAAGGCCACCTTTCTAATGAATGTCCCCAACGCAGGACG
TTGGCTGTACAAGAAGAAGAGGATGGAGCAGACGGCTGCAATTCAGAGGATAATGAAGAATTGGCTTACTTACAACCTGACGATGGTGATCAATTGTCTTGTGTGCTTCA
ACAAGTGCTCCTCACACCTAAATCTGAAAGCCACCCTCAGAGACACTCCTTATTCCGGACAAGATGTACTATAAATGGTCGGGTGTGTCAAATAATTATAGACAGTGGCA
GCAGCAAGAATATTGTGTCAAAAAAGTTGGTTACTGCACTCAATTTGAAGGCTAAAGCACACCCAATCCGTATAGGGTTAGCTGGATCAAAAAAGGGGGAGAAACAATGG
TTAAAGACCAAATTGGAAACACTTACCAATACCAAATTACATGCGATATGGATGGGCAAGAAGGTGGTTCTCCTTCCCCTCGGTAAGAATGACAGCAAGCTCTTAAGTCC
AAATTCTAACATCTTAGATCCTCACATATCCAACCTATTAAATGAATTCCCTACTCTAATAGAAGAACCCAATTCTCTGCCACCACTTTGGAACATTCAACATCATATAG
ATCAAATACTAGGGAGCTCCTTACCTCATTTGCCACATTATCGCACGAGTCCAAAAGAATATGAAGCTTTGCACAATGAAATCCACACCTTATTAGATAAAGGACATTTA
CAACCAAGCCTTAGCCCATGTGCTGTCCCTGCTCTACTGGCACCAAAAAAAGATGGATCTTGGAGGCTTTGTGTGGATAGTAGAGCCATTAACAAAATAACTATAAAATA
TAGGTTCCCAATACCTCGACTCTCTGATTTGCTAGACCAACTAGGAGGAGCCTCCAGTTTTTCTAAATTGACCTTAAAAGTGGTTATCACCAAATTAGGATCCAATCGGG
GGATGAGTGGAGAACGACTTTTAAAACAAATGAAGGCCTTTTTGAATGGCTTCCCAAAGGGAATTAGTGTAGACCCGAGAAAAATAGAGGCGATAGTTGAATGGTCAGAA
CCAAAAACTACCAAAGACATCCGAAGCTTTCTTTGTTTAGCTTCATTTTATAGAAAATTCATCCCACATTTTAGCACCATTGCAGCACTAATGACAGCTTGTTTAAAGGC
CAATATTCCATTCAAATGGGGGGAGAAACAATCACAAAGTTTTTCTGAATTAAAGAGAATGCTTAGCTCAACTCCAGTCCTAGGATTTCCAGACTTTAATAAACCATTTG
AGGTCGCAGTAGATGCTTCTGGAATCGGTATTGGAGTTGTGCTAAGTCAAGACAAGCACCCTATAGAATTTTTTAGTGAAAAATTGAGCCCTTCCAGACAAAAATGGTCA
ACCTATGAACAAGAACTCTACTCTCTTGTTCAATCTTTAAAACAGTGGGAACATTACTTGCTTGGGAAAGAATTTGTGTTGCTAATAGATCATTTTTCTTTGAAATTTTT
GGAATTTCAAAAGGAAATTAGCTGTATGCATGCTAGGTGGATCCAATTCATTCAAAGGTTTGACTTCGTCATTAAACACACTCCAGGTAAAGCTAATAAATTTGCAGACA
CATTAAGCAGAAAAGGTACACTTCTAACTTTACTTCGTGGAAAAATTATAGCTTTTGATCATCTACCTGAATTATGTGCTAATGATTTGGATTTCAAAAATATTTGGCAA
GCATGGTCAAATCATATTCCAGCGAAAGATTTCCATATTTTTTATGGGTTTTTATTTAAAAATGACACCTTATGTATCCCTCAAACATCTCTTCATGAATCATTGTTACG
AGAAGCCCATTCGGGAGGTCTTGCTGGCCACTTTGGTCGAGATAAAACTTTGGCTCTACTTTCCTCTAAATTTTTCTGGCCACAACTTAAAAAGGATGTTGCTAATTTTG
TTAAACGTTGCTATATTTGTCAATCTTCCAAGGGTGCTAGTTTAAACCAAGGTCTGTACTCCACGTTACCAATACCAAACAATATTTGGGAGGATCTCTCTATGGATTTC
GTCCTTGGTTTACCTAGGACCCAAAGAGGCTTTGAATCCATTTTAGTGGTTGTTGACCGCTTTAGCAAAATGGCACATTTTTTAGCTTGCAAAAAGACTAATGATGCTTT
AAATGTTGCCAATCTTTTTTTAAGGGAAATTGTCCGTCTTCATGGTATACCTAAAAGCATAGTATCAGACAGGGATGTCAAATTCATGGGTCATTTTTGGCGATCATTAT
GGAAAAAATTTGGAACAAACCTCCTATATAGCACAATAAACCATCCTCAAACTGATGGACAAACAGAGGTTACAAACCGCACACTCGGCAATCTTTTAAGAAGTCTAAGT
GGAGATAGACCTCGCCAATGGGATCTTTACCTTGCTCAAGCAGAGTTTGCATTCAACAATATGACTAATGGCTCAACAGGGAAGTGCCCATTTGAAGTTGTATATACTCA
TGTTCCTAGATTAACATTAGATCTTGCTAAACTACCCATCTACGTTGATCTTAGTGCCGAGGCTACCATTATGGCTGATCGAATAAAAACAATTCATGAAGAAGTTCAAC
AACACCTAGAAGCAACCGACCCTTCTTATAAGTCCAAAGCAGATAAGCACAAACGAGCTGTGGAATTCCAGATTGGAGATTTAGTAATGATCCACTTGAAGAAAAGTAGA
CTACCCACTGGCCAACACTCCAAACTCATAAATAAGAAGATAGGACCACTTCCTATTCTAGAAAGAATTGGCCCTTATGCATACACGATTGATCTTCCTCCAACAGTGAA
GATCAGCAACATTTTCAATGTCTCAGACATATTCCCATATCATGCTCAAAACCAATTCACTTTATCACAATCGTCCCGGGACGAGTTTATTTTCTGGGAAGGGAAGAACT
GTTGTAGATATTTTAAGTATTATTTTACAATTATTTTCAGTTTCCAAACTTATGTATAA
Protein sequenceShow/hide protein sequence
MAISSNHVDILSSNLGIPPMESSIDSKHPVKVLGDMAVVDSASLLNHISGNTLNQHSKKDDSILKQQPIRSNDTSVARRNNNPYNPPTLAKCFRCGQQGHLSNECPQRRT
LAVQEEEDGADGCNSEDNEELAYLQPDDGDQLSCVLQQVLLTPKSESHPQRHSLFRTRCTINGRVCQIIIDSGSSKNIVSKKLVTALNLKAKAHPIRIGLAGSKKGEKQW
LKTKLETLTNTKLHAIWMGKKVVLLPLGKNDSKLLSPNSNILDPHISNLLNEFPTLIEEPNSLPPLWNIQHHIDQILGSSLPHLPHYRTSPKEYEALHNEIHTLLDKGHL
QPSLSPCAVPALLAPKKDGSWRLCVDSRAINKITIKYRFPIPRLSDLLDQLGGASSFSKLTLKVVITKLGSNRGMSGERLLKQMKAFLNGFPKGISVDPRKIEAIVEWSE
PKTTKDIRSFLCLASFYRKFIPHFSTIAALMTACLKANIPFKWGEKQSQSFSELKRMLSSTPVLGFPDFNKPFEVAVDASGIGIGVVLSQDKHPIEFFSEKLSPSRQKWS
TYEQELYSLVQSLKQWEHYLLGKEFVLLIDHFSLKFLEFQKEISCMHARWIQFIQRFDFVIKHTPGKANKFADTLSRKGTLLTLLRGKIIAFDHLPELCANDLDFKNIWQ
AWSNHIPAKDFHIFYGFLFKNDTLCIPQTSLHESLLREAHSGGLAGHFGRDKTLALLSSKFFWPQLKKDVANFVKRCYICQSSKGASLNQGLYSTLPIPNNIWEDLSMDF
VLGLPRTQRGFESILVVVDRFSKMAHFLACKKTNDALNVANLFLREIVRLHGIPKSIVSDRDVKFMGHFWRSLWKKFGTNLLYSTINHPQTDGQTEVTNRTLGNLLRSLS
GDRPRQWDLYLAQAEFAFNNMTNGSTGKCPFEVVYTHVPRLTLDLAKLPIYVDLSAEATIMADRIKTIHEEVQQHLEATDPSYKSKADKHKRAVEFQIGDLVMIHLKKSR
LPTGQHSKLINKKIGPLPILERIGPYAYTIDLPPTVKISNIFNVSDIFPYHAQNQFTLSQSSRDEFIFWEGKNCCRYFKYYFTIIFSFQTYV