CuGenDBv2

Cmc06g0169941 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc06g0169941
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: CMiso1.1chr06:24714007..24715728
RNA-Seq Expression: Cmc06g0169941
Synteny: Cmc06g0169941
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
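
The genome location field packs the sequence ID and coordinates into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for annotation files, not something this page states), the string can be split and the span length computed as follows. The helper name `parse_location` is hypothetical.

```python
import re

def parse_location(loc: str) -> tuple[str, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end)."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match["seqid"], int(match["start"]), int(match["end"])

seqid, start, end = parse_location("CMiso1.1chr06:24714007..24715728")
# Assumption: coordinates are 1-based and inclusive, hence the +1.
span = end - start + 1
print(seqid, start, end, span)  # CMiso1.1chr06 24714007 24715728 1722
```

Under that assumption the locus spans 1722 bp.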


Homology
GenBank top hits (columns: e value, % identity, alignment)
AAU90333.1 Putative gag and pol polyprotein, identical [Solanum demissum] (e value: 2.6e-135; % identity: 47.05)
Query:  SNKQNNPQSRSTVQI--VCYNCNKLGHLARNCRNKSR-PAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKD-KNILLGDH
        +N +NN      VQ    C+ C K GH+AR CR + R P   AN+ ++  VA+I+ +N++   +GWW+D+ A+RHVCYD   F+KY   ++ K I+LGD 
Subjt:  SNKQNNPQSRSTVQI--VCYNCNKLGHLARNCRNKSR-PAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKD-KNILLGDH

Query:  HTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWHVR
        HTT+V G G+VEL F+SG+ L LK+ L+TP ++KNL+S +L NK GF Q I SD + + K  +FV KG                                
Subjt:  HTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWHVR

Query:  LCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMF
                      L LIP +   +FEKC  CS+AKITK  H  V R T  LEL+H+D+CE  G LTR   R  ITFIDD S +T++YL+KNKSDA+E F
Subjt:  LCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMF

Query:  KVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNS
        K ++ E+E QF ++IKR+RSDRG EY+S  FN F  S GIIHET  PYSP  NG AERKNRTL EL   +L+ES A  ++WGE   T  YVLNR+P   S
Subjt:  KVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNS

Query:  KTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISN
        K + +E+ K   P+L YLR WGCLA+VR+ DPK  KL  +   C F+ YA N+ AYRF++LE+ ++IE  D  F E+KFPF S+NSG    +I    +  
Subjt:  KTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISN

Query:  SLPSIRIQT-QDKEV-DPEPRRSKRARTVKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINNEMDSLEPNRT
        SLPS    T ++KEV D E RRSKRAR  KDFG +F ++NV +DP  L EALSS D+  W+EA+N+EM+SL  N+T
Subjt:  SLPSIRIQT-QDKEV-DPEPRRSKRARTVKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINNEMDSLEPNRT

ABI34306.1 Polyprotein, putative [Solanum demissum] (e value: 5.5e-162; % identity: 53.05)
Query:  CYNCNKLGHLARNCRNKSR-PAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKD-KNILLGDHHTTKVAGIGEVELKFTSG
        C+ C K GH+AR CR + R P   AN+ ++  VA+I+ +N++   +GWW D+ A+RHVCYD   F+KY   ++ K I+LGD HTT+V G G+VEL FTSG
Subjt:  CYNCNKLGHLARNCRNKSR-PAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKD-KNILLGDHHTTKVAGIGEVELKFTSG

Query:  KMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWHVRLCHVNKRLISNMSRLNLI
        ++L LK+ L+TP ++K L+S +LLNKAGF Q I S+ + + K  +FV KGYA DGMFKLN+E+NK ++S YML+S N WH RLCH+N R +  MS L LI
Subjt:  KMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWHVRLCHVNKRLISNMSRLNLI

Query:  PKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIEKQFNKRIKRL
        P +   +FEKC  CS+AKITK  H  V R T+ LEL+H+D+CE  G LTR   RY ITFIDD S +T++YL+KNKSDA+E FK ++ E+E QF ++IKR+
Subjt:  PKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIEKQFNKRIKRL

Query:  RSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYL
        RSDRG EY+S  FN F  S GIIHET  PYSP  NG AERKNRTL EL   +L+ES A  ++WGE   T  YVLNR+P   SK +P+E+ K   P+L YL
Subjt:  RSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYL

Query:  RTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISNSLPSIRIQT-QDKEV-DP
        R WGCLA+VR+ DPK  KL  +   C F+ YA N+ AYRF++LE+ ++IE  D  F E+KFPF S+NSG    +I    +  +LPS    T ++KEV D 
Subjt:  RTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISNSLPSIRIQT-QDKEV-DP

Query:  EPRRSKRARTVKDFGEDFEMYNVEDPK-DLTEALSSVDANLWQEAINNEMDSLEPNRT
        E RRSKRAR  KDFG DF ++NV D +  L EALSS D+  W+EA+N+EM+SL  N+T
Subjt:  EPRRSKRARTVKDFGEDFEMYNVEDPK-DLTEALSSVDANLWQEAINNEMDSLEPNRT

KAA0034938.1 putative Polyprotein [Cucumis melo var. makuwa] (e value: 4.7e-302; % identity: 89.88)
Query:  MKRGSNKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKDKNILLGDH
        MKRGSNKQNN QSRSTVQI CYNCNK GHLA+NCRN+SRPAA ANLI+DELVAMISKVNVIGGSEGWWLDT AS HVC++LSLFRKYNEVKDKNILLGDH
Subjt:  MKRGSNKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKDKNILLGDH

Query:  HTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWHVR
        HTTKV GIGEVELKFTS K LV+KE LHTPEI+KNLV  YLLNKAGFTQTIGS+LFTLTKNNVFV KGYATDGMFKLNLEINKIASSAYMLTSFNVWH R
Subjt:  HTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWHVR

Query:  LCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMF
        LCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHK+VTRVT+PLELIHSDLCEFDG LTRNSKRYV+TFIDDCSDYTFIYLLKNKSDAYEMF
Subjt:  LCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMF

Query:  KVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNS
        KVFVTEIE QFNKRIKRLRSDRG EYDSV+FNEFY+SKGIIHET  PYSPEMNGK ERKNRTLTEL + ILLES AAPSWWGEI KT+NYVLNRIPKSNS
Subjt:  KVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNS

Query:  KTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISN
        KTSPYEVLKHK PNLSYLRTWGCLAYVRIP+P+RRKLAS+AYECVFI YA+N+KAYRFYDLENKVIIE NDVDFFEDKFPFKSRNSG LYSQ SGGS  +
Subjt:  KTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISN

Query:  SLPSIRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINNEMDSLEPNRT
        SLPSIRIQTQDKEVDPEPRRSKRARTVKDF EDFEMYNVEDPKDLT+ALSSVDANLWQEAIN+ +DSLE NRT
Subjt:  SLPSIRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINNEMDSLEPNRT

KAG5527251.1 hypothetical protein RHGRI_028223 [Rhododendron griersonianum] (e value: 3.1e-136; % identity: 45.23)
Query:  GSNKQNNPQSRSTVQ--IVCYNCNKLGHLARNCRNKSRPAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKDKNILLGDHH
        GSN+ NN + R+  +    CYNC K GH A++CR+K +  + AN+++++LVAM++++N+   S GWW D+ A+ HVC D SLF+ Y ++  +   +G+  
Subjt:  GSNKQNNPQSRSTVQ--IVCYNCNKLGHLARNCRNKSRPAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKDKNILLGDHH

Query:  TTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWHVRL
          KVAG G  EL FTSGK L L   LH P+++KNLVS  L+ K GF     SD   LTKN +FV KGY  +GMFKL++  N  ASS Y++ SF++WH RL
Subjt:  TTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWHVRL

Query:  CHVNKRLISNMSRLNLIPKLSLHD--FEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEM
         H+N R I NMSR  LI   + HD   +KC  C++AK+ K S   V R +E L+LIHSD+CE +G LTR  KRY  TF+DD S YTF+YLL+ K + +  
Subjt:  CHVNKRLISNMSRLNLIPKLSLHD--FEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEM

Query:  FKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSN
        F+ +  E+E Q NK+IK LRSDRG+EY    F++F    GIIH+  APYSP+ NG AERKNRTLTE+   +++ + A    WGE   T  Y+ NRI    
Subjt:  FKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSN

Query:  SKTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSIS
        +   PYE+ K + PNLSYL+ WGCLAY R+PDPKR KL  RA + +F+ YA+++KAYR  DLE+  I+E  +V FFE KF   S++   + ++ SG S++
Subjt:  SKTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSIS

Query:  NSLPSIRIQTQDKEVDPEPRRSKRARTVKDFGEDF-----EMYNVE------------------DPKDLTEALSSVDANLWQEAINNEMDSLEPNRT
          +   +   +   ++ EPRRS R R  K    D       ++ VE                  DPK  TEA+SS DA  W+EAIN+EMDSL  N T
Subjt:  NSLPSIRIQTQDKEVDPEPRRSKRARTVKDFGEDF-----EMYNVE------------------DPKDLTEALSSVDANLWQEAINNEMDSLEPNRT

XP_021732277.1 uncharacterized protein LOC110699091 [Chenopodium quinoa] (e value: 5.4e-133; % identity: 47.4)
Query:  KRGSNKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPA-AHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNE-VKDKNILLGD
        K    +Q+ P    T Q +CY C K GH+AR CRN   P  A A++I++  VAMI+++N+ GGS+GWW+DT A+RHVCYD  +F+ Y E   DK +LLGD
Subjt:  KRGSNKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPA-AHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNE-VKDKNILLGD

Query:  HHTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTS-FNVWH
         H+T +AG+G VELKFTSG+ L+LK+ LHTPE++KNLVS +LLNKAGF QTIGSDLFTLTKN +FV KGYATDGMFKLN+E+NKI++SAYML S  NVWH
Subjt:  HHTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTS-FNVWH

Query:  VRLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYE
         RLCHVNKRLI NMS L LIP +SL+DF+KC  CSQAKITKT HK V R +EPL+LIHSD+CE +G LTRN +RY ITFIDDCSDYT IYL+KNKSDA+E
Subjt:  VRLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYE

Query:  MFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKS
        M                                                        AERKNRT TELV+ I L SGAA  W                  
Subjt:  MFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKS

Query:  NSKTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSI
                                                                                        FPFKSRN        SGG+ 
Subjt:  NSKTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSI

Query:  SNSLPSIRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINNEMDSLEPNRT
        S+ +P  R  +QD + +PE R+SKRAR  KDFG DF + NV EDP  L EAL+SVDA+LWQEA+N+EMDSLE NRT
Subjt:  SNSLPSIRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINNEMDSLEPNRT

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5D3DCJ1 Putative Polyprotein (e value: 2.3e-302; % identity: 89.88)
Query:  MKRGSNKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKDKNILLGDH
        MKRGSNKQNN QSRSTVQI CYNCNK GHLA+NCRN+SRPAA ANLI+DELVAMISKVNVIGGSEGWWLDT AS HVC++LSLFRKYNEVKDKNILLGDH
Subjt:  MKRGSNKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKDKNILLGDH

Query:  HTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWHVR
        HTTKV GIGEVELKFTS K LV+KE LHTPEI+KNLV  YLLNKAGFTQTIGS+LFTLTKNNVFV KGYATDGMFKLNLEINKIASSAYMLTSFNVWH R
Subjt:  HTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWHVR

Query:  LCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMF
        LCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHK+VTRVT+PLELIHSDLCEFDG LTRNSKRYV+TFIDDCSDYTFIYLLKNKSDAYEMF
Subjt:  LCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMF

Query:  KVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNS
        KVFVTEIE QFNKRIKRLRSDRG EYDSV+FNEFY+SKGIIHET  PYSPEMNGK ERKNRTLTEL + ILLES AAPSWWGEI KT+NYVLNRIPKSNS
Subjt:  KVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNS

Query:  KTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISN
        KTSPYEVLKHK PNLSYLRTWGCLAYVRIP+P+RRKLAS+AYECVFI YA+N+KAYRFYDLENKVIIE NDVDFFEDKFPFKSRNSG LYSQ SGGS  +
Subjt:  KTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISN

Query:  SLPSIRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINNEMDSLEPNRT
        SLPSIRIQTQDKEVDPEPRRSKRARTVKDF EDFEMYNVEDPKDLT+ALSSVDANLWQEAIN+ +DSLE NRT
Subjt:  SLPSIRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINNEMDSLEPNRT

A0A7N2L531 Uncharacterized protein (e value: 4.7e-175; % identity: 56.47)
Query:  NKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAA-HANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEV-KDKNILLGDHHTT
        NK   P S+      C+ C K GH+AR C+ + R +   AN+ ++ LVAMI+ +N++   EGWW D+ A+RHVCYD + F+ Y    ++K ++LGD   T
Subjt:  NKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAA-HANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEV-KDKNILLGDHHTT

Query:  KVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIA-SSAYMLTSFNVWHVRLC
        KV G GEVELKFTSG++L LK+ L+TP ++KNL+S +LLNKAGF QT+ SD + +TK  +FV KGYA DGMFKLN+E NK + SS YML+S N WH RLC
Subjt:  KVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIA-SSAYMLTSFNVWHVRLC

Query:  HVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMFKV
        H+N R +  MS L LIP+LS  DFEKC  CSQAKITK  HK V R TE LELIHSDLCEF+G LTR   RY+ITFIDD S YT IYLLKNKSDA+E F+ 
Subjt:  HVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMFKV

Query:  FVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKT
        F+ E+E QF ++IKR+RSDRG EY+S +FN F  S GIIHET APYSP  NG AERKNRTL EL   +L+ESGA   +WGE   T  +VLNR+P   S T
Subjt:  FVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKT

Query:  SPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISNSL
        +P+E+ K   PNL YLR W CLAYVR+ DPK  KL  RA  C F+ YA N+ AYRF+DLENK+I E  D  F E+KFPFK +NSG   + +S  S S S 
Subjt:  SPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISNSL

Query:  PSIRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNVED-PKDLTEALSSVDANLWQEAINNEMDSLEPNRT
             Q Q+   + EPRRSKRAR  KDFG D+ ++N+E+ PK+L EAL+S DA  W+EA+N+EM+SL  NRT
Subjt:  PSIRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNVED-PKDLTEALSSVDANLWQEAINNEMDSLEPNRT

A0A7N2R9F3 Uncharacterized protein (e value: 1.4e-171; % identity: 55.59)
Query:  NKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAA-HANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEV-KDKNILLGDHHTT
        NK   P S+      C+ C K GH+AR C+ + R +    N+ ++ LVA+I+ +N++   EGWW D  A+RHVCYD + F+ Y    ++K I+LGD   T
Subjt:  NKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAA-HANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEV-KDKNILLGDHHTT

Query:  KVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIA-SSAYMLTSFNVWHVRLC
        KV G GEVELKFTSG++L LK+  +TP ++KNL+S +LLNKAGF QT+ SD + +TK  +FV KGYA DGMFKLN+E NK + SS YML+S N WH RLC
Subjt:  KVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIA-SSAYMLTSFNVWHVRLC

Query:  HVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMFKV
        H+N R +  MS L LIP+LS  DFEKC  CSQAKITK  HK V R TE LELIHSDLCEF+G LTR   RY+ITFIDD S YT IYLLKNKSDA+E F+ 
Subjt:  HVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMFKV

Query:  FVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKT
        F+ E+E QF ++IKR+RSDRG EY+S +FN F  S GIIHET APYSP  NG  ERKNRTL EL   +L+ESGA   +WGE   T  +VLNR+P   S T
Subjt:  FVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKT

Query:  SPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISNSL
        +P+E+ K   PNL YLR WGCLAYVR+ DPK  KL  RA  C F+ YA N+ AYRF+DLENK+I E  D  F E+KFPFK +NSG   + +   S S S 
Subjt:  SPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISNSL

Query:  PSIRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNVED-PKDLTEALSSVDANLWQEAINNEMDSLEPNRT
            +Q Q+   + E RRSKRAR  KDFG D+ ++N+E+ P++L EAL+S DA  W+EA+N+EM+SL  NRT
Subjt:  PSIRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNVED-PKDLTEALSSVDANLWQEAINNEMDSLEPNRT

Q0KIN7 Polyprotein, putative (e value: 2.7e-162; % identity: 53.05)
Query:  CYNCNKLGHLARNCRNKSR-PAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKD-KNILLGDHHTTKVAGIGEVELKFTSG
        C+ C K GH+AR CR + R P   AN+ ++  VA+I+ +N++   +GWW D+ A+RHVCYD   F+KY   ++ K I+LGD HTT+V G G+VEL FTSG
Subjt:  CYNCNKLGHLARNCRNKSR-PAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKD-KNILLGDHHTTKVAGIGEVELKFTSG

Query:  KMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWHVRLCHVNKRLISNMSRLNLI
        ++L LK+ L+TP ++K L+S +LLNKAGF Q I S+ + + K  +FV KGYA DGMFKLN+E+NK ++S YML+S N WH RLCH+N R +  MS L LI
Subjt:  KMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWHVRLCHVNKRLISNMSRLNLI

Query:  PKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIEKQFNKRIKRL
        P +   +FEKC  CS+AKITK  H  V R T+ LEL+H+D+CE  G LTR   RY ITFIDD S +T++YL+KNKSDA+E FK ++ E+E QF ++IKR+
Subjt:  PKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIEKQFNKRIKRL

Query:  RSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYL
        RSDRG EY+S  FN F  S GIIHET  PYSP  NG AERKNRTL EL   +L+ES A  ++WGE   T  YVLNR+P   SK +P+E+ K   P+L YL
Subjt:  RSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYL

Query:  RTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISNSLPSIRIQT-QDKEV-DP
        R WGCLA+VR+ DPK  KL  +   C F+ YA N+ AYRF++LE+ ++IE  D  F E+KFPF S+NSG    +I    +  +LPS    T ++KEV D 
Subjt:  RTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISNSLPSIRIQT-QDKEV-DP

Query:  EPRRSKRARTVKDFGEDFEMYNVEDPK-DLTEALSSVDANLWQEAINNEMDSLEPNRT
        E RRSKRAR  KDFG DF ++NV D +  L EALSS D+  W+EA+N+EM+SL  N+T
Subjt:  EPRRSKRARTVKDFGEDFEMYNVEDPK-DLTEALSSVDANLWQEAINNEMDSLEPNRT

Q60D13 Putative gag and pol polyprotein, identical (e value: 1.3e-135; % identity: 47.05)
Query:  SNKQNNPQSRSTVQI--VCYNCNKLGHLARNCRNKSR-PAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKD-KNILLGDH
        +N +NN      VQ    C+ C K GH+AR CR + R P   AN+ ++  VA+I+ +N++   +GWW+D+ A+RHVCYD   F+KY   ++ K I+LGD 
Subjt:  SNKQNNPQSRSTVQI--VCYNCNKLGHLARNCRNKSR-PAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKD-KNILLGDH

Query:  HTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWHVR
        HTT+V G G+VEL F+SG+ L LK+ L+TP ++KNL+S +L NK GF Q I SD + + K  +FV KG                                
Subjt:  HTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWHVR

Query:  LCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMF
                      L LIP +   +FEKC  CS+AKITK  H  V R T  LEL+H+D+CE  G LTR   R  ITFIDD S +T++YL+KNKSDA+E F
Subjt:  LCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMF

Query:  KVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNS
        K ++ E+E QF ++IKR+RSDRG EY+S  FN F  S GIIHET  PYSP  NG AERKNRTL EL   +L+ES A  ++WGE   T  YVLNR+P   S
Subjt:  KVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNS

Query:  KTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISN
        K + +E+ K   P+L YLR WGCLA+VR+ DPK  KL  +   C F+ YA N+ AYRF++LE+ ++IE  D  F E+KFPF S+NSG    +I    +  
Subjt:  KTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISN

Query:  SLPSIRIQT-QDKEV-DPEPRRSKRARTVKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINNEMDSLEPNRT
        SLPS    T ++KEV D E RRSKRAR  KDFG +F ++NV +DP  L EALSS D+  W+EA+N+EM+SL  N+T
Subjt:  SLPSIRIQT-QDKEV-DPEPRRSKRARTVKDFGEDFEMYNV-EDPKDLTEALSSVDANLWQEAINNEMDSLEPNRT

SwissProt top hits (columns: e value, % identity, alignment)
P04146 Copia protein (e value: 7.3e-48; % identity: 26.95)
Query:  QSRSTVQIVCYNCNKLGHLARNC---------RNKSRPAAHANLIKDELVAMISKVN--VIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKDKNILLGDH
        +  S  ++ C++C + GH+ ++C         +NK             +  M+ +VN   +  + G+ LD+ AS H+  D SL+    EV     +    
Subjt:  QSRSTVQIVCYNCNKLGHLARNC---------RNKSRPAAHANLIKDELVAMISKVN--VIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKDKNILLGDH

Query:  HTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASS--AYMLTSFNVWH
            +       ++  +   + L++ L   E   NL+S   L +AG +        T++KN + V K     GM      IN  A S  A    +F +WH
Subjt:  HTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASS--AYMLTSFNVWH

Query:  VRLCHVN---------KRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYL
         R  H++         K + S+ S LN + +LS    E C    QA++     K  T +  PL ++HSD+C     +T + K Y + F+D  + Y   YL
Subjt:  VRLCHVN---------KRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYL

Query:  LKNKSDAYEMFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTIN
        +K KSD + MF+ FV + E  FN ++  L  D G EY S    +F   KGI +    P++P++NG +ER  RT+TE    ++  +    S+WGE   T  
Subjt:  LKNKSDAYEMFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTIN

Query:  YVLNRIPKS---NSKTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFE-DKFPFKSRN
        Y++NRIP     +S  +PYE+  +K P L +LR +G   YV I + K+ K   ++++ +F+ Y  N   ++ +D  N+  I   DV   E +    ++  
Subjt:  YVLNRIPKS---NSKTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFE-DKFPFKSRN

Query:  SGDLYSQISGGSISNSLPSIRIQTQDKEVDPE-PRRSKRARTVKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINNEM--DSLEPNR
           ++ + S  S + + P+       K +  E P  SK    ++   +  E  N   P D  + + +   N  +E  N +   DS E N+
Subjt:  SGDLYSQISGGSISNSLPSIRIQTQDKEVDPE-PRRSKRARTVKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINNEM--DSLEPNR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 4.4e-69; % identity: 28.86)
Query:  KRGSNKQNNPQSRSTVQIVCYNCNKLGHLARNCRN--KSRPAAHANLIKDELVAMISK--------------VNVIGGSEGWWLDTSASRHVCYDLSLFR
        + G+  ++  +S+S V+  CYNCN+ GH  R+C N  K +         D   AM+                +++ G    W +DT+AS H      LF 
Subjt:  KRGSNKQNNPQSRSTVQIVCYNCNKLGHLARNCRN--KSRPAAHANLIKDELVAMISK--------------VNVIGGSEGWWLDTSASRHVCYDLSLFR

Query:  KYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEI-NKI
        +Y       + +G+   +K+AGIG++ +K   G  LVLK+  H P+++ NL+S   L++ G+     +  + LTK ++ + KG A   +++ N EI    
Subjt:  KYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEI-NKI

Query:  ASSAYMLTSFNVWHVRLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVT-RVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSD
         ++A    S ++WH R+ H++++ +  +++ +LI        + C  C   K  + S +  + R    L+L++SD+C      +    +Y +TFIDD S 
Subjt:  ASSAYMLTSFNVWHVRLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVT-RVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSD

Query:  YTFIYLLKNKSDAYEMFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGE
          ++Y+LK K   +++F+ F   +E++  +++KRLRSD G EY S  F E+ SS GI HE   P +P+ NG AER NRT+ E V  +L  +    S+WGE
Subjt:  YTFIYLLKNKSDAYEMFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGE

Query:  IFKTINYVLNRIPK-SNSKTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFK
          +T  Y++NR P    +   P  V  +K  + S+L+ +GC A+  +P  +R KL  ++  C+FI Y      YR +D   K +I   DV F E +    
Subjt:  IFKTINYVLNRIPK-SNSKTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFK

Query:  SRNSGDLYSQISGGSISN--SLPS--------------------------------------IRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNVED-
         R + D+  ++  G I N  ++PS                                      +   TQ +E     RRS+R R         E   + D 
Subjt:  SRNSGDLYSQISGGSISN--SLPS--------------------------------------IRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNVED-

Query:  --PKDLTEALSSVDANLWQEAINNEMDSLEPNRT
          P+ L E LS  + N   +A+  EM+SL+ N T
Subjt:  --PKDLTEALSSVDANLWQEAINNEMDSLEPNRT

P47024 Transposon Ty4-J Gag-Pol polyprotein (e value: 5.3e-14; % identity: 20.68)
Query:  LDTSASRHVCYDLSLFRKYNEVKDKNIL--LGDHHTTKVAGIGEVELK----FTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNN
        +DT +  ++  D +L   Y +         +G + +  V G G +++K     T  K L+     + PE +  ++S Y L K   T+ + S  +T   N 
Subjt:  LDTSASRHVCYDLSLFRKYNEVKDKNIL--LGDHHTTKVAGIGEVELK----FTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNN

Query:  VFVEKGYATDGMFKLNL-----------EINKI---ASSAYMLTSFNVW----HVRLCHVNKRLISNMSRLNLIPKLSLHDFEK-----CACCSQAKITK
        +   K    +G+  + +           +IN I   +S  + L   ++     H R+ H   + I N  + N   + SL   ++     C  C  +K TK
Subjt:  VFVEKGYATDGMFKLNL-----------EINKI---ASSAYMLTSFNVW----HVRLCHVNKRLISNMSRLNLIPKLSLHDFEK-----CACCSQAKITK

Query:  TSHKYVTRVT------EPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDY--TFIYLLKNKSDAYEMFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSF
         +H Y   +       EP      D+     +   ++KRY++  +D+ + Y  T  +  KN        +  +  +E QF+++++ + SDRG E+ +   
Subjt:  TSHKYVTRVT------EPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDY--TFIYLLKNKSDAYEMFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSF

Query:  NEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYLRTWGCLAYVRIPD
         E++ SKGI H   +      NG+AER  RT+      +L +S     +W     +   + N +   ++   P + +  +   +  +          I +
Subjt:  NEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYLRTWGCLAYVRIPD

Query:  PKRRKLASRAYECVFIEYAKNNKAYRFY-DLENKVIIEWN
           +KL       + +    N+  Y+F+   +NK++   N
Subjt:  PKRRKLASRAYECVFIEYAKNNKAYRFY-DLENKVIIEWN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.3e-44; % identity: 29.44)
Query:  NNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAAHANLIK--DELVAMISKVNVIGG----SEGWWLDTSASRHVCYD---LSLFRKYNEVKDKNILLGD
        NN QS+  +   C  C   GH A+ C       +  N  +          + N+  G    S  W LD+ A+ H+  D   LSL + Y    D  +++ D
Subjt:  NNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAAHANLIK--DELVAMISKVNVIGG----SEGWWLDTSASRHVCYD---LSLFRKYNEVKDKNILLGD

Query:  HHTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRY-LLNKAGFTQTIGSDLFTLTKNNVFVE--KGYATDGMFKLNLEINK---IASSAYMLTS
          T  ++  G   L  T  + L L   L+ P I KNL+S Y L N  G +       F +   N  V   +G   D +++  +  ++   + +S     +
Subjt:  HHTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRY-LLNKAGFTQTIGSDLFTLTKNNVFVE--KGYATDGMFKLNLEINK---IASSAYMLTS

Query:  FNVWHVRLCH----VNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVT-RVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIY
         + WH RL H    +   +ISN S   L P    H F  C+ C   K  K      T   T PLE I+SD+      L+ ++ RY + F+D  + YT++Y
Subjt:  FNVWHVRLCH----VNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVT-RVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIY

Query:  LLKNKSDAYEMFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTI
         LK KS   E F  F   +E +F  RI    SD G E+  V+  E++S  GI H T  P++PE NG +ERK+R + E  + +L  +    ++W   F   
Subjt:  LLKNKSDAYEMFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTI

Query:  NYVLNRIPKSNSK-TSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKS
         Y++NR+P    +  SP++ L   +PN   LR +GC  Y  +    + KL  ++ +CVF+ Y+    AY    L+   +     V F E+ FPF +
Subjt:  NYVLNRIPKSNSK-TSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 2.0e-42; % identity: 27)
Query:  RGSNKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAAHANLIKDELVAMIS--------KVNVIGGSEGWWLDTSASRHVCYD---LSLFRKYNEVK
        R  N+Q  P         C  C+  GH A+ C    +  +  N  + +  +  +         VN    +  W LD+ A+ H+  D   LS  + Y    
Subjt:  RGSNKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAAHANLIKDELVAMIS--------KVNVIGGSEGWWLDTSASRHVCYD---LSLFRKYNEVK

Query:  DKNILLGDHHTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLT------KNNVFVEKGYATDGMFKLNLEINKIA
        D  +++ D  T  +   G   L  TS + L L + L+ P I KNL+S Y L     T  +  + F  +         V + +G   D +++  +  ++  
Subjt:  DKNILLGDHHTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLT------KNNVFVEKGYATDGMFKLNLEINKIA

Query:  S---SAYMLTSFNVWHVRLCHVNKRLISNMSRLNLIPKLS-LHDFEKCACCSQAKITKT--SHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFID
        S   S     + + WH RL H +  +++++   + +P L+  H    C+ C   K  K   S+  +T  ++PLE I+SD+      L+ ++ RY + F+D
Subjt:  S---SAYMLTSFNVWHVRLCHVNKRLISNMSRLNLIPKLS-LHDFEKCACCSQAKITKT--SHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFID

Query:  DCSDYTFIYLLKNKSDAYEMFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPS
          + YT++Y LK KS   + F +F + +E +F  RI  L SD G E+  V   ++ S  GI H T  P++PE NG +ERK+R + E+ + +L  +    +
Subjt:  DCSDYTFIYLLKNKSDAYEMFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPS

Query:  WWGEIFKTINYVLNRIPKSNSK-TSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDK
        +W   F    Y++NR+P    +  SP++ L  + PN   L+ +GC  Y  +    R KL  ++ +C F+ Y+    AY    +    +     V F E  
Subjt:  WWGEIFKTINYVLNRIPKSNSK-TSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDK

Query:  FPFKSRNSGDLYSQISGGSISNSLPS
        FPF + N G   SQ      + + PS
Subjt:  FPFKSRNSGDLYSQISGGSISNSLPS

Arabidopsis top hits (columns: e value, % identity, alignment)
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 3.5e-05; % identity: 30.59)
Query:  NRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTS-PYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYE
        NRT+ E V  +L E G   ++  +   T  +++N+ P +      P EV     P  SYLR +GC+AY+   + K +  A +  E
Subjt:  NRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTS-PYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYE
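
The % identity values can be related to the alignment blocks. The sketch below assumes the middle line of each block follows the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The function name `percent_identity` is hypothetical, and the two strings are the query and middle lines of the ATMG00710.1 alignment with their prefixes removed.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter."""
    # Assumption: letters in the midline mark identical residues.
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(query)

query = "NRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTS-PYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYE"
midline = "NRT+ E V  +L E G   ++  +   T  +++N+ P +      P EV     P  SYLR +GC+AY+   + K +  A +  E"

print(round(percent_identity(query, midline), 2))  # 30.59
```

Here 26 identical columns out of 85 give 30.59, which agrees with the value listed for this hit.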


Sequences
CDS sequence
ATGAAACGAGGATCAAACAAACAAAACAACCCACAGTCCAGAAGTACGGTACAAATTGTTTGTTATAATTGTAATAAGCTTGGTCATTTAGCTAGAAATTGTAGAAATAA
GAGTCGTCCTGCTGCGCATGCGAACCTGATAAAAGATGAATTAGTAGCTATGATATCTAAAGTTAATGTGATTGGGGGGTCTGAAGGTTGGTGGCTAGACACTAGTGCAT
CCCGCCATGTCTGCTACGACCTTAGTCTTTTTAGAAAATATAATGAAGTTAAGGATAAAAATATCCTTCTAGGAGATCATCACACGACCAAGGTGGCCGGCATTGGAGAA
GTAGAACTGAAATTCACATCCGGCAAGATGCTTGTGCTGAAGGAATTTCTGCATACTCCAGAAATTCAAAAGAATTTGGTCTCCAGATATCTCCTCAACAAGGCTGGATT
CACACAAACCATAGGATCAGACTTGTTTACTTTAACTAAAAACAATGTGTTTGTGGAGAAGGGTTACGCTACTGATGGCATGTTCAAATTGAATCTGGAAATTAATAAGA
TTGCATCTTCTGCTTACATGTTGACTTCTTTTAATGTTTGGCATGTTAGACTTTGTCATGTTAATAAAAGATTAATTAGTAACATGAGTAGGTTAAATCTTATACCTAAG
TTATCTCTGCATGATTTTGAGAAATGTGCATGTTGTAGTCAAGCTAAGATAACTAAAACCTCGCATAAGTATGTAACTAGAGTAACAGAGCCTTTAGAATTAATTCATTC
AGACTTATGTGAATTTGATGGCGCTTTAACTAGAAACAGTAAAAGGTATGTAATTACCTTTATAGATGATTGTTCTGACTACACTTTTATTTATCTGCTTAAAAATAAAA
GTGATGCATATGAAATGTTCAAAGTCTTTGTAACTGAAATAGAGAAACAGTTTAACAAAAGAATTAAGAGACTTCGTAGTGATAGAGGAATTGAATATGATTCAGTTTCT
TTCAATGAGTTTTATAGCTCAAAAGGAATAATACATGAAACTATTGCGCCTTATTCTCCTGAAATGAATGGAAAAGCAGAAAGAAAGAATAGAACTCTAACTGAGTTAGT
AATTGTTATCTTACTTGAGTCAGGAGCCGCACCATCTTGGTGGGGAGAAATATTTAAGACTATTAATTATGTTCTTAATAGGATTCCTAAATCAAACAGTAAAACTTCAC
CATACGAAGTCCTTAAACATAAAGCACCAAACCTATCTTATCTTCGAACTTGGGGTTGTCTAGCTTATGTTAGAATACCTGATCCAAAAAGAAGGAAATTAGCAAGTAGA
GCCTATGAATGTGTCTTCATAGAATACGCTAAAAATAATAAAGCCTATAGATTCTATGACTTAGAAAACAAAGTAATTATAGAATGGAATGACGTAGATTTTTTCGAGGA
CAAATTTCCTTTTAAATCTAGAAATAGTGGGGACCTATATAGTCAAATTAGTGGGGGCTCAATTTCCAATAGTCTACCTTCAATTAGGATCCAAACCCAAGACAAGGAAG
TAGATCCTGAACCTAGAAGAAGCAAGAGAGCTAGAACAGTAAAAGACTTTGGAGAAGACTTCGAAATGTACAACGTAGAAGATCCAAAAGATCTAACAGAAGCATTGTCA
TCAGTAGATGCTAATTTATGGCAAGAAGCTATCAATAATGAAATGGACTCTCTTGAACCCAATAGAACTTGA
mRNA sequence
ATGAAACGAGGATCAAACAAACAAAACAACCCACAGTCCAGAAGTACGGTACAAATTGTTTGTTATAATTGTAATAAGCTTGGTCATTTAGCTAGAAATTGTAGAAATAA
GAGTCGTCCTGCTGCGCATGCGAACCTGATAAAAGATGAATTAGTAGCTATGATATCTAAAGTTAATGTGATTGGGGGGTCTGAAGGTTGGTGGCTAGACACTAGTGCAT
CCCGCCATGTCTGCTACGACCTTAGTCTTTTTAGAAAATATAATGAAGTTAAGGATAAAAATATCCTTCTAGGAGATCATCACACGACCAAGGTGGCCGGCATTGGAGAA
GTAGAACTGAAATTCACATCCGGCAAGATGCTTGTGCTGAAGGAATTTCTGCATACTCCAGAAATTCAAAAGAATTTGGTCTCCAGATATCTCCTCAACAAGGCTGGATT
CACACAAACCATAGGATCAGACTTGTTTACTTTAACTAAAAACAATGTGTTTGTGGAGAAGGGTTACGCTACTGATGGCATGTTCAAATTGAATCTGGAAATTAATAAGA
TTGCATCTTCTGCTTACATGTTGACTTCTTTTAATGTTTGGCATGTTAGACTTTGTCATGTTAATAAAAGATTAATTAGTAACATGAGTAGGTTAAATCTTATACCTAAG
TTATCTCTGCATGATTTTGAGAAATGTGCATGTTGTAGTCAAGCTAAGATAACTAAAACCTCGCATAAGTATGTAACTAGAGTAACAGAGCCTTTAGAATTAATTCATTC
AGACTTATGTGAATTTGATGGCGCTTTAACTAGAAACAGTAAAAGGTATGTAATTACCTTTATAGATGATTGTTCTGACTACACTTTTATTTATCTGCTTAAAAATAAAA
GTGATGCATATGAAATGTTCAAAGTCTTTGTAACTGAAATAGAGAAACAGTTTAACAAAAGAATTAAGAGACTTCGTAGTGATAGAGGAATTGAATATGATTCAGTTTCT
TTCAATGAGTTTTATAGCTCAAAAGGAATAATACATGAAACTATTGCGCCTTATTCTCCTGAAATGAATGGAAAAGCAGAAAGAAAGAATAGAACTCTAACTGAGTTAGT
AATTGTTATCTTACTTGAGTCAGGAGCCGCACCATCTTGGTGGGGAGAAATATTTAAGACTATTAATTATGTTCTTAATAGGATTCCTAAATCAAACAGTAAAACTTCAC
CATACGAAGTCCTTAAACATAAAGCACCAAACCTATCTTATCTTCGAACTTGGGGTTGTCTAGCTTATGTTAGAATACCTGATCCAAAAAGAAGGAAATTAGCAAGTAGA
GCCTATGAATGTGTCTTCATAGAATACGCTAAAAATAATAAAGCCTATAGATTCTATGACTTAGAAAACAAAGTAATTATAGAATGGAATGACGTAGATTTTTTCGAGGA
CAAATTTCCTTTTAAATCTAGAAATAGTGGGGACCTATATAGTCAAATTAGTGGGGGCTCAATTTCCAATAGTCTACCTTCAATTAGGATCCAAACCCAAGACAAGGAAG
TAGATCCTGAACCTAGAAGAAGCAAGAGAGCTAGAACAGTAAAAGACTTTGGAGAAGACTTCGAAATGTACAACGTAGAAGATCCAAAAGATCTAACAGAAGCATTGTCA
TCAGTAGATGCTAATTTATGGCAAGAAGCTATCAATAATGAAATGGACTCTCTTGAACCCAATAGAACTTGA
Protein sequence
MKRGSNKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKDKNILLGDHHTTKVAGIGE
VELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWHVRLCHVNKRLISNMSRLNLIPK
LSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVS
FNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASR
AYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISNSLPSIRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNVEDPKDLTEALS
SVDANLWQEAINNEMDSLEPNRT
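
The protein sequence can be cross-checked against the CDS by translation. The sketch below assumes the standard genetic code (the page does not say which table was used) and, to stay short, translates only the first 30 and the last 72 nucleotides of the CDS as copied from this page. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino = CODON_TABLE[cds[i:i + 3].upper()]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)

cds_start = "ATGAAACGAGGATCAAACAAACAAAACAAC"
cds_end = "TCAGTAGATGCTAATTTATGGCAAGAAGCTATCAATAATGAAATGGACTCTCTTGAACCCAATAGAACTTGA"

print(translate(cds_start))  # MKRGSNKQNN
print(translate(cds_end))    # SVDANLWQEAINNEMDSLEPNRT
```

Both fragments reproduce the matching ends of the protein sequence, and the final codon TGA is read as a stop.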