; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G11110 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G11110
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionReverse transcriptase
Genome locationClcChr11:16577312..16579293
RNA-Seq ExpressionClc11G11110
SyntenyClc11G11110
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_012833379.1 PREDICTED: uncharacterized protein LOC105954252 [Erythranthe guttata]1.8e-25164.91Show/hide
Query:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ
        MLDRL G+ +YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAF+RMPFGLCNAP TFQRCMMSIF  ++E+ +EVFMD FSVFGSSFD C+ NL  VL+
Subjt:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ

Query:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN
        RC + NLVLNWEKCHFMV EGIVLGHKVSKKGLEVDRAKI  IE+LPPP +VKGVRSFLGHAGFYRRFIKDFSKI KPL +LLEKEA F FD ACL AF 
Subjt:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN

Query:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF
         LKE+L  +PI++ P+W + FEIMCDASDYA+GAVLGQRRD +F+AIYYSSRTLD  Q+ Y+TTEKE+LAVV+A+DKFR Y+LGS+++++TDHAA++YLF
Subjt:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF

Query:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKS
         KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADHLSR+  EE  +   I E F DEQL  +    PW+AD+ N+LA G +P +++Y QKK+FLH+ + 
Subjt:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKS

Query:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFDV
        Y W++PLL++   D +IR+CVP+ EV  IL  CH+SP GGH G +RTAAKVLQSGF+WP+LF+D   FVK CDRCQRTGN+S + ++P+  + EVELFDV
Subjt:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFDV

Query:  WGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF---------------------------------------------
        WGIDFMGPFP S NG LYIL+AVDYVSKWVEAIAT  NDARTV+KF HKNIF+RF                                             
Subjt:  WGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF---------------------------------------------

Query:  -------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL
                     V TNRK+WALK DDALWAYRTAFKTPIG SPY+L
Subjt:  -------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL

XP_012833687.1 PREDICTED: uncharacterized protein LOC105954563 [Erythranthe guttata]4.6e-25265.07Show/hide
Query:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ
        MLDRL G+ +YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAF+RMPFGLCNAP TFQRCMMSIF  ++E+ +EVFMD FSVFGSSFD C+ NL  VL+
Subjt:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ

Query:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN
        RC + NLVLNWEKCHFMV EGIVLGHKVSKKGLEVDRAKI  IE+LPPP +VKGVRSFLGHAGFYRRFIKDFSKI KPL +LLEKEA F FD ACL AF 
Subjt:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN

Query:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF
         LKE+L  +PI++ P+W + FEIMCDASDYA+GAVLGQRRD +F+AIYYSSRTLD  Q+ Y+TTEKE+LAVV+A+DKFR Y+LGS+++++TDHAA++YLF
Subjt:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF

Query:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKS
         KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADHLSR+  EE  +   I E F DEQL  +    PW+AD+ N+LA G +P +++Y QKK+FLH+ + 
Subjt:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKS

Query:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFDV
        Y W++PLL++   D +IR+CVP+ EV  IL  CH+SP GGH G +RTAAKVLQSGF+WP+LF+D   FVK CDRCQRTGN+S + ++P+  + EVELFDV
Subjt:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFDV

Query:  WGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF---------------------------------------------
        WGIDFMGPFP S NG LYIL+AVDYVSKWVEAIAT TNDARTV+KF HKNIF+RF                                             
Subjt:  WGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF---------------------------------------------

Query:  -------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL
                     V TNRK+WALK DDALWAYRTAFKTPIG SPY+L
Subjt:  -------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL

XP_012846413.1 PREDICTED: uncharacterized protein LOC105966405 [Erythranthe guttata]6.1e-25265.07Show/hide
Query:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ
        MLDRL G+ +YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAF+RMPFGLCNAP TFQRCMMSIF  ++E+ +EVFMD FSVFGSSFD C+ NL  VLQ
Subjt:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ

Query:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN
        RC + NLVLNWEKCHFMV EGIVLGHKVSKKGLEVDRAKI  IE+LPPP +VKGVRSFLGHAGFYRRFIKDFSKI KPL +LLEKEA F FD ACL AF 
Subjt:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN

Query:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF
         LKE+L  +PI++ P+W + FEIMCDASDYA+GAVLGQRRD +F+AIYYSSRTLD  Q+ Y+TTEKE+LAVV+A+DKFR Y+LGS+++++TDHAA++YLF
Subjt:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF

Query:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKS
         KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADHLSR+  +E  +   I E F DEQL  +    PW+AD+ N+LA G +P +++Y QKK+FLH+ + 
Subjt:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKS

Query:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFDV
        Y W++PLL++   D +IR+CVP+ EV  IL  CH+SP GGH G +RTAAKVLQSGF+WP+LF+D   FVK CDRCQRTGN+S + ++P+  + EVELFDV
Subjt:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFDV

Query:  WGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF---------------------------------------------
        WGIDFMGPFP S NG LYIL+AVDYVSKWVEAIAT TNDARTV+KF HKNIF+RF                                             
Subjt:  WGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF---------------------------------------------

Query:  -------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL
                     V TNRK+WALK DDALWAYRTAFKTPIG SPY+L
Subjt:  -------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL

XP_012847037.1 PREDICTED: uncharacterized protein LOC105967019 [Erythranthe guttata]4.6e-25265.07Show/hide
Query:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ
        MLDRL G+ +YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAF+RMPFGLCNAP TFQRCMMSIF  ++E+ +EVFMD FSVFGSSFD C+ NL  VL+
Subjt:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ

Query:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN
        RC + NLVLNWEKCHFMV EGIVLGHKVSKKGLEVDRAKI  IE+LPPP +VKGVRSFLGHAGFYRRFIKDFSKI KPL +LLEKEA F FD ACL AF 
Subjt:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN

Query:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF
         LKE+L  +PI++ P+W + FEIMCDASDYA+GAVLGQRRD +F+AIYYSSRTLD  Q+ Y+TTEKE+LAVV+A+DKFR Y+LGS+++++TDHAA++YLF
Subjt:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF

Query:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKS
         KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADHLSR+  EE  +   I E F DEQL  +    PW+AD+ N+LA G +P +++Y QKK+FLH+ + 
Subjt:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKS

Query:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFDV
        Y W++PLL++   D +IR+CVP+ EV  IL  CH+SP GGH G +RTAAKVLQSGF+WP+LF+D   FVK CDRCQRTGN+S + ++P+  + EVELFDV
Subjt:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFDV

Query:  WGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF---------------------------------------------
        WGIDFMGPFP S NG LYIL+AVDYVSKWVEAIAT TNDARTV+KF HKNIF+RF                                             
Subjt:  WGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF---------------------------------------------

Query:  -------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL
                     V TNRK+WALK DDALWAYRTAFKTPIG SPY+L
Subjt:  -------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]2.4e-26468.16Show/hide
Query:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ
        MLDRLAGYS+YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAF+RMPFGLCNAP TFQRCMM+IF  ++EDIME+FMD FSVFG+SFD CL NL  VLQ
Subjt:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ

Query:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN
        RC+D NLVLNWEKCHFMV EGIVLGH+VS KG+EVDRAKI  IE+LPPP NVKG+RSFLGHAGFYRRFIKDFSK++KPL NLLEK + F FDD CL AFN
Subjt:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN

Query:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF
         +KE+LI+AP++ VPDWSQ FE+MCDASD+ALGAVLGQRRD +FRAIYY+SRTL+  Q  YTTTEKE+LAVVFA DKFRSYL+ +K++V TDHAAL+YLF
Subjt:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF

Query:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKS
         KKD+KPRL+RWILLLQEFDLE++D+KG EN VADHLSR+E EE +    I E F DEQL+  +  LPW+ADIVN+LA   LPP++ Y Q+K+FLH++K 
Subjt:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKS

Query:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFDV
        Y W++PLL+K C D +IR+CVP+EE+ +IL+ CH+S YGGHFG TRTAAKVLQSGF+WPS+F+D  T VK+CDRCQR GNISR+ ELP+K ILEVELFDV
Subjt:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFDV

Query:  WGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF---------------------------------------------
        WGIDFMGPFP S+ G++YIL+AVDYVSKWVEAIAT TNDA+ V+KFLHKNIFTRF                                             
Subjt:  WGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF---------------------------------------------

Query:  -------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL
                     V TNRK+WA K DDALWAYRTAFKTPIG SPYRL
Subjt:  -------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase5.0e-24463.48Show/hide
Query:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ
        MLDRLAG   YCFLDGYSGYNQI IAPEDQEK+TFTCPYGTFAF+RMPFGLCNAP TFQRCMM+IF  ++E+ +EVFMD FSV+G+SFD CL NL+ VL+
Subjt:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ

Query:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN
        RC+D NL+LNWEKCHFMV EGIVLGHKVS +G+EVD+AK+  IE+LPPPT+VKGVRSFLGHAGFYRRFIKDFSKI+KPL NLLEK+  F FDDAC  AFN
Subjt:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN

Query:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF
         LK RLI+APII VPDWS  FE+MCDASD+A+GAVLGQR+D +FR+IYY+S+TL++ Q  YTTTEKELLAVVFA DKFRSYL+G+K++V+TDHAA++YL 
Subjt:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF

Query:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIV-EKFSDEQLYQ-VKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNI
         KKD+KPRL+RW+LLLQEFDLEI+DRKG EN +ADHLSR+E+      P ++ + F DEQL   V   +PW+ADIVNYL  G +P +++ QQKK+FL + 
Subjt:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIV-EKFSDEQLYQ-VKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNI

Query:  KSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELF
        + Y W+DP L+K   DN++R+CVP+ E+  IL  CHASPYGGHF   RTAAK+LQSGF+WP+LFKD  +FV +CDRCQRTGNISR+HE+P+  ILEVELF
Subjt:  KSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELF

Query:  DVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF-------------------------------------------
        DVWGIDFMGPF  S+ G +YILVAVDYVSKWVEA A   ND++ VV F+ KNIFTRF                                           
Subjt:  DVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF-------------------------------------------

Query:  ---------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL
                       V + RK+W+ + D+ALWAYRTA+KTPIG SPYRL
Subjt:  ---------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL

A0A2G9HWF8 Reverse transcriptase1.5e-24062.71Show/hide
Query:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ
        MLDRLAG   YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAF+R+PF LCNAP TFQRCMM+IF  ++E+ +EVFMD FSV+G SFD CL NL+ VL+
Subjt:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ

Query:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN
        RC+D NLVLNWEKCHFMV EGIVLGHKVS +G+EVD+AK+  IE+LPP T+VKGVRSFLGHAGFYRRFIKDF KI+KPL  LLEK+  F FDDACL AF+
Subjt:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN

Query:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF
         LK RLI+APII VPDWS  FE+MCDASD+A+GAVLGQR+D +FR+IYY+S+TL++ Q  YTTTEKELLAVVFA DKFRSYL+G+K++V+TDHAA++YL 
Subjt:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF

Query:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIV-EKFSDEQLYQ-VKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNI
         KKD+KPRL+RW+LLLQEFDLEI+DRKG EN +ADHLSR+E+      P ++ + F DEQL   V   +PW+ADIVNYL  G +P +++ QQKK+FL + 
Subjt:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIV-EKFSDEQLYQ-VKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNI

Query:  KSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELF
        + Y W+DP L+K   DN++R+CVP+ E+  I   CHASPYGGHF   RTAAK+LQSGF+WP+LFKD  +FV +CDRCQRTGNISR+HE+P+K ILEVELF
Subjt:  KSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELF

Query:  DVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF-------------------------------------------
        DVWGIDFMGPF  S+ G +YILVAVDY+SKWVEA+A   ND++ VV F+ KNIFTRF                                           
Subjt:  DVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF-------------------------------------------

Query:  ---------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL
                       V + RK+W+ + D+ALWAYRTAFKTPIG SPY L
Subjt:  ---------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL

A0A4Y1RS99 Transposable element protein2.0e-24062.63Show/hide
Query:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ
        ML+RLAG+++YCFLDGYSGYNQI IAPEDQEK TFTCP+GTFA++RMPFGLCNAP TFQRCMMSIF  ++E  +EVFMD FSVFGSSFDSCL NL  VL 
Subjt:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ

Query:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN
        RC++ NLVLNWEKCHFMV EGIVLGHK+S +G+EVDRAKI  IE+LPPP+ VKG+RSFLGHAGFYRRFIKDFSKI KPL  LL K+++F FD  CL AFN
Subjt:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN

Query:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF
         LK +L  AP+I+ PDW   FEIMCDASDYA+GAVLGQR++ +   I+Y+SRTL++ Q  Y TTEKELLAVVFA+DKFRSYLLG+K++V+TDHAALK+L 
Subjt:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF

Query:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRI--ENEEAKSWPPIVEKFSDEQLYQVKDS----LPWFADIVNYLAGGHLPPNMNYQQKKRF
         KK++KPRL+RW+LLLQEFD+EI+D+KG ENVVADHLSR+  E+E  +   PI+E F DEQLY +  +     PW+AD VNYLA G LPP+M++ QKK+F
Subjt:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRI--ENEEAKSWPPIVEKFSDEQLYQVKDS----LPWFADIVNYLAGGHLPPNMNYQQKKRF

Query:  LHNIKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILE
        L  +K Y+W+DP L+K   D +IR+CVP+ E+  IL  CH    GGH+G ++T AKVLQSGF+WP+LFKD   FV  CD CQRTGNIS ++++P+  ILE
Subjt:  LHNIKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILE

Query:  VELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF---------------------------------------
        VELFDVWGIDFMGPFP SY G LYILVAVDYVSKWVEA A  TNDA+ VV+FL KNIFTRF                                       
Subjt:  VELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF---------------------------------------

Query:  -------------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL
                           V  +RK+W+LK DDALWAYRTAFK PIG SPYRL
Subjt:  -------------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL

A0A6P8CBX2 Reverse transcriptase1.7e-24462.96Show/hide
Query:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ
        ML++LAG+ +YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAF+RMPFGLCNAP TFQRCMMSIF  ++E+ +E+FMD FSVFG SF+SCL NL  VL+
Subjt:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ

Query:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN
        RC++ NL+LNWEKCHFMV EGIVLGHKVSKKG+EVDRAK+  IE+LPPPT+ KGVRSFLGHAGFYRRFIKDFSKI++PL NLLEK++ F+F+D CL AFN
Subjt:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN

Query:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF
         LKE+L +AP+IV P+W   FE+MCDASDYA+GAVLGQRR  +F AIYY+SRTL+  Q+ Y TTEKELLAV+FA DKFR YL+GSKI+V+TDHAALKYLF
Subjt:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF

Query:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVK-DSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIK
         K D+KPRL+RWILLLQEFDLEI+D KG ENVVADHLSR+E++   S  PI EKF DEQL+  +   LPW+ADIVNY+     P  ++ QQKK+FLH++K
Subjt:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVK-DSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIK

Query:  SYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFD
         Y W++P L+K CAD +IR+CVP+ E +SI+  CH+   GGHFG  RTA K+L  GFYWP +F DC  ++ SC  CQRTGNISR+HE+P   IL +ELFD
Subjt:  SYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFD

Query:  VWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF--------------------------------------------
        VWGIDFMGPFP S++   YILVAVDYVSKWVEA+A ++NDAR V++FL KNIF+RF                                            
Subjt:  VWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF--------------------------------------------

Query:  --------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL
                      V  +RK+W+LK DDALWAYRTAFKTPIG SPY++
Subjt:  --------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL

A0A6P8DLJ8 Reverse transcriptase2.1e-24262.5Show/hide
Query:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ
        ML++L G+ +YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAF+RMPFGLCNAP TFQRCMMSIF  ++E+ +E+FMD FSVFG SF+SCL NL  VL+
Subjt:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ

Query:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN
        RC++ NL+LNWEKCHFMV EGIVLGHKVSKKG+EVDRAK+  IE+LPPPT+ KGVRSFLGHAGFYRRFIKDFSKI++PL NLLEK++ F+F+D CL AFN
Subjt:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN

Query:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF
         LKE+L +AP+IV P+W   FE+MC ASDYA+GAVLGQRR  +F AIYY+SRTL+  Q+ Y TTEKELLAV+FA DKFR YL+GSKI+V+TDHAALKYLF
Subjt:  TLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF

Query:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVK-DSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIK
         K D+KPRL+RWILLLQEFDLEI+D KG ENVVADHLSR+E++   S  PI EKF DEQL+  +   LPW+ADIVNY+     P  ++ QQKK+FLH++K
Subjt:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVK-DSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIK

Query:  SYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFD
         Y W++P L+K CAD +IR+CVP+ E +SI+  CH+   GGHFG  RTA K+L  GFYWP +F DC  ++ SC  CQRTGNISR+HE+P   IL +ELFD
Subjt:  SYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFD

Query:  VWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF--------------------------------------------
        VWGIDFMGPFP S++   YILVAVDYVSKWVEA+A ++NDAR V++FL KNIF+R                                             
Subjt:  VWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRF--------------------------------------------

Query:  --------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL
                      V  +RK+W+LK DDALWAYRTAFKTPIG SPY++
Subjt:  --------------VKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRL

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.8e-6641.45Show/hide
Query:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ
        +L +L   +++  +D   G++QI + PE   K  F+  +G + + RMPFGL NAP TFQRCM  I + L+     V++D   VF +S D  L +L  V +
Subjt:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ

Query:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKF-IFDDACLLAF
        +   ANL L  +KC F+  E   LGH ++  G++ +  KI AI++ P PT  K +++FLG  G+YR+FI +F+ IAKP++  L+K  K    +     AF
Subjt:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKF-IFDDACLLAF

Query:  NTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYL
          LK  +   PI+ VPD+++ F +  DASD ALGAVL Q        + Y SRTL+  +  Y+T EKELLA+V+A   FR YLLG    + +DH  L +L
Subjt:  NTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYL

Query:  FVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEE
        +  KD   +L RW + L EFD +IK  KG EN VAD LSRI+ EE
Subjt:  FVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEE

P20825 Retrovirus-related Pol polyprotein from transposon 2971.3e-6038.84Show/hide
Query:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ
        +L +L    ++  +D   G++QI +  E   K  F+   G + + RMPFGL NAP TFQRCM +I + L+     V++D   +F +S    L ++  V  
Subjt:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ

Query:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDD-ACLLAF
        +  DANL L  +KC F+  E   LGH V+  G++ +  K+ AI   P PT  K +R+FLG  G+YR+FI +++ IAKP+++ L+K  K        + AF
Subjt:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDD-ACLLAF

Query:  NTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYL
          LK  +I  PI+ +PD+ + F +  DAS+ ALGAVL Q        I + SRTL++ +  Y+  EKELLA+V+A   FR YLLG + ++ +DH  L++L
Subjt:  NTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYL

Query:  FVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEE
           K+   +L RW + L E+  +I   KG EN VAD LSRI+ EE
Subjt:  FVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEE

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.1e-6130.72Show/hide
Query:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ
        +L R+     +  LD +SGY+QI + P+D+ K  F  P G + +  MPFGL NAP+TF R M   F+ L    + V++D   +F  S +    +L  VL+
Subjt:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ

Query:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPL-------SNLLEKEAKFIFDD
        R ++ NL++  +KC F   E   LG+ +  + +   + K  AI   P P  VK  + FLG   +YRRFI + SKIA+P+       S   EK+ K     
Subjt:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPL-------SNLLEKEAKFIFDD

Query:  ACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQ--RRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHT
            A   LK  L  +P++V  +   ++ +  DAS   +GAVL +   ++ +   + Y S++L++ Q+ Y   E ELL ++ A+  FR  L G    + T
Subjt:  ACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQ--RRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHT

Query:  DHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPN-----M
        DH +L  L  K +   R+ RW+  L  +D  ++   G +NVVAD +SR            ++  S +  Y+           +  L   ++ P       
Subjt:  DHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPN-----M

Query:  NYQQKKRFLHNI-KSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCH-ASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQR-TGNISR
        +YQ+K        K+Y  ED ++Y        R  VP ++  +++   H  + +GGHFG T T AK+    +YWP L      ++++C +CQ    +  R
Subjt:  NYQQKKRFLHNI-KSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCH-ASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQR-TGNISR

Query:  QHEL--PMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATR-TNDARTVVKFLHKNIFT
         H L  P+ PI E    D+  +DF+   P + N    ILV VD  SK    IATR T DA  ++  L + IF+
Subjt:  QHEL--PMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATR-TNDARTVVKFLHKNIFT

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus2.4e-6237.61Show/hide
Query:  LDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQR
        L  L    ++  LD  SG++QI +   D  K  F+   G + F R+PFGL NAP  FQR +  I +  I  +  V++D   VF   +D+   NL  VL  
Subjt:  LDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQR

Query:  CQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLL-----------EKEAKFI
           ANL +N EK HF+ T+   LG+ V+  G++ D  K+ AI ++PPPT+VK ++ FLG   +YR+FI+D++K+AKPL+NL              +    
Subjt:  CQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLL-----------EKEAKFI

Query:  FDDACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGS-KIVV
         D+  L +FN LK  L ++ I+  P +++ F +  DAS++A+GAVL Q      R I Y SR+L+ T++ Y T EKE+LA+++++D  R+YL G+  I V
Subjt:  FDDACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGS-KIVV

Query:  HTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRI
        +TDH  L +    ++   +L RW   ++E++ E+  + G  NVVAD LSRI
Subjt:  HTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRI

Q99315 Transposon Ty3-G Gag-Pol polyprotein5.4e-6230.72Show/hide
Query:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ
        +L R+     +  LD +SGY+QI + P+D+ K  F  P G + +  MPFGL NAP+TF R M   F+ L    + V++D   +F  S +    +L  VL+
Subjt:  MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQ

Query:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPL-------SNLLEKEAKFIFDD
        R ++ NL++  +KC F   E   LG+ +  + +   + K  AI   P P  VK  + FLG   +YRRFI + SKIA+P+       S   EK+ K     
Subjt:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPL-------SNLLEKEAKFIFDD

Query:  ACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQ--RRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHT
            A + LK+ L  +P++V  +   ++ +  DAS   +GAVL +   ++ +   + Y S++L++ Q+ Y   E ELL ++ A+  FR  L G    + T
Subjt:  ACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQ--RRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHT

Query:  DHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPN-----M
        DH +L  L  K +   R+ RW+  L  +D  ++   G +NVVAD +SR            ++  S +  Y+           +  L   ++ P       
Subjt:  DHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPN-----M

Query:  NYQQKKRFLHNI-KSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCH-ASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQR-TGNISR
        +YQ+K        K+Y  ED ++Y        R  VP ++  +++   H  + +GGHFG T T AK+    +YWP L      ++++C +CQ    +  R
Subjt:  NYQQKKRFLHNI-KSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCH-ASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQR-TGNISR

Query:  QHEL--PMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATR-TNDARTVVKFLHKNIFT
         H L  P+ PI E    D+  +DF+   P + N    ILV VD  SK    IATR T DA  ++  L + IF+
Subjt:  QHEL--PMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATR-TNDARTVVKFLHKNIFT

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein2.7e-1664.29Show/hide
Query:  VLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFDVWGIDFM
        VLQ+GFYWP+ FKD   FV SCD CQR GN ++++E+P   ILEVE+FDVWGI FM
Subjt:  VLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFDVWGIDFM

ATMG00860.1 DNA/RNA polymerases superfamily protein2.1e-1635.38Show/hide
Query:  NLTRVLQRCQDANLVLNWEKCHFMVTEGIVLGHK--VSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIF
        +L  VLQ  +      N +KC F   +   LGH+  +S +G+  D AK+ A+   P P N   +R FLG  G+YRRF+K++ KI +PL+ LL+K +   +
Subjt:  NLTRVLQRCQDANLVLNWEKCHFMVTEGIVLGHK--VSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIF

Query:  DDACLLAFNTLKERLIAAPIIVVPDWSQSF
         +   LAF  LK  +   P++ +PD    F
Subjt:  DDACLLAFNTLKERLIAAPIIVVPDWSQSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTGATAGACTTGCAGGTTATTCTCATTATTGCTTTCTAGATGGATATTCAGGTTACAATCAAATTGTCATTGCTCCTGAGGATCAAGAAAAAATGACATTTACATG
TCCCTATGGGACATTTGCTTTTAAAAGAATGCCATTTGGTTTATGTAATGCCCCTACTACTTTTCAACGTTGTATGATGTCTATTTTTCAGGGGCTCATCGAAGACATAA
TGGAGGTTTTTATGGATTATTTTTCTGTATTTGGGTCTTCATTTGATTCTTGTCTTGTTAATCTAACCCGTGTTTTGCAGAGGTGTCAAGATGCTAACCTTGTTCTAAAC
TGGGAGAAGTGTCACTTCATGGTGACGGAAGGCATCGTCCTGGGGCACAAAGTCTCTAAAAAAGGATTGGAGGTGGATAGGGCTAAAATAGTTGCTATTGAACAACTACC
ACCTCCCACCAATGTGAAAGGAGTCAGAAGCTTCCTAGGGCATGCAGGGTTTTACAGGAGATTTATTAAAGATTTTTCTAAAATTGCTAAACCCTTGAGTAATTTGTTAG
AAAAGGAAGCTAAATTTATTTTTGATGATGCATGTCTACTGGCTTTTAATACTTTGAAGGAAAGACTAATTGCTGCACCCATTATTGTTGTACCAGATTGGAGTCAGTCT
TTTGAGATCATGTGTGATGCTAGTGATTATGCTTTAGGAGCTGTTTTAGGCCAACGTAGAGATAACATGTTTAGGGCAATTTATTATTCTAGTAGGACTCTTGATAATAC
TCAACAGAAATACACTACTACTGAAAAAGAACTACTAGCTGTTGTGTTTGCCATTGATAAATTTAGATCATACTTGCTTGGCTCTAAAATAGTAGTGCATACTGACCATG
CTGCTTTAAAGTACTTGTTTGTTAAGAAAGATTCTAAACCTAGGCTAATGAGGTGGATATTATTGTTACAGGAATTTGACCTAGAAATCAAAGACAGGAAAGGATGTGAA
AATGTGGTTGCAGACCACTTATCTAGAATTGAGAATGAGGAAGCTAAATCATGGCCCCCAATTGTTGAGAAGTTCTCTGATGAACAACTGTATCAGGTAAAAGATAGTTT
GCCCTGGTTTGCTGACATAGTTAATTATCTTGCAGGAGGACATTTGCCACCTAACATGAACTATCAACAAAAGAAAAGATTCCTGCACAATATTAAGTCTTACCATTGGG
AGGACCCACTTCTCTACAAGGTTTGTGCTGACAATATGATAAGAAAGTGTGTACCTCAAGAGGAAGTGGTAAGTATTTTAAATTCATGTCATGCTTCACCCTATGGAGGT
CATTTTGGACCCACCAGAACTGCAGCTAAGGTACTTCAGTCAGGATTTTATTGGCCATCCCTTTTTAAAGACTGTTGTACCTTTGTTAAGTCATGTGATAGGTGCCAACG
TACTGGCAATATTTCTAGACAACATGAGCTTCCGATGAAACCTATCTTGGAAGTGGAGTTATTTGATGTTTGGGGTATTGACTTTATGGGACCTTTTCCAATTTCTTATA
ATGGCTACCTATATATTCTAGTTGCAGTAGATTATGTATCTAAGTGGGTAGAAGCCATAGCTACTAGGACCAATGATGCTCGCACTGTTGTAAAATTCTTGCATAAAAAC
ATTTTCACACGTTTTGTCAAGACCAATAGGAAGGAATGGGCCCTTAAGCACGATGATGCACTGTGGGCCTACCGCACAGCTTTCAAAACCCCAATTGGTACTTCCCCGTA
TAGGTTGACAAGGAAAAGCTTGTCACTTACCGGTAGAGCTCGAGCATAG
mRNA sequenceShow/hide mRNA sequence
ATGCTTGATAGACTTGCAGGTTATTCTCATTATTGCTTTCTAGATGGATATTCAGGTTACAATCAAATTGTCATTGCTCCTGAGGATCAAGAAAAAATGACATTTACATG
TCCCTATGGGACATTTGCTTTTAAAAGAATGCCATTTGGTTTATGTAATGCCCCTACTACTTTTCAACGTTGTATGATGTCTATTTTTCAGGGGCTCATCGAAGACATAA
TGGAGGTTTTTATGGATTATTTTTCTGTATTTGGGTCTTCATTTGATTCTTGTCTTGTTAATCTAACCCGTGTTTTGCAGAGGTGTCAAGATGCTAACCTTGTTCTAAAC
TGGGAGAAGTGTCACTTCATGGTGACGGAAGGCATCGTCCTGGGGCACAAAGTCTCTAAAAAAGGATTGGAGGTGGATAGGGCTAAAATAGTTGCTATTGAACAACTACC
ACCTCCCACCAATGTGAAAGGAGTCAGAAGCTTCCTAGGGCATGCAGGGTTTTACAGGAGATTTATTAAAGATTTTTCTAAAATTGCTAAACCCTTGAGTAATTTGTTAG
AAAAGGAAGCTAAATTTATTTTTGATGATGCATGTCTACTGGCTTTTAATACTTTGAAGGAAAGACTAATTGCTGCACCCATTATTGTTGTACCAGATTGGAGTCAGTCT
TTTGAGATCATGTGTGATGCTAGTGATTATGCTTTAGGAGCTGTTTTAGGCCAACGTAGAGATAACATGTTTAGGGCAATTTATTATTCTAGTAGGACTCTTGATAATAC
TCAACAGAAATACACTACTACTGAAAAAGAACTACTAGCTGTTGTGTTTGCCATTGATAAATTTAGATCATACTTGCTTGGCTCTAAAATAGTAGTGCATACTGACCATG
CTGCTTTAAAGTACTTGTTTGTTAAGAAAGATTCTAAACCTAGGCTAATGAGGTGGATATTATTGTTACAGGAATTTGACCTAGAAATCAAAGACAGGAAAGGATGTGAA
AATGTGGTTGCAGACCACTTATCTAGAATTGAGAATGAGGAAGCTAAATCATGGCCCCCAATTGTTGAGAAGTTCTCTGATGAACAACTGTATCAGGTAAAAGATAGTTT
GCCCTGGTTTGCTGACATAGTTAATTATCTTGCAGGAGGACATTTGCCACCTAACATGAACTATCAACAAAAGAAAAGATTCCTGCACAATATTAAGTCTTACCATTGGG
AGGACCCACTTCTCTACAAGGTTTGTGCTGACAATATGATAAGAAAGTGTGTACCTCAAGAGGAAGTGGTAAGTATTTTAAATTCATGTCATGCTTCACCCTATGGAGGT
CATTTTGGACCCACCAGAACTGCAGCTAAGGTACTTCAGTCAGGATTTTATTGGCCATCCCTTTTTAAAGACTGTTGTACCTTTGTTAAGTCATGTGATAGGTGCCAACG
TACTGGCAATATTTCTAGACAACATGAGCTTCCGATGAAACCTATCTTGGAAGTGGAGTTATTTGATGTTTGGGGTATTGACTTTATGGGACCTTTTCCAATTTCTTATA
ATGGCTACCTATATATTCTAGTTGCAGTAGATTATGTATCTAAGTGGGTAGAAGCCATAGCTACTAGGACCAATGATGCTCGCACTGTTGTAAAATTCTTGCATAAAAAC
ATTTTCACACGTTTTGTCAAGACCAATAGGAAGGAATGGGCCCTTAAGCACGATGATGCACTGTGGGCCTACCGCACAGCTTTCAAAACCCCAATTGGTACTTCCCCGTA
TAGGTTGACAAGGAAAAGCTTGTCACTTACCGGTAGAGCTCGAGCATAG
Protein sequenceShow/hide protein sequence
MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQRCQDANLVLN
WEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFNTLKERLIAAPIIVVPDWSQS
FEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCE
NVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGG
HFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKN
IFTRFVKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRLTRKSLSLTGRARA