CuGenDBv2

Cmc02g0048461 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc02g0048461
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr02:13960274..13961818
RNA-Seq Expression: Cmc02g0048461
Synteny: Cmc02g0048461
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
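
The genome location field above has the form sequence ID, colon, start, two dots, end. The following is a minimal sketch of how such a string could be split and the span length computed. The function name `parse_location` is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end, length)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return seqid, start, end, end - start + 1


seqid, start, end, length = parse_location("CMiso1.1chr02:13960274..13961818")
print(seqid, start, end, length)
```

Under that assumption the span is 1545 bp, which can be compared against the length of the CDS sequence listed further down.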


Homology
GenBank top hits
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.2e-287; % identity: 94.55)
Query:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
        MDL FQ+YMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA+LPSSFWGY VETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
Subjt:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQEN+VFVSTNATFL+EDHMR+HKPRSKL+L+EATDES RVVDEVGPSSRVDETTT GQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM

Query:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV
        PRRSGR+ SQPNRYLGL+ETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM+LEMESMYFNSVWELVDL EGVKPIGC+WIYKRKRDSAGKVQTFK RLV
Subjt:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK
        AKGYTQREGVDYEETFSPVAMLKSIRIL SIATFYDYEIWQMDVKTAFLNGNL++SIF+SQPEGFIT+GQEQKVCKLN SIYGLKQASRSWNIRFDTAIK
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK

Query:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK
        SYGFDQNVDEPCVYKKINKGKVAFLVLY+DDILLI NDVGYLTDVKAWLAAQFQMKDLGEAQYVLGI+IIRDRKNKTLALSQATYIDK+LVRYSMQNSKK
Subjt:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK

Query:  GLLPFRHGVHLSKE
        GLLPFRHGVHLSKE
Subjt:  GLLPFRHGVHLSKE

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 5.1e-284; % identity: 93.39)
Query:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
        MDL FQ+YMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA+LPSSFWGY VETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
Subjt:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDP+EN+VFVSTNATFL+EDHMR+HKPRSKL+L+EATDES RVVDEVGPSSRVDETTT GQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM

Query:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV
        PRRSGR+ SQPNRYLGL+ETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM+LEMESMYFNSVWELVDL EGVKPIGC+WIYKRKRDSAGKVQTFK RLV
Subjt:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK
        AKGYT++EGVDYEETFS VAMLKSIRIL SIA FYDYEIWQMDVKTAFLNGNL++SIF+SQPEGFIT+GQEQKVCKLN SIYGLKQASRSWNIRFDTAIK
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK

Query:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK
        SYGFDQNVDEPCVYKKINKGKVAFLVLY+DDILLI NDVGYLTDVKAWLAAQFQMKDLGE QYVLGI+IIRDRKNKTLALSQATYIDK+LVRYSMQNSKK
Subjt:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK

Query:  GLLPFRHGVHLSKE
        GLLPFRHGVHLSKE
Subjt:  GLLPFRHGVHLSKE

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.2e-287; % identity: 94.55)
Query:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
        MDL FQ+YMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA+LPSSFWGY VETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
Subjt:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQEN+VFVSTNATFL+EDHMR+HKPRSKL+L+EATDES RVVDEVGPSSRVDETTT GQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM

Query:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV
        PRRSGR+ SQPNRYLGL+ETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM+LEMESMYFNSVWELVDL EGVKPIGC+WIYKRKRDSAGKVQTFK RLV
Subjt:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK
        AKGYTQREGVDYEETFSPVAMLKSIRIL SIATFYDYEIWQMDVKTAFLNGNL++SIF+SQPEGFIT+GQEQKVCKLN SIYGLKQASRSWNIRFDTAIK
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK

Query:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK
        SYGFDQNVDEPCVYKKINKGKVAFLVLY+DDILLI NDVGYLTDVKAWLAAQFQMKDLGEAQYVLGI+IIRDRKNKTLALSQATYIDK+LVRYSMQNSKK
Subjt:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK

Query:  GLLPFRHGVHLSKE
        GLLPFRHGVHLSKE
Subjt:  GLLPFRHGVHLSKE

TYK10336.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 9.3e-302; % identity: 100)
Query:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
        MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
Subjt:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM

Query:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV
        PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV
Subjt:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK
        AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK

Query:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK
        SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK
Subjt:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK

Query:  GLLPFRHGVHLSKE
        GLLPFRHGVHLSKE
Subjt:  GLLPFRHGVHLSKE

TYK15984.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 4.3e-283; % identity: 93.19)
Query:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
        MDL FQ+YMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMV SMMSYA+LPSSFWGY VETAVHILNNVPSKSVS+ PFELWRGRKPSLSHFRIWGCPAH
Subjt:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQEN+VFVSTNATFL+EDHMR+HKPRSKL+L+EATDES RVVDEVGPSSRVDETTT GQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM

Query:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV
        PRRSGR+ SQPNRYLGL+ETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM+LEMESMYFNSVWELVDL EGVKPIGC+WIYKRKRDSAGKVQTFK RLV
Subjt:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK
        AKGYTQREGVDYEETFSPVAMLKSIRIL SIATFYDYEIWQMDVKTAFLNGNL++SIF+SQPEGFIT+GQEQKVCKLN SIYGLKQASRSWNIRFDTAIK
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK

Query:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK
        SYGFDQNVDEPCVYKKINKGKVAFLVLY+DDILLI NDVGYLTDVKAWLAAQFQMKDLGEAQYVLGI+IIRDRKNKTLALSQATYIDKMLVRY MQNSKK
Subjt:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK

Query:  GLLPFRHGVHLSKE
         LLPF+HG HLS+E
Subjt:  GLLPFRHGVHLSKE

TrEMBL top hits
A0A5A7T2V9 Gag/pol protein (e value: 2.5e-284; % identity: 93.39)
Query:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
        MDL FQ+YMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA+LPSSFWGY VETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
Subjt:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDP+EN+VFVSTNATFL+EDHMR+HKPRSKL+L+EATDES RVVDEVGPSSRVDETTT GQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM

Query:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV
        PRRSGR+ SQPNRYLGL+ETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM+LEMESMYFNSVWELVDL EGVKPIGC+WIYKRKRDSAGKVQTFK RLV
Subjt:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK
        AKGYT++EGVDYEETFS VAMLKSIRIL SIA FYDYEIWQMDVKTAFLNGNL++SIF+SQPEGFIT+GQEQKVCKLN SIYGLKQASRSWNIRFDTAIK
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK

Query:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK
        SYGFDQNVDEPCVYKKINKGKVAFLVLY+DDILLI NDVGYLTDVKAWLAAQFQMKDLGE QYVLGI+IIRDRKNKTLALSQATYIDK+LVRYSMQNSKK
Subjt:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK

Query:  GLLPFRHGVHLSKE
        GLLPFRHGVHLSKE
Subjt:  GLLPFRHGVHLSKE

A0A5A7TZD0 Gag/pol protein (e value: 1.1e-287; % identity: 94.55)
Query:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
        MDL FQ+YMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA+LPSSFWGY VETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
Subjt:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQEN+VFVSTNATFL+EDHMR+HKPRSKL+L+EATDES RVVDEVGPSSRVDETTT GQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM

Query:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV
        PRRSGR+ SQPNRYLGL+ETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM+LEMESMYFNSVWELVDL EGVKPIGC+WIYKRKRDSAGKVQTFK RLV
Subjt:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK
        AKGYTQREGVDYEETFSPVAMLKSIRIL SIATFYDYEIWQMDVKTAFLNGNL++SIF+SQPEGFIT+GQEQKVCKLN SIYGLKQASRSWNIRFDTAIK
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK

Query:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK
        SYGFDQNVDEPCVYKKINKGKVAFLVLY+DDILLI NDVGYLTDVKAWLAAQFQMKDLGEAQYVLGI+IIRDRKNKTLALSQATYIDK+LVRYSMQNSKK
Subjt:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK

Query:  GLLPFRHGVHLSKE
        GLLPFRHGVHLSKE
Subjt:  GLLPFRHGVHLSKE

A0A5A7UYE8 Gag/pol protein (e value: 1.1e-287; % identity: 94.55)
Query:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
        MDL FQ+YMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA+LPSSFWGY VETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
Subjt:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQEN+VFVSTNATFL+EDHMR+HKPRSKL+L+EATDES RVVDEVGPSSRVDETTT GQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM

Query:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV
        PRRSGR+ SQPNRYLGL+ETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM+LEMESMYFNSVWELVDL EGVKPIGC+WIYKRKRDSAGKVQTFK RLV
Subjt:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK
        AKGYTQREGVDYEETFSPVAMLKSIRIL SIATFYDYEIWQMDVKTAFLNGNL++SIF+SQPEGFIT+GQEQKVCKLN SIYGLKQASRSWNIRFDTAIK
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK

Query:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK
        SYGFDQNVDEPCVYKKINKGKVAFLVLY+DDILLI NDVGYLTDVKAWLAAQFQMKDLGEAQYVLGI+IIRDRKNKTLALSQATYIDK+LVRYSMQNSKK
Subjt:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK

Query:  GLLPFRHGVHLSKE
        GLLPFRHGVHLSKE
Subjt:  GLLPFRHGVHLSKE

A0A5D3CIZ2 Gag/pol protein (e value: 4.5e-302; % identity: 100)
Query:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
        MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
Subjt:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM

Query:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV
        PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV
Subjt:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK
        AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK

Query:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK
        SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK
Subjt:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK

Query:  GLLPFRHGVHLSKE
        GLLPFRHGVHLSKE
Subjt:  GLLPFRHGVHLSKE

A0A5D3CYF4 Gag/pol protein (e value: 2.1e-283; % identity: 93.19)
Query:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH
        MDL FQ+YMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMV SMMSYA+LPSSFWGY VETAVHILNNVPSKSVS+ PFELWRGRKPSLSHFRIWGCPAH
Subjt:  MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH

Query:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM
        VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQEN+VFVSTNATFL+EDHMR+HKPRSKL+L+EATDES RVVDEVGPSSRVDETTT GQSHPSQSLRM
Subjt:  VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRM

Query:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV
        PRRSGR+ SQPNRYLGL+ETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM+LEMESMYFNSVWELVDL EGVKPIGC+WIYKRKRDSAGKVQTFK RLV
Subjt:  PRRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK
        AKGYTQREGVDYEETFSPVAMLKSIRIL SIATFYDYEIWQMDVKTAFLNGNL++SIF+SQPEGFIT+GQEQKVCKLN SIYGLKQASRSWNIRFDTAIK
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIK

Query:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK
        SYGFDQNVDEPCVYKKINKGKVAFLVLY+DDILLI NDVGYLTDVKAWLAAQFQMKDLGEAQYVLGI+IIRDRKNKTLALSQATYIDKMLVRY MQNSKK
Subjt:  SYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKK

Query:  GLLPFRHGVHLSKE
         LLPF+HG HLS+E
Subjt:  GLLPFRHGVHLSKE

SwissProt top hits
P04146 Copia protein (e value: 2.7e-70; % identity: 29.78)
Query:  QEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSV---SETPFELWRGRKPSLSHFRIWGCPAHVL
        +++ ++ GI   L+ P TPQ NGVSER  RT+ +  R+M+S A+L  SFWG  V TA +++N +PS+++   S+TP+E+W  +KP L H R++G   +V 
Subjt:  QEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSV---SETPFELWRGRKPSLSHFRIWGCPAHVL

Query:  VTNPK-KLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHM---RDHKPRSKLLLNEATDE-------SRRVVDEVGP--SSRVDETTTL
        + N + K + +S    FVGY  E  G   +D   N+ F+      +DE +M   R  K  +  L +    E       SR+++    P  S   D    L
Subjt:  VTNPK-KLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHM---RDHKPRSKLLLNEATDE-------SRRVVDEVGP--SSRVDETTTL

Query:  GQSHPSQSLRMPRRSGRI--------------------ESQPNRYL-------------------------GLSETQVVIPDDGVEDP------------
          S  S++   P  S +I                      + N+Y                            SET   + + G+++P            
Subjt:  GQSHPSQSLRMPRRSGRI--------------------ESQPNRYL-------------------------GLSETQVVIPDDGVEDP------------

Query:  ---------LSYKQ--------------AMNDV-----------DKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTR
                 +SY +                NDV           DK  W +A+N E+ +   N+ W +    E    +  RW++  K +  G    +K R
Subjt:  ---------LSYKQ--------------AMNDV-----------DKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTR

Query:  LVAKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTA
        LVA+G+TQ+  +DYEETF+PVA + S R + S+   Y+ ++ QMDVKTAFLNG LK+ I++  P+G         VCKLN +IYGLKQA+R W   F+ A
Subjt:  LVAKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTA

Query:  IKSYGFDQNVDEPCVY--KKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQ
        +K   F  +  + C+Y   K N  +  +++LY+DD+++   D+  + + K +L  +F+M DL E ++ +GI+I  + +   + LSQ+ Y+ K+L +++M+
Subjt:  IKSYGFDQNVDEPCVY--KKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQ

Query:  N
        N
Subjt:  N

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 6.4e-104; % identity: 38.59)
Query:  FQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCP--AHV
        F+EY   HGI+ + + PGTPQ NGV+ER NRT+++ VRSM+  A+LP SFWG  V+TA +++N  PS  ++ E P  +W  ++ S SH +++GC   AHV
Subjt:  FQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCP--AHV

Query:  LVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHM----RDHKPRSKLLLNEAT--------DESRRVVDEVG-----PSSRVD
              KL+ +S  C F+GY  E  G   +DP + KV  S +  F + +         K ++ ++ N  T          +    DEV      P   ++
Subjt:  LVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHM----RDHKPRSKLLLNEAT--------DESRRVVDEVG-----PSSRVD

Query:  ETTTLGQ-----SHPSQSLRMP---RRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPI
        +   L +      HP+Q        RRS R   +  RY   S   V+I DD   +P S K+ ++  +K+Q +KAM  EMES+  N  ++LV+L +G +P+
Subjt:  ETTTLGQ-----SHPSQSLRMP---RRSGRIESQPNRYLGLSETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPI

Query:  GCRWIYKRKRDSAGKVQTFKTRLVAKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCK
         C+W++K K+D   K+  +K RLV KG+ Q++G+D++E FSPV  + SIR + S+A   D E+ Q+DVKTAFL+G+L++ I++ QPEGF   G++  VCK
Subjt:  GCRWIYKRKRDSAGKVQTFKTRLVAKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCK

Query:  LNGSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY-KKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKN
        LN S+YGLKQA R W ++FD+ +KS  + +   +PCVY K+ ++     L+LY+DD+L++  D G +  +K  L+  F MKDLG AQ +LG+KI+R+R +
Subjt:  LNGSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY-KKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKN

Query:  KTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKE
        + L LSQ  YI+++L R++M+N+K    P    + LSK+
Subjt:  KTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKE

P25600 Putative transposon Ty5-1 protein YCL074W (e value: 2.0e-17; % identity: 34.69)
Query:  MDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGY
        MDV TAFLN  + + I++ QP GF+ E     V +L G +YGLKQA   WN   +  +K  GF ++  E  +Y +       ++ +Y+DD+L+       
Subjt:  MDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDK
           VK  L   + MKDLG+    LG+  I    N  + LS   YI K
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.3e-61; % identity: 27.01)
Query:  EYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT-
        EY  +HGI    S P TP+ NG+SER++R +++   +++S+A +P ++W Y    AV+++N +P+  +  E+PF+   G  P+    R++GC  +  +  
Subjt:  EYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT-

Query:  -NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATF---------------------------------------------LDEDHMRDHKPR
         N  KL+ +SR C F+GY       L    Q +++++S +  F                                               + H     P 
Subjt:  -NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATF---------------------------------------------LDEDHMRDHKPR

Query:  SKLLLNEATDESRRVVD-----------------EVGPSSRVDETTTLGQSHPS-----------------QSLRMPRRSGRIESQPNRYLGLSETQVVI
        S       +  S   +D                 + GP      T T  Q+H S                 QSL  P +S      P      S T    
Subjt:  SKLLLNEATDESRRVVD-----------------EVGPSSRVDETTTLGQSHPS-----------------QSLRMPRRSGRIESQPNRYLGLSETQVVI

Query:  PDDGVEDPLSYKQAMND--------------------------------------------VDKDQWVKAMNLEMESMYFNSVWELVDLLEG-VKPIGCR
        P   +  P    Q +N+                                            +  ++W  AM  E+ +   N  W+LV      V  +GCR
Subjt:  PDDGVEDPLSYKQAMND--------------------------------------------VDKDQWVKAMNLEMESMYFNSVWELVDLLEG-VKPIGCR

Query:  WIYKRKRDSAGKVQTFKTRLVAKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNG
        WI+ +K +S G +  +K RLVAKGY QR G+DY ETFSPV    SIRI+  +A    + I Q+DV  AFL G L   +++SQP GFI + +   VCKL  
Subjt:  WIYKRKRDSAGKVQTFKTRLVAKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNG

Query:  SIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLA
        ++YGLKQA R+W +     + + GF  +V +  ++       + ++++Y+DDIL+  ND   L +    L+ +F +KD  E  Y LGI+    R    L 
Subjt:  SIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLA

Query:  LSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLS
        LSQ  YI  +L R +M  +K    P      LS
Subjt:  LSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 3.1e-58; % identity: 25.91)
Query:  LTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHV
        +  ++Y+ +HGI    S P TP+ NG+SER++R +++M  +++S+A +P ++W Y    AV+++N +P+  +  ++PF+   G+ P+    +++GC  + 
Subjt:  LTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHV

Query:  LVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATF---------------LDEDHMRDHKPR---------------SKLLLNEATDE
         +   N  KLE +S+ C F+GY       L       +++ S +  F                 ++   D  P                +   L    D 
Subjt:  LVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATF---------------LDEDHMRDHKPR---------------SKLLLNEATDE

Query:  SRR------------VVDEVGPSSRVDETTT-------------LGQSHPSQS-------LRMPRRSGRIESQPNRYLGLSETQVVIP------------
        S R            V     PSS +   ++               Q H +Q+       L  P  +    + PN+   L ++ +  P            
Subjt:  SRR------------VVDEVGPSSRVDETTT-------------LGQSHPSQS-------LRMPRRSGRIESQPNRYLGLSETQVVIP------------

Query:  ----------------------------------------DDGVEDP---LSY----------KQAMNDVDKDQWVKAMNLEMESMYFNSVWELV-DLLE
                                                 DG+  P    SY          + A+  +  D+W +AM  E+ +   N  W+LV     
Subjt:  ----------------------------------------DDGVEDP---LSY----------KQAMNDVDKDQWVKAMNLEMESMYFNSVWELV-DLLE

Query:  GVKPIGCRWIYKRKRDSAGKVQTFKTRLVAKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQE
         V  +GCRWI+ +K +S G +  +K RLVAKGY QR G+DY ETFSPV    SIRI+  +A    + I Q+DV  AFL G L   +++SQP GF+ + + 
Subjt:  GVKPIGCRWIYKRKRDSAGKVQTFKTRLVAKGYTQREGVDYEETFSPVAMLKSIRILFSIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQE

Query:  QKVCKLNGSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIR
          VC+L  +IYGLKQA R+W +   T + + GF  ++ +  ++       + ++++Y+DDIL+  ND   L      L+ +F +K+  +  Y LGI+   
Subjt:  QKVCKLNGSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIR

Query:  DRKNKTLALSQATYIDKMLVRYSMQNSKKGLLP
         R  + L LSQ  Y   +L R +M  +K    P
Subjt:  DRKNKTLALSQATYIDKMLVRYSMQNSKKGLLP

Arabidopsis top hits
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 4.6e-49; % identity: 34.49)
Query:  EDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLVAKGYTQREGVDYEETFSPVAMLKSIRILF
        ++P +Y +A   +    W  AM+ E+ +M     WE+  L    KPIGC+W+YK K +S G ++ +K RLVAKGYTQ+EG+D+ ETFSPV  L S++++ 
Subjt:  EDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLVAKGYTQREGVDYEETFSPVAMLKSIRILF

Query:  SIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQE----QKVCKLNGSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFL
        +I+  Y++ + Q+D+  AFLNG+L + I++  P G+     +      VC L  SIYGLKQASR W ++F   +  +GF Q+  +   + KI       +
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQE----QKVCKLNGSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFL

Query:  VLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLS
        ++Y+DDI++  N+   + ++K+ L + F+++DLG  +Y LG++I R      + + Q  Y   +L    +   K   +P    V  S
Subjt:  VLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLS

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 1.2e-09; % identity: 37.8)
Query:  NRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSR
        NRT+++ VRSM+    LP +F      TAVHI+N  PS +++   P E+W    P+ S+ R +GC A++   +  KL+PR++
Subjt:  NRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSR

ATMG00810.1 DNA/RNA polymerases superfamily protein (e value: 3.5e-04; % identity: 38.16)
Query:  FLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSK
        +L+LY+DDILL  +    L  +   L++ F MKDLG   Y LGI+I        L LSQ  Y +++L    M + K
Subjt:  FLVLYMDDILLIRNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 4.1e-13; % identity: 38.37)
Query:  WVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLVAKGYTQREGVDYEETFSPVAMLKSIRILFSIA
        W +AM  E++++  N  W LV        +GC+W++K K  S G +   K RLVAKG+ Q EG+ + ET+SPV    +IR + ++A
Subjt:  WVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLVAKGYTQREGVDYEETFSPVAMLKSIRILFSIA
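
In the alignments above, the unlabelled line between each Query and Subjt pair is the match line: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The following is a rough sketch of how a percent identity could be estimated from those match lines. The function name `percent_identity` is hypothetical, and the sketch assumes identity is counted as identical columns over all aligned columns. It is not necessarily how the figures reported on this page were produced.

```python
def percent_identity(segments):
    """Estimate percent identity from (query_line, match_line) pairs."""
    identical = 0
    columns = 0
    for query, midline in segments:
        # Trailing blanks in a match line are often stripped; pad to query width.
        midline = midline.ljust(len(query))
        if len(midline) != len(query):
            raise ValueError("match line is longer than its query line")
        # Letters mark identities; '+' and blanks do not count.
        identical += sum(1 for ch in midline if ch.isalpha())
        columns += len(query)
    if columns == 0:
        raise ValueError("no aligned columns")
    return 100.0 * identical / columns


# First ten columns of the first GenBank hit, then its final block.
print(percent_identity([("MDLTFQEYMI", "MDL FQ+YMI")]))
print(percent_identity([("GLLPFRHGVHLSKE", "GLLPFRHGVHLSKE")]))
```

Passing every block of one hit in a single list gives the figure for the whole alignment rather than for a single block.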


Sequences
CDS sequence
ATGGATTTGACATTCCAAGAATATATGATAGAACATGGAATCCAATCTCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAACCTT
GTTAGACATGGTCCGTTCAATGATGAGTTACGCTGAATTGCCTAGCTCGTTTTGGGGTTATGTAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGTG
TTTCTGAAACACCTTTCGAGTTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTGTTAGTGACAAATCCCAAGAAGTTGGAA
CCTCGTTCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTTGATCCACAAGAAAATAAAGTGTTTGTATCGACAAATGCTACTTTCTT
GGACGAAGACCACATGAGAGATCATAAACCACGAAGCAAATTATTATTAAATGAAGCTACTGATGAATCAAGAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGAGTTG
ATGAAACTACCACATTAGGTCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGAATTGAATCGCAACCTAACCGTTATTTGGGTTTAAGTGAAACT
CAGGTTGTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGAACCTTGAAATGGAGTC
TATGTACTTCAATTCAGTGTGGGAGCTTGTAGATCTACTTGAAGGGGTAAAACCTATAGGGTGTAGATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGA
CCTTCAAAACGAGACTTGTAGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAAGAAACTTTTTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTTTCC
ATCGCCACATTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTAAAAAGAGTATCTTTTTGTCTCAGCCCGAGGGGTTCATAAC
CGAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATGGATCCATTTATGGGTTGAAACAAGCATCTAGATCTTGGAACATTAGATTTGATACTGCGATCAAATCCTATGGTT
TTGACCAAAACGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATATGGACGATATCCTCCTCATTAGGAATGATGTGGGA
TACCTTACTGACGTTAAAGCTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCAAAATCATAAGGGATCGTAAGAACAAAAC
ACTAGCACTGTCTCAAGCAACCTATATCGACAAAATGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGGGTTCACTTGTCTAAGG
AATAG
mRNA sequence
ATGGATTTGACATTCCAAGAATATATGATAGAACATGGAATCCAATCTCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAACCTT
GTTAGACATGGTCCGTTCAATGATGAGTTACGCTGAATTGCCTAGCTCGTTTTGGGGTTATGTAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGTG
TTTCTGAAACACCTTTCGAGTTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTGTTAGTGACAAATCCCAAGAAGTTGGAA
CCTCGTTCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTTGATCCACAAGAAAATAAAGTGTTTGTATCGACAAATGCTACTTTCTT
GGACGAAGACCACATGAGAGATCATAAACCACGAAGCAAATTATTATTAAATGAAGCTACTGATGAATCAAGAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGAGTTG
ATGAAACTACCACATTAGGTCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGAATTGAATCGCAACCTAACCGTTATTTGGGTTTAAGTGAAACT
CAGGTTGTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGAACCTTGAAATGGAGTC
TATGTACTTCAATTCAGTGTGGGAGCTTGTAGATCTACTTGAAGGGGTAAAACCTATAGGGTGTAGATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGA
CCTTCAAAACGAGACTTGTAGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAAGAAACTTTTTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTTTCC
ATCGCCACATTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTAAAAAGAGTATCTTTTTGTCTCAGCCCGAGGGGTTCATAAC
CGAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATGGATCCATTTATGGGTTGAAACAAGCATCTAGATCTTGGAACATTAGATTTGATACTGCGATCAAATCCTATGGTT
TTGACCAAAACGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATATGGACGATATCCTCCTCATTAGGAATGATGTGGGA
TACCTTACTGACGTTAAAGCTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCAAAATCATAAGGGATCGTAAGAACAAAAC
ACTAGCACTGTCTCAAGCAACCTATATCGACAAAATGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGGGTTCACTTGTCTAAGG
AATAG
Protein sequence
MDLTFQEYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAELPSSFWGYVVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLE
PRSRLCQFVGYPKETRGGLFFDPQENKVFVSTNATFLDEDHMRDHKPRSKLLLNEATDESRRVVDEVGPSSRVDETTTLGQSHPSQSLRMPRRSGRIESQPNRYLGLSET
QVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMNLEMESMYFNSVWELVDLLEGVKPIGCRWIYKRKRDSAGKVQTFKTRLVAKGYTQREGVDYEETFSPVAMLKSIRILFS
IATFYDYEIWQMDVKTAFLNGNLKKSIFLSQPEGFITEGQEQKVCKLNGSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIRNDVG
YLTDVKAWLAAQFQMKDLGEAQYVLGIKIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKE
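
The protein sequence above should correspond to a translation of the CDS sequence. The following is a minimal sketch of that translation, assuming the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative and are not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS to protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 nucleotides of the CDS listed above.
print(translate("ATGGATTTGACATTCCAAGAATATATGATA"))  # MDLTFQEYMI
print(translate("TCTAAGGAATAG"))                    # SKE
```

Joining all the CDS lines into a single string and passing it to `translate` allows a direct comparison with the listed protein sequence.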