; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc07g0193041 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc07g0193041
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr07:14284081..14285589
RNA-Seq ExpressionCmc07g0193041
SyntenyCmc07g0193041
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]3.0e-27393.65Show/hide
Query:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN
        MDLEME MYFNSVWELVDLPEGVK IGCKWIYKRKR+SAGKVQTFKARLVAKGYTQREGV+YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAF N
Subjt:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN

Query:  GNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQ-SRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI-LIGNDVGYFTDIKAWLA
        GNLEESIFMS PEGFITQGQEQKVCK+NRSIYGLKQ SRSWNIRFDTAIKSY FDQNVDEPCVYKKINKGKVAFLVLYVDDI LIGNDVGY TD+KAWLA
Subjt:  GNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQ-SRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI-LIGNDVGYFTDIKAWLA

Query:  VQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDIC
         QFQMKDL EAQYVLGIQIIRD KNKTLALSQATYIDK+LVRYSM+NSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAV SL+Y MLCTRPDIC
Subjt:  VQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDIC

Query:  YVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEA
        Y VGIV+RYQSNPGLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTS SVFTLNGGAVVWRSIKQGCI DST+EAEYVA CEA
Subjt:  YVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEA

Query:  AKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQD
        AKEAVWLRKFLHDLEVVPNMNLPITLYCDNSG VANSKEPRSHKRGKHIERKYHLIREIVQR DVIVTKIASEHNI DPFTKTLTAKVFEGHLESLGL+D
Subjt:  AKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQD

Query:  MYIR
        MYIR
Subjt:  MYIR

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]2.4e-27092.66Show/hide
Query:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN
        MDLEME MYFNSVWELVDLPEGVK IGCKWIYKRKR+SAGKVQTFKARLVAKGYT++EGV+YEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAF N
Subjt:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN

Query:  GNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQ-SRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI-LIGNDVGYFTDIKAWLA
        GNLEESIFMS PEGFITQGQEQKVCK+NRSIYGLKQ SRSWNIRFDTAIKSY FDQNVDEPCVYKKINKGKVAFLVLYVDDI LIGNDVGY TD+KAWLA
Subjt:  GNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQ-SRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI-LIGNDVGYFTDIKAWLA

Query:  VQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDIC
         QFQMKDL E QYVLGIQIIRD KNKTLALSQATYIDK+LVRYSM+NSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAV SL+Y MLCTRPDIC
Subjt:  VQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDIC

Query:  YVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEA
        Y VGIV+RYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYT+SDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCI DST+EAEYVA CEA
Subjt:  YVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEA

Query:  AKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQD
        AKEAVWL+KFLHDLEVVPNMNLPITLYCDNSG VANSKEPRSHKRGKHIERKYHLIREIVQR DVIVTKIASEHNI DPFTKTLTAKVFEGHLESLGL+D
Subjt:  AKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQD

Query:  MYIR
        MYIR
Subjt:  MYIR

KAA0045325.1 gag/pol protein [Cucumis melo var. makuwa]5.9e-293100Show/hide
Query:  MECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFNGNLE
        MECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFNGNLE
Subjt:  MECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFNGNLE

Query:  ESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQSRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDILIGNDVGYFTDIKAWLAVQFQMK
        ESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQSRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDILIGNDVGYFTDIKAWLAVQFQMK
Subjt:  ESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQSRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDILIGNDVGYFTDIKAWLAVQFQMK

Query:  DLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDICYVVGIV
        DLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDICYVVGIV
Subjt:  DLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDICYVVGIV

Query:  NRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEAAKEAVW
        NRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEAAKEAVW
Subjt:  NRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEAAKEAVW

Query:  LRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQDMYIR
        LRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQDMYIR
Subjt:  LRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQDMYIR

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]3.0e-27393.65Show/hide
Query:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN
        MDLEME MYFNSVWELVDLPEGVK IGCKWIYKRKR+SAGKVQTFKARLVAKGYTQREGV+YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAF N
Subjt:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN

Query:  GNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQ-SRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI-LIGNDVGYFTDIKAWLA
        GNLEESIFMS PEGFITQGQEQKVCK+NRSIYGLKQ SRSWNIRFDTAIKSY FDQNVDEPCVYKKINKGKVAFLVLYVDDI LIGNDVGY TD+KAWLA
Subjt:  GNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQ-SRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI-LIGNDVGYFTDIKAWLA

Query:  VQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDIC
         QFQMKDL EAQYVLGIQIIRD KNKTLALSQATYIDK+LVRYSM+NSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAV SL+Y MLCTRPDIC
Subjt:  VQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDIC

Query:  YVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEA
        Y VGIV+RYQSNPGLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTS SVFTLNGGAVVWRSIKQGCI DST+EAEYVA CEA
Subjt:  YVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEA

Query:  AKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQD
        AKEAVWLRKFLHDLEVVPNMNLPITLYCDNSG VANSKEPRSHKRGKHIERKYHLIREIVQR DVIVTKIASEHNI DPFTKTLTAKVFEGHLESLGL+D
Subjt:  AKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQD

Query:  MYIR
        MYIR
Subjt:  MYIR

TYJ96907.1 gag/pol protein [Cucumis melo var. makuwa]2.2e-27996.39Show/hide
Query:  MECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFNGNLE
        ME MYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQT KARLVAKGYTQREGVNYEETFS VAMLKSIRILLSIATFYDYEIWQMDVKTAFFNGNLE
Subjt:  MECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFNGNLE

Query:  ESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQSRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI-LIGNDVGYFTDIKAWLAVQFQM
        ESIFMS  EGFITQGQ+QKVCK+NRSIYGLKQSRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI LIGNDVGY  DIKAWLAVQFQM
Subjt:  ESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQSRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI-LIGNDVGYFTDIKAWLAVQFQM

Query:  KDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDICYVVGI
        KDLEEAQYVLGIQIIRDHKNKTLALSQATYI+KMLVRYSM+NSKKGLLPFRHGVHLSKEQCPKTPQEVEDMR IPYASAVDSLVYYMLCTRPDICYVVGI
Subjt:  KDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDICYVVGI

Query:  VNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEAAKEAV
        VNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTD DFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIE EYVATCEAAK+A 
Subjt:  VNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEAAKEAV

Query:  WLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQDMYIR
        WLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGL+DMYIR
Subjt:  WLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQDMYIR

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein1.2e-27092.66Show/hide
Query:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN
        MDLEME MYFNSVWELVDLPEGVK IGCKWIYKRKR+SAGKVQTFKARLVAKGYT++EGV+YEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAF N
Subjt:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN

Query:  GNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQ-SRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI-LIGNDVGYFTDIKAWLA
        GNLEESIFMS PEGFITQGQEQKVCK+NRSIYGLKQ SRSWNIRFDTAIKSY FDQNVDEPCVYKKINKGKVAFLVLYVDDI LIGNDVGY TD+KAWLA
Subjt:  GNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQ-SRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI-LIGNDVGYFTDIKAWLA

Query:  VQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDIC
         QFQMKDL E QYVLGIQIIRD KNKTLALSQATYIDK+LVRYSM+NSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAV SL+Y MLCTRPDIC
Subjt:  VQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDIC

Query:  YVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEA
        Y VGIV+RYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYT+SDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCI DST+EAEYVA CEA
Subjt:  YVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEA

Query:  AKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQD
        AKEAVWL+KFLHDLEVVPNMNLPITLYCDNSG VANSKEPRSHKRGKHIERKYHLIREIVQR DVIVTKIASEHNI DPFTKTLTAKVFEGHLESLGL+D
Subjt:  AKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQD

Query:  MYIR
        MYIR
Subjt:  MYIR

A0A5A7TVB5 Gag/pol protein2.9e-293100Show/hide
Query:  MECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFNGNLE
        MECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFNGNLE
Subjt:  MECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFNGNLE

Query:  ESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQSRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDILIGNDVGYFTDIKAWLAVQFQMK
        ESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQSRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDILIGNDVGYFTDIKAWLAVQFQMK
Subjt:  ESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQSRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDILIGNDVGYFTDIKAWLAVQFQMK

Query:  DLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDICYVVGIV
        DLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDICYVVGIV
Subjt:  DLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDICYVVGIV

Query:  NRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEAAKEAVW
        NRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEAAKEAVW
Subjt:  NRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEAAKEAVW

Query:  LRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQDMYIR
        LRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQDMYIR
Subjt:  LRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQDMYIR

A0A5A7TZD0 Gag/pol protein1.5e-27393.65Show/hide
Query:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN
        MDLEME MYFNSVWELVDLPEGVK IGCKWIYKRKR+SAGKVQTFKARLVAKGYTQREGV+YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAF N
Subjt:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN

Query:  GNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQ-SRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI-LIGNDVGYFTDIKAWLA
        GNLEESIFMS PEGFITQGQEQKVCK+NRSIYGLKQ SRSWNIRFDTAIKSY FDQNVDEPCVYKKINKGKVAFLVLYVDDI LIGNDVGY TD+KAWLA
Subjt:  GNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQ-SRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI-LIGNDVGYFTDIKAWLA

Query:  VQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDIC
         QFQMKDL EAQYVLGIQIIRD KNKTLALSQATYIDK+LVRYSM+NSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAV SL+Y MLCTRPDIC
Subjt:  VQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDIC

Query:  YVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEA
        Y VGIV+RYQSNPGLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTS SVFTLNGGAVVWRSIKQGCI DST+EAEYVA CEA
Subjt:  YVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEA

Query:  AKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQD
        AKEAVWLRKFLHDLEVVPNMNLPITLYCDNSG VANSKEPRSHKRGKHIERKYHLIREIVQR DVIVTKIASEHNI DPFTKTLTAKVFEGHLESLGL+D
Subjt:  AKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQD

Query:  MYIR
        MYIR
Subjt:  MYIR

A0A5A7UYE8 Gag/pol protein1.5e-27393.65Show/hide
Query:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN
        MDLEME MYFNSVWELVDLPEGVK IGCKWIYKRKR+SAGKVQTFKARLVAKGYTQREGV+YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAF N
Subjt:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN

Query:  GNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQ-SRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI-LIGNDVGYFTDIKAWLA
        GNLEESIFMS PEGFITQGQEQKVCK+NRSIYGLKQ SRSWNIRFDTAIKSY FDQNVDEPCVYKKINKGKVAFLVLYVDDI LIGNDVGY TD+KAWLA
Subjt:  GNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQ-SRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI-LIGNDVGYFTDIKAWLA

Query:  VQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDIC
         QFQMKDL EAQYVLGIQIIRD KNKTLALSQATYIDK+LVRYSM+NSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAV SL+Y MLCTRPDIC
Subjt:  VQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDIC

Query:  YVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEA
        Y VGIV+RYQSNPGLDHWTAVKI+LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTS SVFTLNGGAVVWRSIKQGCI DST+EAEYVA CEA
Subjt:  YVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEA

Query:  AKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQD
        AKEAVWLRKFLHDLEVVPNMNLPITLYCDNSG VANSKEPRSHKRGKHIERKYHLIREIVQR DVIVTKIASEHNI DPFTKTLTAKVFEGHLESLGL+D
Subjt:  AKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQD

Query:  MYIR
        MYIR
Subjt:  MYIR

A0A5D3BCQ5 Gag/pol protein1.1e-27996.39Show/hide
Query:  MECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFNGNLE
        ME MYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQT KARLVAKGYTQREGVNYEETFS VAMLKSIRILLSIATFYDYEIWQMDVKTAFFNGNLE
Subjt:  MECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFNGNLE

Query:  ESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQSRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI-LIGNDVGYFTDIKAWLAVQFQM
        ESIFMS  EGFITQGQ+QKVCK+NRSIYGLKQSRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI LIGNDVGY  DIKAWLAVQFQM
Subjt:  ESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQSRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDI-LIGNDVGYFTDIKAWLAVQFQM

Query:  KDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDICYVVGI
        KDLEEAQYVLGIQIIRDHKNKTLALSQATYI+KMLVRYSM+NSKKGLLPFRHGVHLSKEQCPKTPQEVEDMR IPYASAVDSLVYYMLCTRPDICYVVGI
Subjt:  KDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDICYVVGI

Query:  VNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEAAKEAV
        VNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTD DFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIE EYVATCEAAK+A 
Subjt:  VNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEAAKEAV

Query:  WLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQDMYIR
        WLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGL+DMYIR
Subjt:  WLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQDMYIR

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.0e-8235.04Show/hide
Query:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN
        ++ E+     N+ W +   PE    +  +W++  K N  G    +KARLVA+G+TQ+  ++YEETF+PVA + S R +LS+   Y+ ++ QMDVKTAF N
Subjt:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN

Query:  GNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQ-SRSWNIRFDTAIKSYCFDQNVDEPCVY--KKINKGKVAFLVLYVDDILIG-NDVGYFTDIKAW
        G L+E I+M  P+G         VCK+N++IYGLKQ +R W   F+ A+K   F  +  + C+Y   K N  +  +++LYVDD++I   D+    + K +
Subjt:  GNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQ-SRSWNIRFDTAIKSYCFDQNVDEPCVY--KKINKGKVAFLVLYVDDILIG-NDVGYFTDIKAW

Query:  LAVQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHL----SKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLC
        L  +F+M DL E ++ +GI+I  + +   + LSQ+ Y+ K+L +++M+N      P    ++     S E C             P  S +  L+Y MLC
Subjt:  LAVQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHL----SKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLC

Query:  TRPDICYVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYG---AKDLILTGYTDSDFQTDKDSRKSTSRSVFTL-NGGAVVWRSIKQGCIVDSTI
        TRPD+   V I++RY S    + W  +K +L+YL+ T D  L++    A +  + GY DSD+   +  RKST+  +F + +   + W + +Q  +  S+ 
Subjt:  TRPDICYVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYG---AKDLILTGYTDSDFQTDKDSRKSTSRSVFTL-NGGAVVWRSIKQGCIVDSTI

Query:  EAEYVATCEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFE
        EAEY+A  EA +EA+WL+  L  + +   +  PI +Y DN G ++ +  P  HKR KHI+ KYH  RE VQ   + +  I +E+ + D FTK L A  F 
Subjt:  EAEYVATCEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFE

Query:  GHLESLGL
           + LGL
Subjt:  GHLESLGL

P0CV72 Secreted RxLR effector protein 1611.4e-2645.86Show/hide
Query:  MRRIPYASAVDSLVYYMLCTRPDICYVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVY-GAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGA
        M+ +PY SAV +++Y M+ TRPD+   VG+++++ S+P   HW A+K +L+YL+ T+ Y L +  A    L GY+D+D+  D +SR+STS  +F LNGG 
Subjt:  MRRIPYASAVDSLVYYMLCTRPDICYVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVY-GAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGA

Query:  VVWRSIKQGCIVDSTIEAEYVATCEAAKEAVWL
        V WRS KQ  +  S+ E EY+A  EA +EAVWL
Subjt:  VVWRSIKQGCIVDSTIEAEYVATCEAAKEAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.9e-12244.29Show/hide
Query:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN
        M  EME +  N  ++LV+LP+G + + CKW++K K++   K+  +KARLV KG+ Q++G++++E FSPV  + SIR +LS+A   D E+ Q+DVKTAF +
Subjt:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN

Query:  GNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQS-RSWNIRFDTAIKSYCFDQNVDEPCVY-KKINKGKVAFLVLYVDDILI-GNDVGYFTDIKAWL
        G+LEE I+M  PEGF   G++  VCK+N+S+YGLKQ+ R W ++FD+ +KS  + +   +PCVY K+ ++     L+LYVDD+LI G D G    +K  L
Subjt:  GNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQS-RSWNIRFDTAIKSYCFDQNVDEPCVY-KKINKGKVAFLVLYVDDILI-GNDVGYFTDIKAWL

Query:  AVQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDI
        +  F MKDL  AQ +LG++I+R+  ++ L LSQ  YI+++L R++MKN+K    P    + LSK+ CP T +E  +M ++PY+SAV SL+Y M+CTRPDI
Subjt:  AVQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDI

Query:  CYVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCE
         + VG+V+R+  NPG +HW AVK IL+YLR T    L +G  D IL GYTD+D   D D+RKS++  +FT +GGA+ W+S  Q C+  ST EAEY+A  E
Subjt:  CYVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCE

Query:  AAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGL
          KE +WL++FL +L +         +YCD+   +  SK    H R KHI+ +YH IRE+V    + V KI++  N  D  TK +    FE   E +G+
Subjt:  AAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.4e-7334.8Show/hide
Query:  MDLEMECMYFNSVWELVDLPEG-VKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFF
        M  E+     N  W+LV  P   V  +GC+WI+ +K NS G +  +KARLVAKGY QR G++Y ETFSPV    SIRI+L +A    + I Q+DV  AF 
Subjt:  MDLEMECMYFNSVWELVDLPEG-VKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFF

Query:  NGNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQS-RSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDILI-GNDVGYFTDIKAWL
         G L + ++MS P GFI + +   VCK+ +++YGLKQ+ R+W +     + +  F  +V +  ++       + ++++YVDDILI GND     +    L
Subjt:  NGNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQS-RSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDILI-GNDVGYFTDIKAWL

Query:  AVQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDI
        + +F +KD EE  Y LGI+  R      L LSQ  YI  +L R +M  +K    P      LS     K     E      Y   V SL  Y+  TRPDI
Subjt:  AVQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDI

Query:  CYVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATC
         Y V  ++++   P  +H  A+K IL+YL  T ++ + +     L L  Y+D+D+  DKD   ST+  +  L    + W S KQ  +V S+ EAEY +  
Subjt:  CYVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATC

Query:  EAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGL
          + E  W+   L +L +   +  P  +YCDN G       P  H R KHI   YH IR  VQ   + V  +++   + D  TK L+   F+     +G+
Subjt:  EAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.0e-7534.2Show/hide
Query:  MDLEMECMYFNSVWELV-DLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFF
        M  E+     N  W+LV   P  V  +GC+WI+ +K NS G +  +KARLVAKGY QR G++Y ETFSPV    SIRI+L +A    + I Q+DV  AF 
Subjt:  MDLEMECMYFNSVWELV-DLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFF

Query:  NGNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQS-RSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDILI-GNDVGYFTDIKAWL
         G L + ++MS P GF+ + +   VC++ ++IYGLKQ+ R+W +   T + +  F  ++ +  ++       + ++++YVDDILI GND          L
Subjt:  NGNLEESIFMSHPEGFITQGQEQKVCKMNRSIYGLKQS-RSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDILI-GNDVGYFTDIKAWL

Query:  AVQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDI
        + +F +K+ E+  Y LGI+  R  +   L LSQ  Y   +L R +M  +K    P      L+     K P   E      Y   V SL  Y+  TRPD+
Subjt:  AVQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDI

Query:  CYVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATC
         Y V  +++Y   P  DHW A+K +L+YL  T D+ + +     L L  Y+D+D+  D D   ST+  +  L    + W S KQ  +V S+ EAEY +  
Subjt:  CYVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATC

Query:  EAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGL
          + E  W+   L +L +   ++ P  +YCDN G       P  H R KHI   YH IR  VQ   + V  +++   + D  TK L+   F+     +G+
Subjt:  EAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.1e-7034.34Show/hide
Query:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN
        MD E+  M     WE+  LP   K IGCKW+YK K NS G ++ +KARLVAKGYTQ+EG+++ ETFSPV  L S++++L+I+  Y++ + Q+D+  AF N
Subjt:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFN

Query:  GNLEESIFMSHPEGFIT-QGQE---QKVCKMNRSIYGLKQ-SRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDILI-GNDVGYFTDIK
        G+L+E I+M  P G+   QG       VC + +SIYGLKQ SR W ++F   +  + F Q+  +   + KI       +++YVDDI+I  N+     ++K
Subjt:  GNLEESIFMSHPEGFIT-QGQE---QKVCKMNRSIYGLKQ-SRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDILI-GNDVGYFTDIK

Query:  AWLAVQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTR
        + L   F+++DL   +Y LG++I R      + + Q  Y   +L    +   K   +P    V  S      +  +  D +   Y   +  L+Y  + TR
Subjt:  AWLAVQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTR

Query:  PDICYVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAK-DLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYV
         DI + V  ++++   P L H  AV  IL Y++ T    L Y ++ ++ L  ++D+ FQ+ KD+R+ST+     L    + W+S KQ  +  S+ EAEY 
Subjt:  PDICYVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAK-DLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYV

Query:  ATCEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIRE
        A   A  E +WL +F  +L++   ++ P  L+CDN+  +  +     H+R KHIE   H +RE
Subjt:  ATCEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIRE

ATMG00810.1 DNA/RNA polymerases superfamily protein5.0e-1630.51Show/hide
Query:  FLVLYVDDILI-GNDVGYFTDIKAWLAVQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSK--KGLLPFRHGVHLSKEQCPKTPQ
        +L+LYVDDIL+ G+       +   L+  F MKDL    Y LGIQ I+ H +  L LSQ  Y +++L    M + K     LP +    +S  + P    
Subjt:  FLVLYVDDILI-GNDVGYFTDIKAWLAVQFQMKDLEEAQYVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSK--KGLLPFRHGVHLSKEQCPKTPQ

Query:  EVEDMRRIPYASAVDSLVYYMLCTRPDICYVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTL
        +  D R I  A      + Y+  TRPDI Y V IV +    P L  +  +K +L+Y++ T  + + ++    L +  + DSD+     +R+ST+     L
Subjt:  EVEDMRRIPYASAVDSLVYYMLCTRPDICYVVGIVNRYQSNPGLDHWTAVKIILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTL

Query:  NGGAVVWRSIKQGCIVDSTIEAEYVATCEAAKEAVW
            + W + +Q  +  S+ E EY A    A E  W
Subjt:  NGGAVVWRSIKQGCIVDSTIEAEYVATCEAAKEAVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.2e-1442.68Show/hide
Query:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIA
        M  E++ +  N  W LV  P     +GCKW++K K +S G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L++A
Subjt:  MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCTTGAAATGGAGTGTATGTACTTCAATTCAGTGTGGGAACTTGTAGATCTACCTGAAGGTGTAAAATCTATAGGGTGTAAATGGATCTATAAGAGAAAG
AGAAATTCAGCTGGGAAGGTACAGACCTTCAAAGCTAGACTTGTAGCAAAAGGGTATACCCAAAGGGAAGGGGTTAACTATGAGGAAACTTTTTCTCCTGTTGCT
ATGTTAAAGTCTATAAGGATTCTCTTGTCCATCGCCACATTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTGCTTTTTTTAATGGCAATCTTGAAGAG
AGTATCTTTATGTCTCACCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGTAAGATGAATCGATCCATTTATGGGTTGAAACAATCTAGATCTTGG
AACATTAGGTTTGATACTGCGATCAAATCCTACTGTTTTGACCAAAACGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTA
CTTTATGTGGACGATATCCTCATTGGGAATGATGTGGGATACTTTACTGACATTAAAGCTTGGCTAGCAGTCCAATTCCAAATGAAAGATTTAGAAGAGGCACAA
TATGTTCTTGGAATCCAAATCATAAGGGATCATAAGAACAAAACGCTAGCATTGTCTCAAGCAACCTATATCGACAAAATGTTGGTTCGATATTCGATGAAGAAC
TCTAAGAAAGGTTTATTACCTTTCAGGCATGGGGTTCACTTGTCTAAGGAACAGTGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCC
TCAGCTGTGGACAGCTTAGTGTATTATATGCTCTGCACTAGGCCAGACATTTGTTATGTAGTGGGAATAGTCAATAGGTATCAGTCCAATCCAGGGTTAGACCAT
TGGACGGCGGTTAAAATTATTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCCTTACAGGATACACTGATTCTGAT
TTCCAAACTGATAAGGATTCTAGGAAATCCACATCGAGATCAGTGTTCACCTTAAATGGGGGAGCTGTAGTATGGCGTAGCATCAAGCAAGGATGCATTGTAGAC
TCTACAATAGAGGCTGAATACGTCGCTACTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTTCCAAACATGAACTTG
CCCATCACTCTATATTGTGATAACAGTGGGACAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTATCATCTGATACGG
GAGATTGTGCAACGAAGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGTTGATCCATTTACGAAGACTCTCACGGCTAAAGTGTTCGAGGGTCAT
CTAGAAAGTCTAGGTCTACAAGATATGTACATTAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACCTTGAAATGGAGTGTATGTACTTCAATTCAGTGTGGGAACTTGTAGATCTACCTGAAGGTGTAAAATCTATAGGGTGTAAATGGATCTATAAGAGAAAG
AGAAATTCAGCTGGGAAGGTACAGACCTTCAAAGCTAGACTTGTAGCAAAAGGGTATACCCAAAGGGAAGGGGTTAACTATGAGGAAACTTTTTCTCCTGTTGCT
ATGTTAAAGTCTATAAGGATTCTCTTGTCCATCGCCACATTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTGCTTTTTTTAATGGCAATCTTGAAGAG
AGTATCTTTATGTCTCACCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGTAAGATGAATCGATCCATTTATGGGTTGAAACAATCTAGATCTTGG
AACATTAGGTTTGATACTGCGATCAAATCCTACTGTTTTGACCAAAACGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTA
CTTTATGTGGACGATATCCTCATTGGGAATGATGTGGGATACTTTACTGACATTAAAGCTTGGCTAGCAGTCCAATTCCAAATGAAAGATTTAGAAGAGGCACAA
TATGTTCTTGGAATCCAAATCATAAGGGATCATAAGAACAAAACGCTAGCATTGTCTCAAGCAACCTATATCGACAAAATGTTGGTTCGATATTCGATGAAGAAC
TCTAAGAAAGGTTTATTACCTTTCAGGCATGGGGTTCACTTGTCTAAGGAACAGTGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCC
TCAGCTGTGGACAGCTTAGTGTATTATATGCTCTGCACTAGGCCAGACATTTGTTATGTAGTGGGAATAGTCAATAGGTATCAGTCCAATCCAGGGTTAGACCAT
TGGACGGCGGTTAAAATTATTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCCTTACAGGATACACTGATTCTGAT
TTCCAAACTGATAAGGATTCTAGGAAATCCACATCGAGATCAGTGTTCACCTTAAATGGGGGAGCTGTAGTATGGCGTAGCATCAAGCAAGGATGCATTGTAGAC
TCTACAATAGAGGCTGAATACGTCGCTACTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTTCCAAACATGAACTTG
CCCATCACTCTATATTGTGATAACAGTGGGACAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTATCATCTGATACGG
GAGATTGTGCAACGAAGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGTTGATCCATTTACGAAGACTCTCACGGCTAAAGTGTTCGAGGGTCAT
CTAGAAAGTCTAGGTCTACAAGATATGTACATTAGGTAA
Protein sequenceShow/hide protein sequence
MDLEMECMYFNSVWELVDLPEGVKSIGCKWIYKRKRNSAGKVQTFKARLVAKGYTQREGVNYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFFNGNLEE
SIFMSHPEGFITQGQEQKVCKMNRSIYGLKQSRSWNIRFDTAIKSYCFDQNVDEPCVYKKINKGKVAFLVLYVDDILIGNDVGYFTDIKAWLAVQFQMKDLEEAQ
YVLGIQIIRDHKNKTLALSQATYIDKMLVRYSMKNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVDSLVYYMLCTRPDICYVVGIVNRYQSNPGLDH
WTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIVDSTIEAEYVATCEAAKEAVWLRKFLHDLEVVPNMNL
PITLYCDNSGTVANSKEPRSHKRGKHIERKYHLIREIVQRRDVIVTKIASEHNIVDPFTKTLTAKVFEGHLESLGLQDMYIR