; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh10G011380 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh10G011380
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionGag/pol protein
Genome locationCmo_Chr10:9556489..9558117
RNA-Seq ExpressionCmoCh10G011380
SyntenyCmoCh10G011380
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]3.9e-28287.14Show/hide
Query:  MPRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARL
        MPRRSGRV++QP+RYLGL ETQV+IPDDGVEDPL+YK AMND D+DQW+KAM+LEMESMYFNSVWELVD P+GVKPIGCKWIYKRKRD  GKVQTFKARL
Subjt:  MPRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAI
        VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESI+M+QPEGFI Q  EQ+VCKL RSIYGLKQASRSWNI+FDTAI
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAI

Query:  KSYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSK
        KSYGF QNVDEPCVYK+I    VA LVLYVDDILLIGNDVG LTD+K WLA QFQMK+          QI+R+ KNKTLALSQA+YIDK+L+RY MQ+SK
Subjt:  KSYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSK

Query:  KGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYT
        KGLLPFRHGVHLSKEQSPKTPQEVEDM+ IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG  HWTAVK +LKYLRRTRDYML+YGAKDLILTGYT
Subjt:  KGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYT

Query:  DSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHI
        DSDFQTD DSRKSTSGSVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWLRKFL DLEVVPNM+LP+TLYCDNSGAVANSKEPRSHKRGKHI
Subjt:  DSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHI

Query:  ERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        ERKYHLIREIVQRGDVIVT+IASEHNIADPFTK LTAKVFEGHL SLGLR M
Subjt:  ERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]5.8e-27886.05Show/hide
Query:  MPRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARL
        MPRRSGRV++QP+RYLGL ETQV+IPDDGVEDPL+YK AMND D+DQW+KAM+LEMESMYFNSVWELVD P+GVKPIGCKWIYKRKRD  GKVQTFKARL
Subjt:  MPRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAI
        VAKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAFLNGNLEESI+M+QPEGFI Q  EQ+VCKL RSIYGLKQASRSWNI+FDTAI
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAI

Query:  KSYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSK
        KSYGF QNVDEPCVYK+I    VA LVLYVDDILLIGNDVG LTD+K WLA QFQMK+          QI+R+ KNKTLALSQA+YIDK+L+RY MQ+SK
Subjt:  KSYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSK

Query:  KGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYT
        KGLLPFRHGVHLSKEQSPKTPQEVEDM+ IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG  HWTAVK ILKYLRRTRDYML+YGAKDLILTGYT
Subjt:  KGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYT

Query:  DSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHI
        +SDFQTD DSRKSTS SVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWL+KFL DLEVVPNM+LP+TLYCDNSGAVANSKEPRSHKRGKHI
Subjt:  DSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHI

Query:  ERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        ERKYHLIREIVQRGDVIVT+IASEHNIADPFTK LTAKVFEGHL SLGLR M
Subjt:  ERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]3.5e-26782.76Show/hide
Query:  PRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLV
        PRRSGRV   P RY+ L ET  +I D  +EDPLT+K AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLV
Subjt:  PRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIK
        AKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI    EQ++CKL RSIYGLKQASRSWNI+FDTAIK
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIK

Query:  SYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSKK
        SYGF Q VDEPCVYKRI+N +VA LVLYVDDILLIGND+G+LTDIK WLATQFQMK+          QI R+ KNK LALSQASYIDK++++Y MQ+SK+
Subjt:  SYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSKK

Query:  GLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTD
        GLLPFRHGV LSKEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTD
Subjt:  GLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTD

Query:  SDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIE
        SDFQTD DSRKSTSGSVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIE
Subjt:  SDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIE

Query:  RKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        RKYHLIREIV RGDVIVTQIAS HN+ADPFTKPLTAKVFEGHL SLGLR M
Subjt:  RKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]3.9e-28287.14Show/hide
Query:  MPRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARL
        MPRRSGRV++QP+RYLGL ETQV+IPDDGVEDPL+YK AMND D+DQW+KAM+LEMESMYFNSVWELVD P+GVKPIGCKWIYKRKRD  GKVQTFKARL
Subjt:  MPRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAI
        VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESI+M+QPEGFI Q  EQ+VCKL RSIYGLKQASRSWNI+FDTAI
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAI

Query:  KSYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSK
        KSYGF QNVDEPCVYK+I    VA LVLYVDDILLIGNDVG LTD+K WLA QFQMK+          QI+R+ KNKTLALSQA+YIDK+L+RY MQ+SK
Subjt:  KSYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSK

Query:  KGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYT
        KGLLPFRHGVHLSKEQSPKTPQEVEDM+ IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG  HWTAVK +LKYLRRTRDYML+YGAKDLILTGYT
Subjt:  KGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYT

Query:  DSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHI
        DSDFQTD DSRKSTSGSVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWLRKFL DLEVVPNM+LP+TLYCDNSGAVANSKEPRSHKRGKHI
Subjt:  DSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHI

Query:  ERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        ERKYHLIREIVQRGDVIVT+IASEHNIADPFTK LTAKVFEGHL SLGLR M
Subjt:  ERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa]3.5e-26782.76Show/hide
Query:  PRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLV
        PRRSGRV   P RY+ L ET  +I D  +EDPLT+K AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLV
Subjt:  PRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIK
        AKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI    EQ++CKL RSIYGLKQASRSWNI+FDTAIK
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIK

Query:  SYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSKK
        SYGF Q VDEPCVYKRI+N +VA LVLYVDDILLIGND+G+LTDIK WLATQFQMK+          QI R+ KNK LALSQASYIDK++++Y MQ+SK+
Subjt:  SYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSKK

Query:  GLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTD
        GLLPFRHGV LSKEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTD
Subjt:  GLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTD

Query:  SDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIE
        SDFQTD DSRKSTSGSVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIE
Subjt:  SDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIE

Query:  RKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        RKYHLIREIV RGDVIVTQIAS HN+ADPFTKPLTAKVFEGHL SLGLR M
Subjt:  RKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.7e-26782.76Show/hide
Query:  PRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLV
        PRRSGRV   P RY+ L ET  +I D  +EDPLT+K AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLV
Subjt:  PRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIK
        AKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI    EQ++CKL RSIYGLKQASRSWNI+FDTAIK
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIK

Query:  SYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSKK
        SYGF Q VDEPCVYKRI+N +VA LVLYVDDILLIGND+G+LTDIK WLATQFQMK+          QI R+ KNK LALSQASYIDK++++Y MQ+SK+
Subjt:  SYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSKK

Query:  GLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTD
        GLLPFRHGV LSKEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTD
Subjt:  GLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTD

Query:  SDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIE
        SDFQTD DSRKSTSGSVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIE
Subjt:  SDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIE

Query:  RKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        RKYHLIREIV RGDVIVTQIAS HN+ADPFTKPLTAKVFEGHL SLGLR M
Subjt:  RKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

A0A5A7T2V9 Gag/pol protein2.8e-27886.05Show/hide
Query:  MPRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARL
        MPRRSGRV++QP+RYLGL ETQV+IPDDGVEDPL+YK AMND D+DQW+KAM+LEMESMYFNSVWELVD P+GVKPIGCKWIYKRKRD  GKVQTFKARL
Subjt:  MPRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAI
        VAKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAFLNGNLEESI+M+QPEGFI Q  EQ+VCKL RSIYGLKQASRSWNI+FDTAI
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAI

Query:  KSYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSK
        KSYGF QNVDEPCVYK+I    VA LVLYVDDILLIGNDVG LTD+K WLA QFQMK+          QI+R+ KNKTLALSQA+YIDK+L+RY MQ+SK
Subjt:  KSYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSK

Query:  KGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYT
        KGLLPFRHGVHLSKEQSPKTPQEVEDM+ IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG  HWTAVK ILKYLRRTRDYML+YGAKDLILTGYT
Subjt:  KGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYT

Query:  DSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHI
        +SDFQTD DSRKSTS SVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWL+KFL DLEVVPNM+LP+TLYCDNSGAVANSKEPRSHKRGKHI
Subjt:  DSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHI

Query:  ERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        ERKYHLIREIVQRGDVIVT+IASEHNIADPFTK LTAKVFEGHL SLGLR M
Subjt:  ERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

A0A5A7TZD0 Gag/pol protein1.9e-28287.14Show/hide
Query:  MPRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARL
        MPRRSGRV++QP+RYLGL ETQV+IPDDGVEDPL+YK AMND D+DQW+KAM+LEMESMYFNSVWELVD P+GVKPIGCKWIYKRKRD  GKVQTFKARL
Subjt:  MPRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAI
        VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESI+M+QPEGFI Q  EQ+VCKL RSIYGLKQASRSWNI+FDTAI
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAI

Query:  KSYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSK
        KSYGF QNVDEPCVYK+I    VA LVLYVDDILLIGNDVG LTD+K WLA QFQMK+          QI+R+ KNKTLALSQA+YIDK+L+RY MQ+SK
Subjt:  KSYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSK

Query:  KGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYT
        KGLLPFRHGVHLSKEQSPKTPQEVEDM+ IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG  HWTAVK +LKYLRRTRDYML+YGAKDLILTGYT
Subjt:  KGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYT

Query:  DSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHI
        DSDFQTD DSRKSTSGSVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWLRKFL DLEVVPNM+LP+TLYCDNSGAVANSKEPRSHKRGKHI
Subjt:  DSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHI

Query:  ERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        ERKYHLIREIVQRGDVIVT+IASEHNIADPFTK LTAKVFEGHL SLGLR M
Subjt:  ERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

A0A5A7UYE8 Gag/pol protein1.9e-28287.14Show/hide
Query:  MPRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARL
        MPRRSGRV++QP+RYLGL ETQV+IPDDGVEDPL+YK AMND D+DQW+KAM+LEMESMYFNSVWELVD P+GVKPIGCKWIYKRKRD  GKVQTFKARL
Subjt:  MPRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAI
        VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESI+M+QPEGFI Q  EQ+VCKL RSIYGLKQASRSWNI+FDTAI
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAI

Query:  KSYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSK
        KSYGF QNVDEPCVYK+I    VA LVLYVDDILLIGNDVG LTD+K WLA QFQMK+          QI+R+ KNKTLALSQA+YIDK+L+RY MQ+SK
Subjt:  KSYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSK

Query:  KGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYT
        KGLLPFRHGVHLSKEQSPKTPQEVEDM+ IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG  HWTAVK +LKYLRRTRDYML+YGAKDLILTGYT
Subjt:  KGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYT

Query:  DSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHI
        DSDFQTD DSRKSTSGSVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWLRKFL DLEVVPNM+LP+TLYCDNSGAVANSKEPRSHKRGKHI
Subjt:  DSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHI

Query:  ERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        ERKYHLIREIVQRGDVIVT+IASEHNIADPFTK LTAKVFEGHL SLGLR M
Subjt:  ERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

A0A5D3CPJ6 Gag/pol protein1.7e-26782.76Show/hide
Query:  PRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLV
        PRRSGRV   P RY+ L ET  +I D  +EDPLT+K AM D D+D+WIKAMNLE+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR   GKVQTFKARLV
Subjt:  PRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLV

Query:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIK
        AKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI    EQ++CKL RSIYGLKQASRSWNI+FDTAIK
Subjt:  AKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIK

Query:  SYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSKK
        SYGF Q VDEPCVYKRI+N +VA LVLYVDDILLIGND+G+LTDIK WLATQFQMK+          QI R+ KNK LALSQASYIDK++++Y MQ+SK+
Subjt:  SYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSKK

Query:  GLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTD
        GLLPFRHGV LSKEQ PKTPQ+VE+M+HIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTD
Subjt:  GLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTD

Query:  SDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIE
        SDFQTD DSRKSTSGSVFTLNGGA++WRSIKQGCIADSTMEAEYVAACEAAKE+VWLR FL DLEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIE
Subjt:  SDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIE

Query:  RKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        RKYHLIREIV RGDVIVTQIAS HN+ADPFTKPLTAKVFEGHL SLGLR M
Subjt:  RKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.4e-8835.44Show/hide
Query:  PLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSI
        P ++      +D+  W +A+N E+ +   N+ W +  +P+    +  +W++  K ++ G    +KARLVA+G+TQ+  +DYEETF+PVA + S R +LS+
Subjt:  PLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSI

Query:  ATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY---KRIVNSTVALLVLY
           Y+ ++ QMDVKTAFLNG L+E IYM  P+G     +   VCKL ++IYGLKQA+R W   F+ A+K   F  +  + C+Y   K  +N  + +L LY
Subjt:  ATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY---KRIVNSTVALLVLY

Query:  VDDILLIGNDVGILTDIKHWLATQFQMKNQIVRNH--------KNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIP
        VDD+++   D+  + + K +L  +F+M +     H        +   + LSQ++Y+ K+L ++ M++      P    ++     S       ++  + P
Subjt:  VDDILLIGNDVGILTDIKHWLATQFQMKNQIVRNH--------KNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIP

Query:  YASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYG---AKDLILTGYTDSDFQTDVDSRKSTSGSVFTL-NGGAII
          S +G LMY MLCTRPD+  AV I+SRY S      W  +K +L+YL+ T D  L++    A +  + GY DSD+      RKST+G +F + +   I 
Subjt:  YASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYG---AKDLILTGYTDSDFQTDVDSRKSTSGSVFTL-NGGAII

Query:  WRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNI
        W + +Q  +A S+ EAEY+A  EA +E++WL+  LT + +   +  P+ +Y DN G ++ +  P  HKR KHI+ KYH  RE VQ   + +  I +E+ +
Subjt:  WRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNI

Query:  ADPFTKPLTAKVFEGHLVSLGL
        AD FTKPL A  F      LGL
Subjt:  ADPFTKPLTAKVFEGHLVSLGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-12845Show/hide
Query:  RRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVA
        RRS R   +  RY       V+I DD   +P + K  ++  +++Q +KAM  EMES+  N  ++LV+ P G +P+ CKW++K K+D   K+  +KARLV 
Subjt:  RRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVA

Query:  KGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKS
        KG+ Q++G+D++E FSPV  + SIR +LS+A   D E+ Q+DVKTAFL+G+LEE IYM QPEGF     +  VCKL +S+YGLKQA R W +KFD+ +KS
Subjt:  KGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKS

Query:  YGFKQNVDEPCVY-KRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSKK
          + +   +PCVY KR   +   +L+LYVDD+L++G D G++  +K  L+  F MK+          +IVR   ++ L LSQ  YI+++L R+ M+++K 
Subjt:  YGFKQNVDEPCVY-KRIVNSTVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSKK

Query:  GLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTD
           P    + LSK+  P T +E  +M  +PY+SAVGSLMYAM+CTRPDI +AVG+VSR+  NPG+ HW AVK IL+YLR T    L +G  D IL GYTD
Subjt:  GLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTD

Query:  SDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIE
        +D   D+D+RKS++G +FT +GGAI W+S  Q C+A ST EAEY+AA E  KE +WL++FL +L +    ++   +YCD+  A+  SK    H R KHI+
Subjt:  SDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIE

Query:  RKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFE
         +YH IRE+V    + V +I++  N AD  TK +    FE
Subjt:  RKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFE

P25600 Putative transposon Ty5-1 protein YCL074W2.4e-3231.95Show/hide
Query:  MDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGI
        MDV TAFLN  ++E IY+ QP GF+ + +   V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y R  +     + +YVDD+L+      I
Subjt:  MDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVALLVLYVDDILLIGNDVGI

Query:  LTDIKHWLATQFQMKN---------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAM
           +K  L   + MK+           +    N  + LS   YI K     ++   K    P  +   L +  SP     ++D+   PY S VG L++  
Subjt:  LTDIKHWLATQFQMKN---------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLMYAM

Query:  LCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMY-GAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIK-QGCIADSTM
           RPDI Y V ++SR+   P   H  + + +L+YL  TR   L Y     L LT Y D+      D   ST G V  L G  + W S K +G I   + 
Subjt:  LCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMY-GAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIK-QGCIADSTM

Query:  EAEYVAACEAAKE
        EAEY+ A E   E
Subjt:  EAEYVAACEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.0e-8336.1Show/hide
Query:  DPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDG-VKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        +P T   A+ DE   +W  AM  E+ +   N  W+LV  P   V  +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L
Subjt:  DPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDG-VKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVALLVLYV
         +A    + I Q+DV  AFL G L + +YM+QP GFI++D    VCKL++++YGLKQA R+W ++    + + GF  +V +  ++      ++  +++YV
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVALLVLYV

Query:  DDILLIGNDVGILTDIKHWLATQFQMKNQ--------IVRNHKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPY
        DDIL+ GND  +L +    L+ +F +K+         I        L LSQ  YI  +L R  M  +K    P      LS     K     E      Y
Subjt:  DDILLIGNDVGILTDIKHWLATQFQMKNQ--------IVRNHKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPY

Query:  ASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-MLMYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSI
           VGSL Y +  TRPDI YAV  +S++   P   H  A+K IL+YL  T ++ + +     L L  Y+D+D+  D D   ST+G +  L    I W S 
Subjt:  ASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-MLMYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSI

Query:  KQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIADPF
        KQ  +  S+ EAEY +    + E  W+   LT+L +   +  P  +YCDN GA      P  H R KHI   YH IR  VQ G + V  +++   +AD  
Subjt:  KQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIADPF

Query:  TKPLTAKVFEGHLVSLGL
        TKPL+   F+     +G+
Subjt:  TKPLTAKVFEGHLVSLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.4e-8636.1Show/hide
Query:  DPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELV-DQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        +P T   AM D   D+W +AM  E+ +   N  W+LV   P  V  +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L
Subjt:  DPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELV-DQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVALLVLYV
         +A    + I Q+DV  AFL G L + +YM+QP GF+++D    VC+L+++IYGLKQA R+W ++  T + + GF  ++ +  ++      ++  +++YV
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVALLVLYV

Query:  DDILLIGNDVGILTDIKHWLATQFQMKNQ--------IVRNHKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPY
        DDIL+ GND  +L      L+ +F +K          I      + L LSQ  Y   +L R  M  +K    P      L+     K P   E      Y
Subjt:  DDILLIGNDVGILTDIKHWLATQFQMKNQ--------IVRNHKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPY

Query:  ASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-MLMYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSI
           VGSL Y +  TRPD+ YAV  +S+Y   P   HW A+K +L+YL  T D+ + +     L L  Y+D+D+  D D   ST+G +  L    I W S 
Subjt:  ASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-MLMYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSI

Query:  KQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIADPF
        KQ  +  S+ EAEY +    + E  W+   LT+L +   +  P  +YCDN GA      P  H R KHI   YH IR  VQ G + V  +++   +AD  
Subjt:  KQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIADPF

Query:  TKPLTAKVFEGHLVSLGL
        TKPL+   F+     +G+
Subjt:  TKPLTAKVFEGHLVSLGL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.1e-7634.71Show/hide
Query:  EDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P TY  A   ++   W  AM+ E+ +M     WE+   P   KPIGCKW+YK K +  G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHE----QRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVALL
        +I+  Y++ + Q+D+  AFLNG+L+E IYM  P G+  +  +      VC LK+SIYGLKQASR W +KF   +  +GF Q+  +   + +I  +    +
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHE----QRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVALL

Query:  VLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVED
        ++YVDDI++  N+   + ++K  L + F++++          +I R+     + + Q  Y   +L    +   K   +P    V  S         +  D
Subjt:  VLYVDDILLIGNDVGILTDIKHWLATQFQMKN----------QIVRNHKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVED

Query:  MKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAK-DLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGA
         K   Y   +G LMY  + TR DI +AV  +S++   P  AH  AV  IL Y++ T    L Y ++ ++ L  ++D+ FQ+  D+R+ST+G    L    
Subjt:  MKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAK-DLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGA

Query:  IIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE
        I W+S KQ  ++ S+ EAEY A   A  E +WL +F  +L++   +  P  L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  IIWRSIKQGCIADSTMEAEYVAACEAAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein4.3e-0537.5Show/hide
Query:  TRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGA-KDLILTGYTDSDFQTDVDSRKSTSG
        TRPD+ +AV  +S++ S    A   AV  +L Y++ T    L Y A  DL L  + DSD+ +  D+R+S +G
Subjt:  TRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGA-KDLILTGYTDSDFQTDVDSRKSTSG

ATMG00810.1 DNA/RNA polymerases superfamily protein5.5e-1630.6Show/hide
Query:  LVLYVDDILLIGNDVGILTDIKHWLATQFQMKNQ---------IVRNHKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVED
        L+LYVDDILL G+   +L  +   L++ F MK+           ++ H +  L LSQ  Y +++L    M D K    P    ++ S   + K P   + 
Subjt:  LVLYVDDILLIGNDVGILTDIKHWLATQFQMKNQ---------IVRNHKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVED

Query:  MKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-MLMYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGA
             + S VG+L Y  L TRPDI YAV IV +    P  A +  +K +L+Y++ T  + + ++    L +  + DSD+     +R+ST+G    L    
Subjt:  MKHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDY-MLMYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGA

Query:  IIWRSIKQGCIADSTMEAEYVAACEAAKESVW
        I W + +Q  ++ S+ E EY A    A E  W
Subjt:  IIWRSIKQGCIADSTMEAEYVAACEAAKESVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.7e-1541.86Show/hide
Query:  WIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
        W +AM  E++++  N  W LV  P     +GCKW++K K    G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L++A
Subjt:  WIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCGACGTAGTGGGAGGGTTATAACTCAACCCGATCGTTACTTGGGTTTGATAGAAACTCAAGTCATCATACCTGATGATGGTGTTGAGGATCCATTGACCTATAA
ACACGCAATGAATGATGAAGATAGAGATCAGTGGATTAAAGCCATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTTGTAGATCAACCAGATGGGG
TAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACCGGTAAGGTACAGACCTTTAAAGCACGACTAGTAGCAAAGGGTTATACCCAAAGAGAGGGG
GTGGACTATGAAGAAACCTTCTCTCCTGTTGCTATGCTTAAATCAATTAGAATACTCTTGTCTATTGCCACATTTTATGATTATGAAATTTGGCAAATGGATGTTAAGAC
AGCTTTTCTAAACGGCAATCTTGAAGAGAGTATCTATATGGCTCAACCAGAGGGGTTCATTGAACAGGATCACGAGCAAAGGGTTTGCAAGCTTAAAAGATCCATTTATG
GGTTGAAGCAAGCATCTCGATCCTGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAAC
TCTACAGTAGCTTTGCTGGTGTTGTACGTTGACGATATCCTACTCATTGGAAATGATGTGGGAATTCTGACTGACATTAAGCATTGGCTAGCGACACAATTCCAAATGAA
AAATCAAATTGTTCGGAATCACAAGAACAAAACACTAGCATTGTCTCAAGCATCGTACATCGACAAAATGTTGATTCGATATAAGATGCAGGACTCCAAGAAAGGATTAT
TACCTTTCAGGCATGGAGTTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAAACACATTCCCTATGCATCAGCGGTCGGTAGTCTGATG
TATGCCATGCTATGTACCCGACCCGACATATGCTATGCTGTGGGAATTGTCAGCAGATATCAGTCCAATCCGGGACGTGCTCATTGGACTGCCGTTAAGAATATCCTCAA
GTATCTTAGGAGAACGAGGGACTATATGCTAATGTACGGTGCTAAGGATCTGATCCTTACAGGGTACACTGACTCAGATTTTCAGACCGATGTAGATTCGAGGAAATCGA
CATCAGGATCTGTCTTCACTCTGAACGGAGGAGCAATAATATGGAGAAGCATAAAGCAAGGTTGCATTGCTGATTCCACCATGGAGGCTGAGTATGTTGCTGCTTGTGAA
GCAGCGAAAGAATCTGTATGGCTTAGGAAGTTCTTAACTGATTTGGAAGTCGTTCCAAATATGCATCTTCCCGTCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAA
TTCAAAAGAACCAAGAAGCCATAAGCGAGGAAAACATATTGAGCGCAAATATCATCTCATAAGGGAGATTGTGCAACGAGGAGATGTGATCGTCACACAGATAGCTTCAG
AGCACAACATTGCTGATCCATTTACAAAGCCCCTCACGGCTAAAGTGTTTGAAGGGCACCTAGTGAGTCTAGGACTACGAGTTATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCTCGACGTAGTGGGAGGGTTATAACTCAACCCGATCGTTACTTGGGTTTGATAGAAACTCAAGTCATCATACCTGATGATGGTGTTGAGGATCCATTGACCTATAA
ACACGCAATGAATGATGAAGATAGAGATCAGTGGATTAAAGCCATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTTGTAGATCAACCAGATGGGG
TAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACCGGTAAGGTACAGACCTTTAAAGCACGACTAGTAGCAAAGGGTTATACCCAAAGAGAGGGG
GTGGACTATGAAGAAACCTTCTCTCCTGTTGCTATGCTTAAATCAATTAGAATACTCTTGTCTATTGCCACATTTTATGATTATGAAATTTGGCAAATGGATGTTAAGAC
AGCTTTTCTAAACGGCAATCTTGAAGAGAGTATCTATATGGCTCAACCAGAGGGGTTCATTGAACAGGATCACGAGCAAAGGGTTTGCAAGCTTAAAAGATCCATTTATG
GGTTGAAGCAAGCATCTCGATCCTGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGACGAACCTTGTGTTTATAAAAGGATAGTCAAC
TCTACAGTAGCTTTGCTGGTGTTGTACGTTGACGATATCCTACTCATTGGAAATGATGTGGGAATTCTGACTGACATTAAGCATTGGCTAGCGACACAATTCCAAATGAA
AAATCAAATTGTTCGGAATCACAAGAACAAAACACTAGCATTGTCTCAAGCATCGTACATCGACAAAATGTTGATTCGATATAAGATGCAGGACTCCAAGAAAGGATTAT
TACCTTTCAGGCATGGAGTTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAAACACATTCCCTATGCATCAGCGGTCGGTAGTCTGATG
TATGCCATGCTATGTACCCGACCCGACATATGCTATGCTGTGGGAATTGTCAGCAGATATCAGTCCAATCCGGGACGTGCTCATTGGACTGCCGTTAAGAATATCCTCAA
GTATCTTAGGAGAACGAGGGACTATATGCTAATGTACGGTGCTAAGGATCTGATCCTTACAGGGTACACTGACTCAGATTTTCAGACCGATGTAGATTCGAGGAAATCGA
CATCAGGATCTGTCTTCACTCTGAACGGAGGAGCAATAATATGGAGAAGCATAAAGCAAGGTTGCATTGCTGATTCCACCATGGAGGCTGAGTATGTTGCTGCTTGTGAA
GCAGCGAAAGAATCTGTATGGCTTAGGAAGTTCTTAACTGATTTGGAAGTCGTTCCAAATATGCATCTTCCCGTCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAA
TTCAAAAGAACCAAGAAGCCATAAGCGAGGAAAACATATTGAGCGCAAATATCATCTCATAAGGGAGATTGTGCAACGAGGAGATGTGATCGTCACACAGATAGCTTCAG
AGCACAACATTGCTGATCCATTTACAAAGCCCCTCACGGCTAAAGTGTTTGAAGGGCACCTAGTGAGTCTAGGACTACGAGTTATGTAA
Protein sequenceShow/hide protein sequence
MPRRSGRVITQPDRYLGLIETQVIIPDDGVEDPLTYKHAMNDEDRDQWIKAMNLEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDQTGKVQTFKARLVAKGYTQREG
VDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMAQPEGFIEQDHEQRVCKLKRSIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVN
STVALLVLYVDDILLIGNDVGILTDIKHWLATQFQMKNQIVRNHKNKTLALSQASYIDKMLIRYKMQDSKKGLLPFRHGVHLSKEQSPKTPQEVEDMKHIPYASAVGSLM
YAMLCTRPDICYAVGIVSRYQSNPGRAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSGSVFTLNGGAIIWRSIKQGCIADSTMEAEYVAACE
AAKESVWLRKFLTDLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASEHNIADPFTKPLTAKVFEGHLVSLGLRVM