; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc09g0251801 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc09g0251801
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionTransposon Ty1-GR2 Gag-Pol polyprotein
Genome locationCMiso1.1chr09:17865248..17867166
RNA-Seq ExpressionCmc09g0251801
SyntenyCmc09g0251801
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016772 - transferase activity, transferring phosphorus-containing groups (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047995.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa]6.2e-29484.8Show/hide
Query:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI
        MFIGYPQGVKGYKLWC++KGMNKCIISRDV FNET+MPYCVKE++KQQT DHVVT+VRIASE RPS+ L    +QPPLVSEIEDTQQSEFDGIQSQQERI
Subjt:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI

Query:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK
        LIDE AFIEESSSNNDLQNYQLTRDR QRERHAPIRYGYADLVAYALTCAAD IE +PLTFE+AIVSDSKK+WKDAME ELFSLHKNQTWSLVPKP NQK
Subjt:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK

Query:  LIQSKWIYKIKPDGCHDN--------------------ISSWRTGGSDLHG---------SPKGYEVKGKEDMVCRLHKSLYGLKQSPRQWYIRLYTFIL
        LIQSKWIYKIKP G   N                    I       + LHG          PKGYEVKGKEDMVCRLHKSLYGLKQSPRQWYI   TFIL
Subjt:  LIQSKWIYKIKPDGCHDN--------------------ISSWRTGGSDLHG---------SPKGYEVKGKEDMVCRLHKSLYGLKQSPRQWYIRLYTFIL

Query:  KQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSK
        KQGFHRNSYDACVYWK SQKGTYIYLLLY+DD+ILVSKDY  ICELKKQLSNEFEMKDLGELKRILGMDVKRDR+KGL TISQESYVIKLLEKYN+S SK
Subjt:  KQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSK

Query:  TVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLE
         VSTPLAS+FRLSSSQCPVTKQER+EMSNIPYCNA+GSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWK VKWVLRYL+GS SVSLCYSRD DKSTLLE
Subjt:  TVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLE

Query:  GFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEFIPIIHCDSQSAIYLAKNPYHHERSKHI
        GFTDADY ADLDKRRSLSGHIFRLYGNVVSWKVNL PVVALSTTESEYISLGEAVKEAVWL+RIVGELLSQEFIPIIHCDSQSAI+LAKNP HHERSKHI
Subjt:  GFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEFIPIIHCDSQSAIYLAKNPYHHERSKHI

Query:  DVKFHYIRNVIA
        DVKFHYIRNVIA
Subjt:  DVKFHYIRNVIA

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]9.0e-29381.06Show/hide
Query:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI
        MFIGYPQGVKGYKLWC++KGMNKCIISRDV FNET+MPYCVKE++KQQT DHVVT+VRIASE RPS+ L    +QPPLVSEIEDTQQSEFDGIQSQQERI
Subjt:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI

Query:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK
        LIDEGAFIEESSSNNDLQNYQLTRDR QRERHAPIRYGYADLVAYALTCAAD IE +PLTFE+AIVSDSKK+WKDAME ELFSLHKNQTWSLVPKP NQK
Subjt:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK

Query:  LIQSKWIYKIKPDGCHDNISSWRT----------GGSDLH----------------------------------------------GSPKGYEVKGKEDM
        LIQSKWIYKIKP    ++   ++            G D H                                                PKGYEVKGKEDM
Subjt:  LIQSKWIYKIKPDGCHDNISSWRT----------GGSDLH----------------------------------------------GSPKGYEVKGKEDM

Query:  VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRD
        VCRLHKSLYGLKQSPRQWYIR  TFILKQGFHRNSYDACVYWK SQKGTYIYLLLY+DD+ILVSKDY EICELKKQLSNEFEMKDLGELKRILGMDVKRD
Subjt:  VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRD

Query:  RKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVK
        ++KGL TISQESYVIKLLEKYN+S SK VSTPLAS+FRLSSSQCPVTKQER+EMSNIPYCNA+GSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWK VK
Subjt:  RKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVK

Query:  WVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEF
        WVLRYL+GS SVSLCYSRD DKSTLLEGFTDADY ADLDKRRSLSGHIFRLYGNVVSWKVNL PVVALSTTESEYISLGEAVKEAVWL+RIVGELLSQEF
Subjt:  WVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEF

Query:  IPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA
        IPIIHCDSQSAI+LAKNP HHERSKHIDVKFHYIRNVIA
Subjt:  IPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA

KAA0067607.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa]1.2e-21775.5Show/hide
Query:  PYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRY
        P  VK+ERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVS+IEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRY
Subjt:  PYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRY

Query:  GYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQKLIQSKWIYKIKPDGCHDNISSWRTGGSDLHGSPKGYE
        GYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKD MEVELFSLHKNQTWSLVPKPLNQKLIQSKWIYKIKP    ++   ++          KGY 
Subjt:  GYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQKLIQSKWIYKIKPDGCHDNISSWRTGGSDLHGSPKGYE

Query:  VKGKEDM------VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYW---KLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMK
         K   D       V R H S+  +      + + +    +   F     +  +Y    +  +  T  +++L      + + +Y+      ++LSNEFEMK
Subjt:  VKGKEDM------VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYW---KLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMK

Query:  DLGELKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMI
        DLGELKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSK VSTPLASYFRLSSSQCPVTKQE                       RPDLGY MSMI
Subjt:  DLGELKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMI

Query:  SRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKE
        SRFMSNPGKEHWKVVKWVLRYL+GSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNL PVVALSTTESEYISLGEAVKE
Subjt:  SRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKE

Query:  AVWLQRIVGELLSQEFIPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA
        AV LQRIVGELLSQEFIPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA
Subjt:  AVWLQRIVGELLSQEFIPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA

TYK13826.1 putative polyprotein [Cucumis melo var. makuwa]1.4e-29381.22Show/hide
Query:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI
        MFIGYPQGVKGYKLWC++KGMNKCIISRDV FNET+MPYCVKE++KQQT DHVVT+VRIASE RPS+ L    +QPPLVSEIEDTQQSEFDGIQSQQERI
Subjt:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI

Query:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK
        LIDEGAFIEESSSNNDLQNYQLTRDR QRERHAPIRYGYADLVAYALTCAAD IE +PLTFE+AIVSDSKK+WKDAME ELFSLHKNQTWSLVPKP NQK
Subjt:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK

Query:  LIQSKWIYKIKPDGCHDNISSWRT----------GGSDLH----------------------------------------------GSPKGYEVKGKEDM
        LIQSKWIYKIKP    ++   ++            G D H                                                PKGYEVKGKEDM
Subjt:  LIQSKWIYKIKPDGCHDNISSWRT----------GGSDLH----------------------------------------------GSPKGYEVKGKEDM

Query:  VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRD
        VCRLHKSLYGLKQSPRQWYIR  TFILKQGFHRNSYDACVYWK SQKGTYIYLLLY+DD+ILVSKDY EICELKKQLSNEFEMKDLGELKRILGMDVKRD
Subjt:  VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRD

Query:  RKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVK
        ++KGL TISQESYVIKLLEKYN+SGSK VSTPLAS+FRLSSSQCPVTKQER+EMSNIPYCNA+GSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWK VK
Subjt:  RKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVK

Query:  WVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEF
        WVLRYL+GS SVSLCYSRD DKSTLLEGFTDADY ADLDKRRSLSGHIFRLYGNVVSWKVNL PVVALSTTESEYISLGEAVKEAVWL+RIVGELLSQEF
Subjt:  WVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEF

Query:  IPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA
        IPIIHCDSQSAI+LAKNP HHERSKHIDVKFHYIRNVIA
Subjt:  IPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]8.5e-25172.77Show/hide
Query:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI
        MFIGYPQGVKGYKLWC++KGMNKCIISRDV FNET+MPYCVKE++KQQT DHVVT+VRIASE RPS+ L    +QPPLVSEIEDTQQSEFDGIQSQQERI
Subjt:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI

Query:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK
        LIDEGAFIEESSSNNDLQNYQLTRDR QRERHAPIRYGYADLVAYALTCAAD IE +PLTFE+AIVSDSKK+WKDAME ELFSLHKNQTWSLVPKP NQK
Subjt:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK

Query:  LIQSKWIYKIKPDGCHDNISSWRT----------GGSDLH----------------------------------------------GSPKGYEVKGKEDM
        LIQSKWIYKIKP    ++   ++            G D H                                                PKGYEVKGKEDM
Subjt:  LIQSKWIYKIKPDGCHDNISSWRT----------GGSDLH----------------------------------------------GSPKGYEVKGKEDM

Query:  VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRD
        VCRLHKSLYGLKQSPRQWYIR  TFILKQGFHRNSYDACVYWK SQKGTYIYLLLY+DD+ILVSKDY EICELKKQLSNEFEMKDLGELKRILGMDVKRD
Subjt:  VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRD

Query:  RKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVK
        ++KGL TISQESYVIKLLEKYN+S SK VSTPLAS+FRLSSSQCPVTKQER+EMSNIPYCNA+GSIMYLMICTRPDLGYAMS                  
Subjt:  RKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVK

Query:  WVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEF
                S SVSLCYSRD DKSTLLEGFTDADY ADLDKR  L                       L    +EYISLGEAVKEAVWL+RIVGELLSQEF
Subjt:  WVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEF

Query:  IPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA
        IPIIHCDSQSAI+LAKNP HHERSKHIDVKFHYIRNVIA
Subjt:  IPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA

TrEMBL top hitse value%identityAlignment
A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class3.0e-29484.8Show/hide
Query:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI
        MFIGYPQGVKGYKLWC++KGMNKCIISRDV FNET+MPYCVKE++KQQT DHVVT+VRIASE RPS+ L    +QPPLVSEIEDTQQSEFDGIQSQQERI
Subjt:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI

Query:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK
        LIDE AFIEESSSNNDLQNYQLTRDR QRERHAPIRYGYADLVAYALTCAAD IE +PLTFE+AIVSDSKK+WKDAME ELFSLHKNQTWSLVPKP NQK
Subjt:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK

Query:  LIQSKWIYKIKPDGCHDN--------------------ISSWRTGGSDLHG---------SPKGYEVKGKEDMVCRLHKSLYGLKQSPRQWYIRLYTFIL
        LIQSKWIYKIKP G   N                    I       + LHG          PKGYEVKGKEDMVCRLHKSLYGLKQSPRQWYI   TFIL
Subjt:  LIQSKWIYKIKPDGCHDN--------------------ISSWRTGGSDLHG---------SPKGYEVKGKEDMVCRLHKSLYGLKQSPRQWYIRLYTFIL

Query:  KQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSK
        KQGFHRNSYDACVYWK SQKGTYIYLLLY+DD+ILVSKDY  ICELKKQLSNEFEMKDLGELKRILGMDVKRDR+KGL TISQESYVIKLLEKYN+S SK
Subjt:  KQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSK

Query:  TVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLE
         VSTPLAS+FRLSSSQCPVTKQER+EMSNIPYCNA+GSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWK VKWVLRYL+GS SVSLCYSRD DKSTLLE
Subjt:  TVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLE

Query:  GFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEFIPIIHCDSQSAIYLAKNPYHHERSKHI
        GFTDADY ADLDKRRSLSGHIFRLYGNVVSWKVNL PVVALSTTESEYISLGEAVKEAVWL+RIVGELLSQEFIPIIHCDSQSAI+LAKNP HHERSKHI
Subjt:  GFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEFIPIIHCDSQSAIYLAKNPYHHERSKHI

Query:  DVKFHYIRNVIA
        DVKFHYIRNVIA
Subjt:  DVKFHYIRNVIA

A0A5A7UB25 Putative gag-pol polyprotein4.3e-29381.06Show/hide
Query:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI
        MFIGYPQGVKGYKLWC++KGMNKCIISRDV FNET+MPYCVKE++KQQT DHVVT+VRIASE RPS+ L    +QPPLVSEIEDTQQSEFDGIQSQQERI
Subjt:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI

Query:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK
        LIDEGAFIEESSSNNDLQNYQLTRDR QRERHAPIRYGYADLVAYALTCAAD IE +PLTFE+AIVSDSKK+WKDAME ELFSLHKNQTWSLVPKP NQK
Subjt:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK

Query:  LIQSKWIYKIKPDGCHDNISSWRT----------GGSDLH----------------------------------------------GSPKGYEVKGKEDM
        LIQSKWIYKIKP    ++   ++            G D H                                                PKGYEVKGKEDM
Subjt:  LIQSKWIYKIKPDGCHDNISSWRT----------GGSDLH----------------------------------------------GSPKGYEVKGKEDM

Query:  VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRD
        VCRLHKSLYGLKQSPRQWYIR  TFILKQGFHRNSYDACVYWK SQKGTYIYLLLY+DD+ILVSKDY EICELKKQLSNEFEMKDLGELKRILGMDVKRD
Subjt:  VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRD

Query:  RKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVK
        ++KGL TISQESYVIKLLEKYN+S SK VSTPLAS+FRLSSSQCPVTKQER+EMSNIPYCNA+GSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWK VK
Subjt:  RKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVK

Query:  WVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEF
        WVLRYL+GS SVSLCYSRD DKSTLLEGFTDADY ADLDKRRSLSGHIFRLYGNVVSWKVNL PVVALSTTESEYISLGEAVKEAVWL+RIVGELLSQEF
Subjt:  WVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEF

Query:  IPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA
        IPIIHCDSQSAI+LAKNP HHERSKHIDVKFHYIRNVIA
Subjt:  IPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA

A0A5A7VKC2 Retrotransposon protein, putative, Ty1-copia subclass6.0e-21875.5Show/hide
Query:  PYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRY
        P  VK+ERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVS+IEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRY
Subjt:  PYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRY

Query:  GYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQKLIQSKWIYKIKPDGCHDNISSWRTGGSDLHGSPKGYE
        GYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKD MEVELFSLHKNQTWSLVPKPLNQKLIQSKWIYKIKP    ++   ++          KGY 
Subjt:  GYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQKLIQSKWIYKIKPDGCHDNISSWRTGGSDLHGSPKGYE

Query:  VKGKEDM------VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYW---KLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMK
         K   D       V R H S+  +      + + +    +   F     +  +Y    +  +  T  +++L      + + +Y+      ++LSNEFEMK
Subjt:  VKGKEDM------VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYW---KLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMK

Query:  DLGELKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMI
        DLGELKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSK VSTPLASYFRLSSSQCPVTKQE                       RPDLGY MSMI
Subjt:  DLGELKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMI

Query:  SRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKE
        SRFMSNPGKEHWKVVKWVLRYL+GSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNL PVVALSTTESEYISLGEAVKE
Subjt:  SRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKE

Query:  AVWLQRIVGELLSQEFIPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA
        AV LQRIVGELLSQEFIPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA
Subjt:  AVWLQRIVGELLSQEFIPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA

A0A5D3CTV2 Putative polyprotein6.7e-29481.22Show/hide
Query:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI
        MFIGYPQGVKGYKLWC++KGMNKCIISRDV FNET+MPYCVKE++KQQT DHVVT+VRIASE RPS+ L    +QPPLVSEIEDTQQSEFDGIQSQQERI
Subjt:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI

Query:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK
        LIDEGAFIEESSSNNDLQNYQLTRDR QRERHAPIRYGYADLVAYALTCAAD IE +PLTFE+AIVSDSKK+WKDAME ELFSLHKNQTWSLVPKP NQK
Subjt:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK

Query:  LIQSKWIYKIKPDGCHDNISSWRT----------GGSDLH----------------------------------------------GSPKGYEVKGKEDM
        LIQSKWIYKIKP    ++   ++            G D H                                                PKGYEVKGKEDM
Subjt:  LIQSKWIYKIKPDGCHDNISSWRT----------GGSDLH----------------------------------------------GSPKGYEVKGKEDM

Query:  VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRD
        VCRLHKSLYGLKQSPRQWYIR  TFILKQGFHRNSYDACVYWK SQKGTYIYLLLY+DD+ILVSKDY EICELKKQLSNEFEMKDLGELKRILGMDVKRD
Subjt:  VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRD

Query:  RKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVK
        ++KGL TISQESYVIKLLEKYN+SGSK VSTPLAS+FRLSSSQCPVTKQER+EMSNIPYCNA+GSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWK VK
Subjt:  RKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVK

Query:  WVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEF
        WVLRYL+GS SVSLCYSRD DKSTLLEGFTDADY ADLDKRRSLSGHIFRLYGNVVSWKVNL PVVALSTTESEYISLGEAVKEAVWL+RIVGELLSQEF
Subjt:  WVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEF

Query:  IPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA
        IPIIHCDSQSAI+LAKNP HHERSKHIDVKFHYIRNVIA
Subjt:  IPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA

A0A5D3DNU1 Putative gag-pol polyprotein4.1e-25172.77Show/hide
Query:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI
        MFIGYPQGVKGYKLWC++KGMNKCIISRDV FNET+MPYCVKE++KQQT DHVVT+VRIASE RPS+ L    +QPPLVSEIEDTQQSEFDGIQSQQERI
Subjt:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI

Query:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK
        LIDEGAFIEESSSNNDLQNYQLTRDR QRERHAPIRYGYADLVAYALTCAAD IE +PLTFE+AIVSDSKK+WKDAME ELFSLHKNQTWSLVPKP NQK
Subjt:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK

Query:  LIQSKWIYKIKPDGCHDNISSWRT----------GGSDLH----------------------------------------------GSPKGYEVKGKEDM
        LIQSKWIYKIKP    ++   ++            G D H                                                PKGYEVKGKEDM
Subjt:  LIQSKWIYKIKPDGCHDNISSWRT----------GGSDLH----------------------------------------------GSPKGYEVKGKEDM

Query:  VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRD
        VCRLHKSLYGLKQSPRQWYIR  TFILKQGFHRNSYDACVYWK SQKGTYIYLLLY+DD+ILVSKDY EICELKKQLSNEFEMKDLGELKRILGMDVKRD
Subjt:  VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRD

Query:  RKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVK
        ++KGL TISQESYVIKLLEKYN+S SK VSTPLAS+FRLSSSQCPVTKQER+EMSNIPYCNA+GSIMYLMICTRPDLGYAMS                  
Subjt:  RKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVK

Query:  WVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEF
                S SVSLCYSRD DKSTLLEGFTDADY ADLDKR  L                       L    +EYISLGEAVKEAVWL+RIVGELLSQEF
Subjt:  WVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEF

Query:  IPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA
        IPIIHCDSQSAI+LAKNP HHERSKHIDVKFHYIRNVIA
Subjt:  IPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.4e-6028.8Show/hide
Query:  DTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQN--YQLTRDRAQRERHAP-IRYGYADLVAYALTCAADGI-EEKPLTFEKAIVSDSKKRWKDAMEV
        D   +E  G  +  E    +    ++E   +N  +N   ++   R++R +  P I Y   D     +   A  I  + P +F++    D K  W++A+  
Subjt:  DTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQN--YQLTRDRAQRERHAP-IRYGYADLVAYALTCAADGI-EEKPLTFEKAIVSDSKKRWKDAMEV

Query:  ELFSLHKNQTWSLVPKPLNQKLIQSKWIYKIKPDGCHD-----------------------------NISSWR-----------------TGGSDLHGS-
        EL +   N TW++  +P N+ ++ S+W++ +K +   +                              ISS+R                    + L+G+ 
Subjt:  ELFSLHKNQTWSLVPKPLNQKLIQSKWIYKIKPDGCHD-----------------------------NISSWR-----------------TGGSDLHGS-

Query:  --------PKGYEVKGKEDMVCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTY---IYLLLYIDDIILVSKDYVEICELKKQL
                P+G  +    D VC+L+K++YGLKQ+ R W+      + +  F  +S D C+Y  +  KG     IY+LLY+DD+++ + D   +   K+ L
Subjt:  --------PKGYEVKGKEDMVCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTY---IYLLLYIDDIILVSKDYVEICELKKQL

Query:  SNEFEMKDLGELKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSKTVSTPLAS---YFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTR
          +F M DL E+K  +G+ ++    K    +SQ +YV K+L K+N+     VSTPL S   Y  L+S           E  N P  + IG +MY+M+CTR
Subjt:  SNEFEMKDLGELKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSKTVSTPLAS---YFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTR

Query:  PDLGYAMSMISRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYG-NVVSWKVNLHPVVALSTTES
        PDL  A++++SR+ S    E W+ +K VLRYL+G+  + L + ++      + G+ D+D+      R+S +G++F+++  N++ W       VA S+TE+
Subjt:  PDLGYAMSMISRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYG-NVVSWKVNLHPVVALSTTES

Query:  EYISLGEAVKEAVWLQRIVGELLSQEFIPI-IHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVI
        EY++L EAV+EA+WL+ ++  +  +   PI I+ D+Q  I +A NP  H+R+KHID+K+H+ R  +
Subjt:  EYISLGEAVKEAVWLQRIVGELLSQEFIPI-IHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVI

P0CV72 Secreted RxLR effector protein 1611.4e-3047.41Show/hide
Query:  MSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYG
        M N+PY +A+G+IMYLM+ TRPDL  A+ ++S+F S+P   HW+ +K VLRYL+ + +  L ++R    +  L G++DAD+  D++ RRS SG++F+L G
Subjt:  MSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYG

Query:  NVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWL
          VSW+      VALS+TE EY++L EA +EAVWL
Subjt:  NVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-10736.89Show/hide
Query:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI
        +FIGY     GY+LW   K   K I SRDV F E+++        K + +  +   V I S         + +D        E ++Q E  G   +Q   
Subjt:  MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERI

Query:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK
        L DEG    E  +  + Q+  L   R++R R    RY   + V  +        + +P + ++ +    K +   AM+ E+ SL KN T+ LV  P  ++
Subjt:  LIDEGAFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQK

Query:  LIQSKWIYKIKPDG-C---------------------HDNI-------SSWRT-----------------GGSDLHG---------SPKGYEVKGKEDMV
         ++ KW++K+K DG C                      D I       +S RT                   + LHG          P+G+EV GK+ MV
Subjt:  LIQSKWIYKIKPDG-C---------------------HDNI-------SSWRT-----------------GGSDLHG---------SPKGYEVKGKEDMV

Query:  CRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRDR
        C+L+KSLYGLKQ+PRQWY++  +F+  Q + +   D CVY+K   +  +I LLLY+DD+++V KD   I +LK  LS  F+MKDLG  ++ILGM + R+R
Subjt:  CRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRDR

Query:  KKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVKW
              +SQE Y+ ++LE++N+  +K VSTPLA + +LS   CP T +E+  M+ +PY +A+GS+MY M+CTRPD+ +A+ ++SRF+ NPGKEHW+ VKW
Subjt:  KKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVKW

Query:  VLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEFI
        +LRYL G+T   LC+        +L+G+TDAD   D+D R+S +G++F   G  +SW+  L   VALSTTE+EYI+  E  KE +WL+R + EL   +  
Subjt:  VLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVWLQRIVGELLSQEFI

Query:  PIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVI
         +++CDSQSAI L+KN  +H R+KHIDV++H+IR ++
Subjt:  PIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-4934.94Show/hide
Query:  PKGYEVKGKEDMVCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGE
        P G+  K + + VC+L K+LYGLKQ+PR WY+ L  ++L  GF  +  D  ++  L +  + +Y+L+Y+DDI++   D   +      LS  F +KD  E
Subjt:  PKGYEVKGKEDMVCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGE

Query:  LKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLS-SSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRF
        L   LG++ KR    GL  +SQ  Y++ LL + N+  +K V+TP+A   +LS  S   +T           Y   +GS+ YL   TRPD+ YA++ +S+F
Subjt:  LKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLS-SSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRF

Query:  MSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVW
        M  P +EH + +K +LRYL G+ +  +   +    S  L  ++DAD+  D D   S +G+I  L  + +SW       V  S+TE+EY S+     E  W
Subjt:  MSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVW

Query:  LQRIVGEL-LSQEFIPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVI
        +  ++ EL +     P+I+CD+  A YL  NP  H R KHI + +H+IRN +
Subjt:  LQRIVGEL-LSQEFIPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.9e-5235.31Show/hide
Query:  PKGYEVKGKEDMVCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGE
        P G+  K + D VCRL K++YGLKQ+PR WY+ L T++L  GF  +  D  ++  L +  + IY+L+Y+DDI++   D V +      LS  F +K+  +
Subjt:  PKGYEVKGKEDMVCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGE

Query:  LKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRL---SSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMIS
        L   LG++ KR   +GL  +SQ  Y + LL + N+  +K V+TP+A+  +L   S ++ P   +         Y   +GS+ YL   TRPDL YA++ +S
Subjt:  LKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRL---SSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMIS

Query:  RFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEA
        ++M  P  +HW  +K VLRYL G+    +   +    S  L  ++DAD+  D D   S +G+I  L  + +SW       V  S+TE+EY S+     E 
Subjt:  RFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEA

Query:  VWLQRIVGEL-LSQEFIPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVI
         W+  ++ EL +     P+I+CD+  A YL  NP  H R KHI + +H+IRN +
Subjt:  VWLQRIVGEL-LSQEFIPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.1e-4629.34Show/hide
Query:  WKDAMEVELFSLHKNQTWSLVPKPLNQKLIQSKWIYKIK--PDGCHDNISS-------WRTGGSD-----------------------------------
        W  AM+ E+ ++    TW +   P N+K I  KW+YKIK   DG  +   +        +  G D                                   
Subjt:  WKDAMEVELFSLHKNQTWSLVPKPLNQKLIQSKWIYKIK--PDGCHDNISS-------WRTGGSD-----------------------------------

Query:  --LHGS---------PKGYEVKGKEDM----VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVE
          L+G          P GY  +  + +    VC L KS+YGLKQ+ RQW+++    ++  GF ++  D   + K++    ++ +L+Y+DDII+ S +   
Subjt:  --LHGS---------PKGYEVKGKEDM----VCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVE

Query:  ICELKKQLSNEFEMKDLGELKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYL
        + ELK QL + F+++DLG LK  LG+++ R    G++ I Q  Y + LL++  + G K  S P+      S+         +       Y   IG +MYL
Subjt:  ICELKKQLSNEFEMKDLGELKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYL

Query:  MICTRPDLGYAMSMISRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALS
         I TR D+ +A++ +S+F   P   H + V  +L Y++G+    L YS   +    L+ F+DA + +  D RRS +G+   L  +++SWK     VV+ S
Subjt:  MICTRPDLGYAMSMISRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALS

Query:  TTESEYISLGEAVKEAVWLQRIVGEL-LSQEFIPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIR
        + E+EY +L  A  E +WL +   EL L      ++ CD+ +AI++A N   HER+KHI+   H +R
Subjt:  TTESEYISLGEAVKEAVWLQRIVGEL-LSQEFIPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIR

ATMG00240.1 Gag-Pol-related retrotransposon family protein3.2e-0638.75Show/hide
Query:  MYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSG
        MYL I TRPDL +A++ +S+F S       + V  VL Y++G+    L YS   D    L+ F D+D+ +  D RRS++G
Subjt:  MYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSG

ATMG00810.1 DNA/RNA polymerases superfamily protein4.5e-2432.07Show/hide
Query:  IYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQE
        +YLLLY+DDI+L       +  L  QLS+ F MKDLG +   LG+ +K     GL  +SQ  Y  ++L    +   K +STPL    +L+SS       +
Subjt:  IYLLLYIDDIILVSKDYVEICELKKQLSNEFEMKDLGELKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQE

Query:  RLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFR
          +  +I     +G++ YL + TRPD+ YA++++ + M  P    + ++K VLRY++G+    L   +  +    ++ F D+D+      RRS +G    
Subjt:  RLEMSNIPYCNAIGSIMYLMICTRPDLGYAMSMISRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFR

Query:  LYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVW
        L  N++SW     P V+ S+TE+EY +L     E  W
Subjt:  LYGNVVSWKVNLHPVVALSTTESEYISLGEAVKEAVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.5e-0643.14Show/hide
Query:  WKDAMEVELFSLHKNQTWSLVPKPLNQKLIQSKWIYKIK--PDGCHDNISS
        W  AM+ EL +L +N+TW LVP P+NQ ++  KW++K K   DG  D + +
Subjt:  WKDAMEVELFSLHKNQTWSLVPKPLNQKLIQSKWIYKIK--PDGCHDNISS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTATTGGTTATCCTCAGGGTGTCAAAGGTTATAAACTTTGGTGCTTGAAAAAAGGGATGAATAAATGCATTATCAGTAGAGATGTAAATTTTAATGAGACA
AAAATGCCTTACTGTGTTAAAGAGGAACGGAAACAACAGACAAGTGATCATGTTGTGACAAAAGTCAGAATTGCTTCAGAGGGACGACCATCAGTTGGCTTATAT
GCTTTTAGTGATCAGCCGCCACTAGTTTCAGAAATAGAGGATACACAGCAGTCTGAATTTGATGGTATACAATCTCAACAGGAGAGGATTTTGATTGATGAGGGA
GCTTTTATTGAAGAAAGCTCAAGTAACAATGACCTACAGAATTATCAGCTTACTCGTGACAGAGCTCAGAGGGAAAGACATGCTCCTATAAGGTATGGTTATGCC
GACTTAGTTGCTTACGCTCTTACTTGTGCAGCTGATGGTATTGAAGAAAAGCCTCTTACTTTTGAAAAGGCAATTGTATCTGATTCAAAAAAACGGTGGAAGGAT
GCCATGGAAGTAGAGTTGTTCTCTTTACATAAGAATCAAACATGGTCGTTGGTTCCAAAGCCTCTTAATCAGAAACTCATTCAATCAAAATGGATTTACAAAATT
AAGCCAGATGGATGTCACGACAACATTTCTTCATGGAGAACTGGAGGAAGTGATCTACATGGCTCACCTAAGGGCTATGAGGTGAAGGGTAAGGAAGACATGGTT
TGTCGTCTTCACAAGTCCCTCTATGGACTAAAACAATCTCCAAGACAGTGGTATATCAGGTTATATACTTTCATTCTAAAGCAGGGATTCCACAGGAACTCATAT
GATGCTTGTGTTTACTGGAAACTATCTCAGAAAGGTACGTACATCTATCTACTGTTGTATATAGATGATATTATACTAGTGTCTAAGGATTATGTTGAAATATGT
GAACTCAAGAAACAGTTGAGTAATGAGTTTGAAATGAAAGATTTAGGTGAACTAAAAAGGATCCTAGGCATGGATGTAAAGAGAGATAGAAAGAAAGGTTTGTCA
ACCATTTCGCAGGAGAGTTATGTGATCAAACTGCTTGAAAAGTATAATATATCTGGTAGCAAGACAGTTTCAACACCCTTAGCATCTTACTTTAGACTTTCTTCG
TCTCAATGTCCTGTTACTAAACAAGAAAGGTTAGAGATGTCTAATATTCCATATTGTAATGCTATTGGAAGTATTATGTATCTGATGATTTGTACTAGGCCTGAC
TTGGGTTATGCTATGAGTATGATTAGTAGGTTTATGTCAAATCCTGGGAAGGAACATTGGAAAGTTGTTAAATGGGTGTTACGATATTTGGAAGGTAGTACCAGT
GTATCATTGTGTTATAGTAGGGATTATGATAAATCAACACTGTTAGAAGGTTTCACAGATGCAGATTATGTTGCAGATCTTGATAAAAGAAGGTCTTTATCAGGT
CACATTTTTCGCTTGTATGGTAATGTTGTCAGTTGGAAAGTTAACCTACATCCAGTTGTTGCTTTGTCGACTACTGAGTCAGAATATATTTCTCTTGGTGAAGCA
GTTAAGGAAGCAGTGTGGTTGCAAAGAATTGTTGGTGAGTTGTTATCGCAGGAGTTTATTCCTATCATCCATTGTGATAGCCAGAGTGCTATTTATCTTGCGAAG
AATCCATATCATCATGAACGATCTAAACATATCGATGTCAAATTTCATTACATTAGAAACGTTATTGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTATTGGTTATCCTCAGGGTGTCAAAGGTTATAAACTTTGGTGCTTGAAAAAAGGGATGAATAAATGCATTATCAGTAGAGATGTAAATTTTAATGAGACA
AAAATGCCTTACTGTGTTAAAGAGGAACGGAAACAACAGACAAGTGATCATGTTGTGACAAAAGTCAGAATTGCTTCAGAGGGACGACCATCAGTTGGCTTATAT
GCTTTTAGTGATCAGCCGCCACTAGTTTCAGAAATAGAGGATACACAGCAGTCTGAATTTGATGGTATACAATCTCAACAGGAGAGGATTTTGATTGATGAGGGA
GCTTTTATTGAAGAAAGCTCAAGTAACAATGACCTACAGAATTATCAGCTTACTCGTGACAGAGCTCAGAGGGAAAGACATGCTCCTATAAGGTATGGTTATGCC
GACTTAGTTGCTTACGCTCTTACTTGTGCAGCTGATGGTATTGAAGAAAAGCCTCTTACTTTTGAAAAGGCAATTGTATCTGATTCAAAAAAACGGTGGAAGGAT
GCCATGGAAGTAGAGTTGTTCTCTTTACATAAGAATCAAACATGGTCGTTGGTTCCAAAGCCTCTTAATCAGAAACTCATTCAATCAAAATGGATTTACAAAATT
AAGCCAGATGGATGTCACGACAACATTTCTTCATGGAGAACTGGAGGAAGTGATCTACATGGCTCACCTAAGGGCTATGAGGTGAAGGGTAAGGAAGACATGGTT
TGTCGTCTTCACAAGTCCCTCTATGGACTAAAACAATCTCCAAGACAGTGGTATATCAGGTTATATACTTTCATTCTAAAGCAGGGATTCCACAGGAACTCATAT
GATGCTTGTGTTTACTGGAAACTATCTCAGAAAGGTACGTACATCTATCTACTGTTGTATATAGATGATATTATACTAGTGTCTAAGGATTATGTTGAAATATGT
GAACTCAAGAAACAGTTGAGTAATGAGTTTGAAATGAAAGATTTAGGTGAACTAAAAAGGATCCTAGGCATGGATGTAAAGAGAGATAGAAAGAAAGGTTTGTCA
ACCATTTCGCAGGAGAGTTATGTGATCAAACTGCTTGAAAAGTATAATATATCTGGTAGCAAGACAGTTTCAACACCCTTAGCATCTTACTTTAGACTTTCTTCG
TCTCAATGTCCTGTTACTAAACAAGAAAGGTTAGAGATGTCTAATATTCCATATTGTAATGCTATTGGAAGTATTATGTATCTGATGATTTGTACTAGGCCTGAC
TTGGGTTATGCTATGAGTATGATTAGTAGGTTTATGTCAAATCCTGGGAAGGAACATTGGAAAGTTGTTAAATGGGTGTTACGATATTTGGAAGGTAGTACCAGT
GTATCATTGTGTTATAGTAGGGATTATGATAAATCAACACTGTTAGAAGGTTTCACAGATGCAGATTATGTTGCAGATCTTGATAAAAGAAGGTCTTTATCAGGT
CACATTTTTCGCTTGTATGGTAATGTTGTCAGTTGGAAAGTTAACCTACATCCAGTTGTTGCTTTGTCGACTACTGAGTCAGAATATATTTCTCTTGGTGAAGCA
GTTAAGGAAGCAGTGTGGTTGCAAAGAATTGTTGGTGAGTTGTTATCGCAGGAGTTTATTCCTATCATCCATTGTGATAGCCAGAGTGCTATTTATCTTGCGAAG
AATCCATATCATCATGAACGATCTAAACATATCGATGTCAAATTTCATTACATTAGAAACGTTATTGCTTAG
Protein sequenceShow/hide protein sequence
MFIGYPQGVKGYKLWCLKKGMNKCIISRDVNFNETKMPYCVKEERKQQTSDHVVTKVRIASEGRPSVGLYAFSDQPPLVSEIEDTQQSEFDGIQSQQERILIDEG
AFIEESSSNNDLQNYQLTRDRAQRERHAPIRYGYADLVAYALTCAADGIEEKPLTFEKAIVSDSKKRWKDAMEVELFSLHKNQTWSLVPKPLNQKLIQSKWIYKI
KPDGCHDNISSWRTGGSDLHGSPKGYEVKGKEDMVCRLHKSLYGLKQSPRQWYIRLYTFILKQGFHRNSYDACVYWKLSQKGTYIYLLLYIDDIILVSKDYVEIC
ELKKQLSNEFEMKDLGELKRILGMDVKRDRKKGLSTISQESYVIKLLEKYNISGSKTVSTPLASYFRLSSSQCPVTKQERLEMSNIPYCNAIGSIMYLMICTRPD
LGYAMSMISRFMSNPGKEHWKVVKWVLRYLEGSTSVSLCYSRDYDKSTLLEGFTDADYVADLDKRRSLSGHIFRLYGNVVSWKVNLHPVVALSTTESEYISLGEA
VKEAVWLQRIVGELLSQEFIPIIHCDSQSAIYLAKNPYHHERSKHIDVKFHYIRNVIA