; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0098021 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0098021
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionTy1-copia retrotransposon protein
Genome locationCMiso1.1chr04:13266512..13268287
RNA-Seq ExpressionCmc04g0098021
SyntenyCmc04g0098021
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAD6453934.1 hypothetical protein E3N88_08640 [Mikania micrantha]9.6e-20263.11Show/hide
Query:  MLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTINES
        ML+SSG  DN+WGEAVLSAC VLNR+PHK LDKT YELWKG++PNL +LKVWGCLAKV  P  K+  +GSKT D +FIGYAQNSAAYRFM L+D++I ES
Subjt:  MLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTINES

Query:  RDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLR--CELELRRSKRQRTEKSFGPDFLSTFIVK--RRDEIDCNFTNLYLIDEDP
         +AEFFE  FPLK+                  I+S   VS+   +S+++    +E RRSKR RTE SFGPDFL++F+ +    D +  +F +++L++ DP
Subjt:  RDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLR--CELELRRSKRQRTEKSFGPDFLSTFIVK--RRDEIDCNFTNLYLIDEDP

Query:  KTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLIALV
        KTYQE + SV++++WKEAIKSE+DS++ NHTW+L DLP GNKPI  KWIFK+K++P+G++++YKARLV+ G+TQK G+DYFDTYS VTKITTIR+LI++ 
Subjt:  KTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLIALV

Query:  AIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLYVDDM
        AI+ LLIHQM+VKT FLNGDL+EEIYM QPEGF +SG E+KVCKL+KSLY LKQAPKKWYEKF+ TL  +G+ +N+SD+CVYSK      +LICLYVDDM
Subjt:  AIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLYVDDM

Query:  LIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGSVMYL
        LIFG +M  I  TK FLSS FEMKDLGEA+VILGVKI++    +SLCQSHY+E++LKKFD F+++PV+TP+D S  LK N  +SVSQ EYAKIIGSVM+L
Subjt:  LIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGSVMYL

Query:  MNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV
        MNYTRPDIAY VSRLSRYTHNP + HW A+  L+RYL+GT++ CLH+NKF AV
Subjt:  MNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV

KAG7551885.1 Ribonuclease H-like superfamily [Arabidopsis thaliana x Arabidopsis arenosa]8.5e-19059.07Show/hide
Query:  MNVMLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTI
        MN ML+SSGL D +WGEA+LSAC++LN++PHK +  T YELWKGH+PNLSYLKVWGCL KV  PA K++T+G KT D +F+GYA NSAAYR +     ++
Subjt:  MNVMLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTI

Query:  N-----ESRDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLRCELELRRSKRQRTEKSFGPDFLSTFIVKRRDEIDCNFTNLYLI
              ESRD EFFE+VFP K++L   S     HD  N    S +      + +      E RRSKR RTE  FGPDF++TF+ +  DEI   F   +LI
Subjt:  N-----ESRDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLRCELELRRSKRQRTEKSFGPDFLSTFIVKRRDEIDCNFTNLYLI

Query:  DEDPKTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTL
        +EDPKT+ E + SV++  WKEA+ +E DS++ NHTW++VDLP G K IRCKWI K+K K +GS++++KARLV  G+ QKQG+DYFDTY+ VTKI +IR L
Subjt:  DEDPKTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTL

Query:  IALVAIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLY
        +A+ + H L++HQM+VKT FLNGDL EEIYM QPEGF I GQENKVCKL KSLY LKQAPK+W+EKF+NTL+ NGF  N  DTCV+SKV     ++ICLY
Subjt:  IALVAIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLY

Query:  VDDMLIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGS
        VDDMLI GT++E++ DTK+FLSS F+MKDLGEA+VILG+K+ K  +  SL QSHY+EKILKKF  +D    ++P+D+S HL  N+G+SV+Q EYAK+IGS
Subjt:  VDDMLIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGS

Query:  VMYLMNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV
        VMYLMN TRPDIAY VSRLSRYTHNP   HW AL  L+RYLKGTID+ L ++  S V
Subjt:  VMYLMNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV

KAG7571733.1 Integrase catalytic core [Arabidopsis suecica]3.8e-19059.57Show/hide
Query:  MNVMLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDK--
        MN ML+SSGL D +WGEA+LSAC++LN++PHK +  T YELWKGH+PNLSYLKVWGCL KV  PA K++T+G KT D +F+GYA NSAAYR  CL  K  
Subjt:  MNVMLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDK--

Query:  -----TINESRDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLRCELELRRSKRQRTEKSFGPDFLSTFIVKRRDEIDCNFTNLY
             ++ ESRD EFFE+VFP K++L   S     HD  N    S +      + +      E RRSKR RTE  FGPDF++TF+ +  DEI   F   +
Subjt:  -----TINESRDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLRCELELRRSKRQRTEKSFGPDFLSTFIVKRRDEIDCNFTNLY

Query:  LIDEDPKTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIR
        LI+EDPKT+ E + SV++  WKEA+ +E DS++ NHTW++VDLP G K IRCKWI K+K K +GS++++KARLV  G+ QKQG+DYFDTY+ VTKI +IR
Subjt:  LIDEDPKTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIR

Query:  TLIALVAIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILIC
         L+A+ + H L++HQM+VKT FLNGDL EEIYM QPEGF I GQENKVCKL KSLY LKQAPK+W+EKF+NTL+ NGF  N  DTCV+SKV     ++IC
Subjt:  TLIALVAIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILIC

Query:  LYVDDMLIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKII
        LYVDDMLI GT++E++ DTK+FLSS F+MKDLGEA+VILG+K+ K  +  SL QSHY+EKILKKF  +D    ++P+D+S HL  N+G+SV+Q EYAK+I
Subjt:  LYVDDMLIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKII

Query:  GSVMYLMNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV
        GSVMYLMN TRPDIAYAVSRLSRYTHNP   HW AL  L+RYLKGTID+ L ++  S V
Subjt:  GSVMYLMNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV

KAG7592238.1 Ribonuclease H-like superfamily [Arabidopsis thaliana x Arabidopsis arenosa]8.5e-19059.07Show/hide
Query:  MNVMLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTI
        MN ML+SSGL D +WGEA+LSAC++LN++PHK +  T YELWKGH+PNLSYLKVWGCL KV  PA K++T+G KT D +F+GYA NSAAYR +     ++
Subjt:  MNVMLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTI

Query:  N-----ESRDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLRCELELRRSKRQRTEKSFGPDFLSTFIVKRRDEIDCNFTNLYLI
              ESRD EFFE+VFP K++L   S     HD  N    S +      + +      E RRSKR RTE  FGPDF++TF+ +  DEI   F   +LI
Subjt:  N-----ESRDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLRCELELRRSKRQRTEKSFGPDFLSTFIVKRRDEIDCNFTNLYLI

Query:  DEDPKTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTL
        +EDPKT+ E + SV++  WKEA+ +E DS++ NHTW++VDLP G K IRCKWI K+K K +GS++++KARLV  G+ QKQG+DYFDTY+ VTKI +IR L
Subjt:  DEDPKTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTL

Query:  IALVAIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLY
        +A+ + H L++HQM+VKT FLNGDL EEIYM QPEGF I GQENKVCKL KSLY LKQAPK+W+EKF+NTL+ NGF  N  DTCV+SKV     ++ICLY
Subjt:  IALVAIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLY

Query:  VDDMLIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGS
        VDDMLI GT++E++ DTK+FLSS F+MKDLGEA+VILG+K+ K  +  SL QSHY+EKILKKF  +D    ++P+D+S HL  N+G+SV+Q EYAK+IGS
Subjt:  VDDMLIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGS

Query:  VMYLMNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV
        VMYLMN TRPDIAY VSRLSRYTHNP   HW AL  L+RYLKGTID+ L ++  S V
Subjt:  VMYLMNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV

TYK06518.1 ty1-copia retrotransposon protein [Cucumis melo var. makuwa]1.4e-29891.85Show/hide
Query:  MNVMLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTI
        MN MLLSSGLSDN+WGEAVLSACF+LNRIPHKRLDKT YELWKGHAPNLSYLKVWGCLAKVP PALKK+TVG KTFDCIFIGYAQNSAAYRFMCLNDKTI
Subjt:  MNVMLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTI

Query:  NESRDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLRCELELRRSKRQRTEKSFGPDFLSTFIVKRRDEIDCNFTNLYLIDEDPK
        NESRDAEFFEHVFPLKQSLYAPSLS RMHDPE   IVSE PVSETV+T NL CELE RRSKRQRTEKSFGPDFLSTFIV+RRDEIDCNFTNL+LIDEDPK
Subjt:  NESRDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLRCELELRRSKRQRTEKSFGPDFLSTFIVKRRDEIDCNFTNLYLIDEDPK

Query:  TYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLIALVA
        TYQE LNSV+S MWKEAIKSELDSL MNHTW+LVDLPMGNKPIRCKWIFKRK KPNG +ERYKARLVVVGYTQKQG+DYFDTYS VTKITTIR LIAL A
Subjt:  TYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLIALVA

Query:  IHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLYVDDML
        IHNLLIHQM+VKT FLNG+LEEEIYMTQPEGFKISGQENKVCKL+KSLY LKQAPK+WYEKFNNTLITNGFKINSSDTCVYSK+ G DCILICLYVDDML
Subjt:  IHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLYVDDML

Query:  IFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGSVMYLM
        IFGTNMELITDTK FLSSHFEMKDLGEA+VILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLK NKGDSVSQPEYAKIIGSVMYLM
Subjt:  IFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGSVMYLM

Query:  NYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV
        NYTRPDIAYAVSRLSRYTHNP+RYHWDALRHLLRYLKGTIDYCLHF KF AV
Subjt:  NYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV

TrEMBL top hitse value%identityAlignment
A0A2N9EQT1 Integrase catalytic domain-containing protein4.2e-18758.7Show/hide
Query:  MNVMLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTI
        MN ML+SSGL  N+WGEA+LSAC VLNR+PHKR+ KT YELW+G  PNL Y KVWGCLAKV  P  ++  +G KT D +FIGYA+NSAAYR + L   TI
Subjt:  MNVMLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTI

Query:  NESRDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLRCELELRRSKRQRTEKSFGPDFLSTFIVKRRDEIDCNFTNLYLIDEDPK
         E+RDA+FFE +FP+K++ ++         P    + S + + E+ +T       ELRRSKR +   +FGPDF++ F                L ++DPK
Subjt:  NESRDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLRCELELRRSKRQRTEKSFGPDFLSTFIVKRRDEIDCNFTNLYLIDEDPK

Query:  TYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLIALVA
        TYQE + SV+++ WK+AI SEL+S++ NHTW+LV+LP G K I  KW+FK+K+K +GS+E++KARLV  GYTQK+GIDYFDTYS VT++TTIR L+A+ +
Subjt:  TYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLIALVA

Query:  IHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLYVDDML
        I+ L+IHQM+VKT FLNGDL+EEIYM QPEGF + GQENKVCKL+KSLY LKQAPK+W+EKF+ TL++NGF +N SD CVYSK  G   ++ICLYVDDML
Subjt:  IHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLYVDDML

Query:  IFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGSVMYLM
        IFGT+M  + +TK FLSS+F+MKDLGEA++ILG++I +N   L+L QSHY+EK+LKKF+ +D  PVRTP+D S HLK N G  VSQ EYAKIIGSVM+LM
Subjt:  IFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGSVMYLM

Query:  NYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV
        N TRPDIAYAVSRLSRYTHNP   HW+A+  LL+YLKGT++  L +    AV
Subjt:  NYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV

A0A2N9H4B0 Uncharacterized protein4.2e-18758.7Show/hide
Query:  MNVMLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTI
        MN ML+SSGL  N+WGEA+LSAC VLNR+PHKR+ KT YELW+G  PNL Y KVWGCLAKV  P  ++  +G KT D +FIGYA+NSAAYR + L   TI
Subjt:  MNVMLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTI

Query:  NESRDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLRCELELRRSKRQRTEKSFGPDFLSTFIVKRRDEIDCNFTNLYLIDEDPK
         E+RDA+FFE +FP+K++ ++         P    + S + + E+ +T       ELRRSKR +   +FGPDF++ F                L ++DPK
Subjt:  NESRDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLRCELELRRSKRQRTEKSFGPDFLSTFIVKRRDEIDCNFTNLYLIDEDPK

Query:  TYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLIALVA
        TYQE + SV+++ WK+AI SEL+S++ NHTW+LV+LP G K I  KW+FK+K+K +GS+E++KARLV  GYTQK+GIDYFDTYS VT++TTIR L+A+ +
Subjt:  TYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLIALVA

Query:  IHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLYVDDML
        I+ L+IHQM+VKT FLNGDL+EEIYM QPEGF + GQENKVCKL+KSLY LKQAPK+W+EKF+ TL++NGF +N SD CVYSK  G   ++ICLYVDDML
Subjt:  IHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLYVDDML

Query:  IFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGSVMYLM
        IFGT+M  + +TK FLSS+F+MKDLGEA++ILG++I +N   L+L QSHY+EK+LKKF+ +D  PVRTP+D S HLK N G  VSQ EYAKIIGSVM+LM
Subjt:  IFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGSVMYLM

Query:  NYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV
        N TRPDIAYAVSRLSRYTHNP   HW+A+  LL+YLKGT++  L +    AV
Subjt:  NYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV

A0A5D3C5T2 Ty1-copia retrotransposon protein7.0e-29991.85Show/hide
Query:  MNVMLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTI
        MN MLLSSGLSDN+WGEAVLSACF+LNRIPHKRLDKT YELWKGHAPNLSYLKVWGCLAKVP PALKK+TVG KTFDCIFIGYAQNSAAYRFMCLNDKTI
Subjt:  MNVMLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTI

Query:  NESRDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLRCELELRRSKRQRTEKSFGPDFLSTFIVKRRDEIDCNFTNLYLIDEDPK
        NESRDAEFFEHVFPLKQSLYAPSLS RMHDPE   IVSE PVSETV+T NL CELE RRSKRQRTEKSFGPDFLSTFIV+RRDEIDCNFTNL+LIDEDPK
Subjt:  NESRDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLRCELELRRSKRQRTEKSFGPDFLSTFIVKRRDEIDCNFTNLYLIDEDPK

Query:  TYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLIALVA
        TYQE LNSV+S MWKEAIKSELDSL MNHTW+LVDLPMGNKPIRCKWIFKRK KPNG +ERYKARLVVVGYTQKQG+DYFDTYS VTKITTIR LIAL A
Subjt:  TYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLIALVA

Query:  IHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLYVDDML
        IHNLLIHQM+VKT FLNG+LEEEIYMTQPEGFKISGQENKVCKL+KSLY LKQAPK+WYEKFNNTLITNGFKINSSDTCVYSK+ G DCILICLYVDDML
Subjt:  IHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLYVDDML

Query:  IFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGSVMYLM
        IFGTNMELITDTK FLSSHFEMKDLGEA+VILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLK NKGDSVSQPEYAKIIGSVMYLM
Subjt:  IFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGSVMYLM

Query:  NYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV
        NYTRPDIAYAVSRLSRYTHNP+RYHWDALRHLLRYLKGTIDYCLHF KF AV
Subjt:  NYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV

A0A5N6PGV2 Reverse transcriptase Ty1/copia-type domain-containing protein4.7e-20263.11Show/hide
Query:  MLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTINES
        ML+SSG  DN+WGEAVLSAC VLNR+PHK LDKT YELWKG++PNL +LKVWGCLAKV  P  K+  +GSKT D +FIGYAQNSAAYRFM L+D++I ES
Subjt:  MLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTINES

Query:  RDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLR--CELELRRSKRQRTEKSFGPDFLSTFIVK--RRDEIDCNFTNLYLIDEDP
         +AEFFE  FPLK+                  I+S   VS+   +S+++    +E RRSKR RTE SFGPDFL++F+ +    D +  +F +++L++ DP
Subjt:  RDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLR--CELELRRSKRQRTEKSFGPDFLSTFIVK--RRDEIDCNFTNLYLIDEDP

Query:  KTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLIALV
        KTYQE + SV++++WKEAIKSE+DS++ NHTW+L DLP GNKPI  KWIFK+K++P+G++++YKARLV+ G+TQK G+DYFDTYS VTKITTIR+LI++ 
Subjt:  KTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLIALV

Query:  AIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLYVDDM
        AI+ LLIHQM+VKT FLNGDL+EEIYM QPEGF +SG E+KVCKL+KSLY LKQAPKKWYEKF+ TL  +G+ +N+SD+CVYSK      +LICLYVDDM
Subjt:  AIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLYVDDM

Query:  LIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGSVMYL
        LIFG +M  I  TK FLSS FEMKDLGEA+VILGVKI++    +SLCQSHY+E++LKKFD F+++PV+TP+D S  LK N  +SVSQ EYAKIIGSVM+L
Subjt:  LIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGSVMYL

Query:  MNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV
        MNYTRPDIAY VSRLSRYTHNP + HW A+  L+RYL+GT++ CLH+NKF AV
Subjt:  MNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV

A0A6A2Y4J7 Uncharacterized protein7.0e-19059.78Show/hide
Query:  VMLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTINE
        ++L  +GLSDN+WGEA+LSA  +LNR+ HK+LD T YELWKG+ PNL YLKVWGCLAKV  P  KKST+G KT D +FIGY  NS AYRFM L D +I E
Subjt:  VMLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTINE

Query:  SRDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLRCELELRRSKRQRTEKSFGPDFLSTFI--VKRRDEIDCNFTNLYLIDEDPK
        SRD  FFE+ FPLK+         RM+       +  T  S    +S  + + E RRSKRQR E SFGPDF+ +FI  +   D I      ++L+DEDPK
Subjt:  SRDAEFFEHVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLRCELELRRSKRQRTEKSFGPDFLSTFI--VKRRDEIDCNFTNLYLIDEDPK

Query:  TYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLIALVA
         ++E + S+ +S WK A+  EL+S++ NHTW+LVDLP G KPI  KW+F++K++P+GS++RYKARLVV G+TQ+ G+DYFDTYS VTKI+TIR L AL +
Subjt:  TYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLIALVA

Query:  IHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLYVDDML
        IH L +HQM+VKT FLNGDL+EEIYM QP GF+  G E KV +LKKSLY LKQAPK+WYEKF+ T+++ GF +N SD CVYSK+F  +C++I LYVDDML
Subjt:  IHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLYVDDML

Query:  IFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGSVMYLM
        IF +N+E I   K FLS+ FEM  LGE +VILGV++ K +   SLCQ+HY++K+LKKFDSFDV PVRTP+D S HL  NKG SVSQ EYAK+IGS+M+LM
Subjt:  IFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGSVMYLM

Query:  NYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV
        NYTRPDIAYAVSRLSRYTHNP   HW AL+ LL+YLKGT+D+ L F  F AV
Subjt:  NYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAV

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-7231.6Show/hide
Query:  MLLSSGLSDNIWGEAVLSACFVLNRIPHKRL---DKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQN------SAAYRFMC
        M+  + L  + WGEAVL+A +++NRIP + L    KT YE+W    P L +L+V+G    V     K+     K+F  IF+GY  N      +   +F+ 
Subjt:  MLLSSGLSDNIWGEAVLSACFVLNRIPHKRL---DKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQN------SAAYRFMC

Query:  LNDKTINESR----DAEFFEHVFPLKQSLYA-----PSLSKRMHDPE---------NTSIVSETPVSETVNTSN----------------------LRCE
          D  ++E+      A  FE VF LK S  +     P+ S+++   E         N   + ++  SE  N  N                      L+  
Subjt:  LNDKTINESR----DAEFFEHVFPLKQSLYA-----PSLSKRMHDPE---------NTSIVSETPVSETVNTSN----------------------LRCE

Query:  LE-----LRRSKRQRTE------------------------KSFGPDFLS----TFIVKRRDE----------------IDCNFTNLYLIDED-PKTYQE
         E     L  SK+++ +                        K  G D  +      I+ RR E                ++    N + I  D P ++ E
Subjt:  LE-----LRRSKRQRTE------------------------KSFGPDFLS----TFIVKRRDE----------------IDCNFTNLYLIDED-PKTYQE

Query:  MLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLIALVAIHNL
        +    + S W+EAI +EL++  +N+TW +   P     +  +W+F  K    G+  RYKARLV  G+TQK  IDY +T++ V +I++ R +++LV  +NL
Subjt:  MLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLIALVAIHNL

Query:  LIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFG--VDCILICLYVDDMLIF
         +HQM+VKT FLNG L+EEIYM  P+G  IS   + VCKL K++Y LKQA + W+E F   L    F  +S D C+Y    G   + I + LYVDD++I 
Subjt:  LIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFG--VDCILICLYVDDMLIF

Query:  GTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGSVMYLMNY
          +M  + + K +L   F M DL E    +G++I   +  + L QS YV+KIL KF+  + + V TP  +  + ++   D         +IG +MY+M  
Subjt:  GTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGSVMYLMNY

Query:  TRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAVKDIVM
        TRPD+  AV+ LSRY+   +   W  L+ +LRYLKGTID  L F K  A ++ ++
Subjt:  TRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFSAVKDIVM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.5e-9838.67Show/hide
Query:  MLLSSGLSDNIWGEAVLSACFVLNRIPHKRLD-KTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTINE
        ML  + L  + WGEAV +AC+++NR P   L  +    +W     + S+LKV+GC A    P  +++ +  K+  CIFIGY      YR      K +  
Subjt:  MLLSSGLSDNIWGEAVLSACFVLNRIPHKRLD-KTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTINE

Query:  SRDAEFFEHVFP-------------LKQSLYAPSLSKRMHDPENT----SIVSETP-------------VSETVN-TSNLRCELELRRSKRQRTEKSFGP
        SRD  F E                 +   +  PS S      E+T    S   E P             V E  + T        LRRS+R R E    P
Subjt:  SRDAEFFEHVFP-------------LKQSLYAPSLSKRMHDPENT----SIVSETP-------------VSETVN-TSNLRCELELRRSKRQRTEKSFGP

Query:  DFLSTFIVKRRDEIDCNFTNLYLIDEDPKTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGY
           ST  V   D            D +P++ +E+L+  E +   +A++ E++SL  N T+ LV+LP G +P++CKW+FK K   +  + RYKARLVV G+
Subjt:  DFLSTFIVKRRDEIDCNFTNLYLIDEDPKTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGY

Query:  TQKQGIDYFDTYSSVTKITTIRTLIALVAIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGF
         QK+GID+ + +S V K+T+IRT+++L A  +L + Q++VKT FL+GDLEEEIYM QPEGF+++G+++ VCKL KSLY LKQAP++WY KF++ + +  +
Subjt:  TQKQGIDYFDTYSSVTKITTIRTLIALVAIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGF

Query:  KINSSDTCVYSKVFGV-DCILICLYVDDMLIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTS--LSLCQSHYVEKILKKFDSFDVSPVRT
            SD CVY K F   + I++ LYVDDMLI G +  LI   K  LS  F+MKDLG A  ILG+KI + +TS  L L Q  Y+E++L++F+  +  PV T
Subjt:  KINSSDTCVYSKVFGV-DCILICLYVDDMLIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTS--LSLCQSHYVEKILKKFDSFDVSPVRT

Query:  PFDASKHLKMNK---------GDSVSQPEYAKIIGSVMYLMNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHF
        P   + HLK++K           ++++  Y+  +GS+MY M  TRPDIA+AV  +SR+  NP + HW+A++ +LRYL+GT   CL F
Subjt:  PFDASKHLKMNK---------GDSVSQPEYAKIIGSVMYLMNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHF

P25600 Putative transposon Ty5-1 protein YCL074W5.1e-2831.54Show/hide
Query:  MNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLYVDDMLIFGTNMEL
        M+V T FLN  ++E IY+ QP GF      + V +L   +Y LKQAP  W E  NNTL   GF  +  +  +Y +      I I +YVDD+L+   + ++
Subjt:  MNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLYVDDMLIFGTNMEL

Query:  ITDTKLFLSSHFEMKDLGEANVILGVKIRKNKT-SLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQ-PEYAKIIGSVMYLMNYTRPD
            K  L+  + MKDLG+ +  LG+ I ++    ++L    Y+ K   + +       +TP   SK L       +     Y  I+G +++  N  RPD
Subjt:  ITDTKLFLSSHFEMKDLGEANVILGVKIRKNKT-SLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQ-PEYAKIIGSVMYLMNYTRPD

Query:  IAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFS-------------AVKDIVMQTGSQIMMRLTLLVG
        I+Y VS LSR+   P   H ++ R +LRYL  T   CL +   S             A+ D+   TG  +    TLL G
Subjt:  IAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFS-------------AVKDIVMQTGSQIMMRLTLLVG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-6839.44Show/hide
Query:  DEDPKTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPI-RCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRT
        + +P+T    + +++   W+ A+ SE+++ + NHTWDLV  P  +  I  C+WIF +K   +GS+ RYKARLV  GY Q+ G+DY +T+S V K T+IR 
Subjt:  DEDPKTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPI-RCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRT

Query:  LIALVAIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICL
        ++ +    +  I Q++V   FL G L +++YM+QP GF    + N VCKL+K+LY LKQAP+ WY +  N L+T GF  + SDT ++    G   + + +
Subjt:  LIALVAIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICL

Query:  YVDDMLIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQP-EYAKII
        YVDD+LI G +  L+ +T   LS  F +KD  E +  LG++ ++  T L L Q  Y+  +L + +     PV TP   S  L +  G  ++ P EY  I+
Subjt:  YVDDMLIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQP-EYAKII

Query:  GSVMYLMNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNK
        GS+ YL  +TRPDI+YAV+RLS++ H P   H  AL+ +LRYL GT ++ +   K
Subjt:  GSVMYLMNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.9e-6738.59Show/hide
Query:  DEDPKTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPI-RCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRT
        + +P+T    + +++   W++A+ SE+++ + NHTWDLV  P  +  I  C+WIF +K   +GS+ RYKARLV  GY Q+ G+DY +T+S V K T+IR 
Subjt:  DEDPKTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPI-RCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRT

Query:  LIALVAIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICL
        ++ +    +  I Q++V   FL G L +E+YM+QP GF    + + VC+L+K++Y LKQAP+ WY +    L+T GF  + SDT ++    G   I + +
Subjt:  LIALVAIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICL

Query:  YVDDMLIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQP-EYAKII
        YVDD+LI G +  L+  T   LS  F +K+  + +  LG++ ++    L L Q  Y   +L + +     PV TP   S  L ++ G  +  P EY  I+
Subjt:  YVDDMLIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQP-EYAKII

Query:  GSVMYLMNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNK
        GS+ YL  +TRPD++YAV+RLS+Y H P   HW+AL+ +LRYL GT D+ +   K
Subjt:  GSVMYLMNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNK

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.3e-6837.92Show/hide
Query:  EDPKTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLI
        ++P TY E   + E  +W  A+  E+ ++   HTW++  LP   KPI CKW++K K   +G++ERYKARLV  GYTQ++GID+ +T+S V K+T+++ ++
Subjt:  EDPKTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLI

Query:  ALVAIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKI----SGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILI
        A+ AI+N  +HQ+++   FLNGDL+EEIYM  P G+      S   N VC LKKS+Y LKQA ++W+ KF+ TLI  GF  + SD   + K+     + +
Subjt:  ALVAIHNLLIHQMNVKTTFLNGDLEEEIYMTQPEGFKI----SGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILI

Query:  CLYVDDMLIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNK-GDSVSQPEYAK
         +YVDD++I   N   + + K  L S F+++DLG     LG++I ++   +++CQ  Y   +L +       P   P D S     +  GD V    Y +
Subjt:  CLYVDDMLIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNK-GDSVSQPEYAK

Query:  IIGSVMYLMNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFN
        +IG +MYL   TR DI++AV++LS+++  P   H  A+  +L Y+KGT+   L ++
Subjt:  IIGSVMYLMNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFN

ATMG00810.1 DNA/RNA polymerases superfamily protein5.6e-2235.85Show/hide
Query:  ICLYVDDMLIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAK
        + LYVDD+L+ G++  L+      LSS F MKDLG  +  LG++I+ + + L L Q+ Y E+IL      D  P+ TP     +  ++        ++  
Subjt:  ICLYVDDMLIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAK

Query:  IIGSVMYLMNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFS
        I+G++ YL   TRPDI+YAV+ + +  H P    +D L+ +LRY+KGTI + L+ +K S
Subjt:  IIGSVMYLMNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.6e-1639.22Show/hide
Query:  IDEDPKTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRT
        I ++PK+   ++ +++   W +A++ ELD+L  N TW LV  P+    + CKW+FK K+  +G+++R KARLV  G+ Q++GI + +TYS V +  TIRT
Subjt:  IDEDPKTYQEMLNSVESSMWKEAIKSELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRT

Query:  LI
        ++
Subjt:  LI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGTAATGTTATTAAGCTCCGGCCTATCTGATAATATATGGGGAGAAGCCGTGTTGTCTGCGTGTTTTGTTCTTAACAGGATTCCCCACAAAAGGCTAGACAAAAC
TTCCTACGAACTCTGGAAAGGACATGCACCAAATCTTTCCTACTTGAAGGTCTGGGGATGCTTGGCTAAGGTACCATTTCCTGCATTGAAGAAATCTACGGTAGGATCTA
AAACTTTTGACTGCATTTTTATTGGGTATGCTCAAAATAGTGCTGCATATAGGTTTATGTGTTTAAATGATAAAACTATAAACGAATCTAGAGATGCAGAATTCTTTGAG
CATGTATTTCCGTTAAAGCAATCATTGTATGCTCCTAGCCTATCTAAAAGAATGCATGATCCTGAAAACACCTCAATCGTTAGTGAAACACCTGTTTCTGAAACTGTTAA
TACTTCAAACCTAAGATGTGAATTAGAACTTAGGAGAAGTAAAAGACAGAGAACTGAGAAAAGTTTCGGTCCAGATTTCCTAAGCACTTTCATAGTGAAAAGGCGTGATG
AAATTGACTGTAACTTCACAAACTTGTACTTAATAGATGAGGATCCTAAAACTTACCAAGAAATGCTGAACTCTGTAGAGTCAAGTATGTGGAAAGAGGCCATTAAAAGT
GAACTGGATTCACTGGTCATGAATCATACATGGGACCTAGTGGACCTTCCTATGGGAAACAAGCCAATTAGATGTAAGTGGATCTTCAAAAGAAAAATAAAACCAAATGG
ATCAATGGAAAGATACAAGGCTAGATTAGTGGTAGTAGGGTATACCCAGAAACAAGGCATTGATTATTTTGATACATATTCCTCTGTAACTAAGATAACCACAATTAGGA
CCTTGATTGCATTAGTCGCAATACATAACCTTCTTATTCACCAAATGAACGTAAAAACAACTTTTTTAAATGGTGACCTAGAAGAGGAAATTTATATGACACAACCAGAA
GGCTTTAAAATCTCTGGGCAAGAAAACAAAGTGTGTAAACTAAAAAAATCCTTATACAATCTCAAGCAAGCTCCCAAGAAGTGGTATGAAAAGTTCAATAATACGTTGAT
AACCAATGGATTTAAAATAAATTCCTCTGACACGTGTGTTTATTCAAAGGTGTTTGGAGTTGATTGCATATTAATATGTCTATATGTTGATGACATGTTAATCTTTGGAA
CAAACATGGAGTTAATAACTGATACTAAGTTATTTCTCTCATCACACTTTGAAATGAAAGACCTAGGAGAAGCAAACGTAATCCTAGGTGTTAAAATCAGGAAAAACAAA
ACTAGTTTGTCTCTATGTCAATCTCACTACGTGGAGAAAATACTAAAGAAGTTTGATTCCTTTGATGTTTCTCCTGTGAGAACTCCCTTTGACGCTAGTAAACATCTTAA
GATGAATAAAGGAGATAGTGTGTCTCAACCTGAATATGCAAAGATCATAGGTAGTGTGATGTATTTAATGAATTACACTAGACCGGATATTGCATATGCTGTCAGTAGAT
TAAGTAGATATACACATAATCCTGATAGATACCACTGGGATGCCTTACGCCATCTATTGAGATATCTTAAAGGGACGATAGATTATTGTTTACACTTCAACAAATTTTCT
GCCGTGAAGGATATTGTGATGCAAACTGGGTCGCAGATAATGATGAGGTTAACTCTACTAGTGGGTATGTATTTTTGCTCGGAGGTGGAGCAATATCTTGGAAGTCTGCA
AAATAGACTTGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGAATGTAATGTTATTAAGCTCCGGCCTATCTGATAATATATGGGGAGAAGCCGTGTTGTCTGCGTGTTTTGTTCTTAACAGGATTCCCCACAAAAGGCTAGACAAAAC
TTCCTACGAACTCTGGAAAGGACATGCACCAAATCTTTCCTACTTGAAGGTCTGGGGATGCTTGGCTAAGGTACCATTTCCTGCATTGAAGAAATCTACGGTAGGATCTA
AAACTTTTGACTGCATTTTTATTGGGTATGCTCAAAATAGTGCTGCATATAGGTTTATGTGTTTAAATGATAAAACTATAAACGAATCTAGAGATGCAGAATTCTTTGAG
CATGTATTTCCGTTAAAGCAATCATTGTATGCTCCTAGCCTATCTAAAAGAATGCATGATCCTGAAAACACCTCAATCGTTAGTGAAACACCTGTTTCTGAAACTGTTAA
TACTTCAAACCTAAGATGTGAATTAGAACTTAGGAGAAGTAAAAGACAGAGAACTGAGAAAAGTTTCGGTCCAGATTTCCTAAGCACTTTCATAGTGAAAAGGCGTGATG
AAATTGACTGTAACTTCACAAACTTGTACTTAATAGATGAGGATCCTAAAACTTACCAAGAAATGCTGAACTCTGTAGAGTCAAGTATGTGGAAAGAGGCCATTAAAAGT
GAACTGGATTCACTGGTCATGAATCATACATGGGACCTAGTGGACCTTCCTATGGGAAACAAGCCAATTAGATGTAAGTGGATCTTCAAAAGAAAAATAAAACCAAATGG
ATCAATGGAAAGATACAAGGCTAGATTAGTGGTAGTAGGGTATACCCAGAAACAAGGCATTGATTATTTTGATACATATTCCTCTGTAACTAAGATAACCACAATTAGGA
CCTTGATTGCATTAGTCGCAATACATAACCTTCTTATTCACCAAATGAACGTAAAAACAACTTTTTTAAATGGTGACCTAGAAGAGGAAATTTATATGACACAACCAGAA
GGCTTTAAAATCTCTGGGCAAGAAAACAAAGTGTGTAAACTAAAAAAATCCTTATACAATCTCAAGCAAGCTCCCAAGAAGTGGTATGAAAAGTTCAATAATACGTTGAT
AACCAATGGATTTAAAATAAATTCCTCTGACACGTGTGTTTATTCAAAGGTGTTTGGAGTTGATTGCATATTAATATGTCTATATGTTGATGACATGTTAATCTTTGGAA
CAAACATGGAGTTAATAACTGATACTAAGTTATTTCTCTCATCACACTTTGAAATGAAAGACCTAGGAGAAGCAAACGTAATCCTAGGTGTTAAAATCAGGAAAAACAAA
ACTAGTTTGTCTCTATGTCAATCTCACTACGTGGAGAAAATACTAAAGAAGTTTGATTCCTTTGATGTTTCTCCTGTGAGAACTCCCTTTGACGCTAGTAAACATCTTAA
GATGAATAAAGGAGATAGTGTGTCTCAACCTGAATATGCAAAGATCATAGGTAGTGTGATGTATTTAATGAATTACACTAGACCGGATATTGCATATGCTGTCAGTAGAT
TAAGTAGATATACACATAATCCTGATAGATACCACTGGGATGCCTTACGCCATCTATTGAGATATCTTAAAGGGACGATAGATTATTGTTTACACTTCAACAAATTTTCT
GCCGTGAAGGATATTGTGATGCAAACTGGGTCGCAGATAATGATGAGGTTAACTCTACTAGTGGGTATGTATTTTTGCTCGGAGGTGGAGCAATATCTTGGAAGTCTGCA
AAATAGACTTGTATAG
Protein sequenceShow/hide protein sequence
MNVMLLSSGLSDNIWGEAVLSACFVLNRIPHKRLDKTSYELWKGHAPNLSYLKVWGCLAKVPFPALKKSTVGSKTFDCIFIGYAQNSAAYRFMCLNDKTINESRDAEFFE
HVFPLKQSLYAPSLSKRMHDPENTSIVSETPVSETVNTSNLRCELELRRSKRQRTEKSFGPDFLSTFIVKRRDEIDCNFTNLYLIDEDPKTYQEMLNSVESSMWKEAIKS
ELDSLVMNHTWDLVDLPMGNKPIRCKWIFKRKIKPNGSMERYKARLVVVGYTQKQGIDYFDTYSSVTKITTIRTLIALVAIHNLLIHQMNVKTTFLNGDLEEEIYMTQPE
GFKISGQENKVCKLKKSLYNLKQAPKKWYEKFNNTLITNGFKINSSDTCVYSKVFGVDCILICLYVDDMLIFGTNMELITDTKLFLSSHFEMKDLGEANVILGVKIRKNK
TSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLKMNKGDSVSQPEYAKIIGSVMYLMNYTRPDIAYAVSRLSRYTHNPDRYHWDALRHLLRYLKGTIDYCLHFNKFS
AVKDIVMQTGSQIMMRLTLLVGMYFCSEVEQYLGSLQNRLV