; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0071771 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0071771
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationCMiso1.1chr03:18349108..18350472
RNA-Seq ExpressionCmc03g0071771
SyntenyCmc03g0071771
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0045944 - positive regulation of transcription by RNA polymerase II (biological process)
GO:0060320 - rejection of self pollen (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0000977 - RNA polymerase II regulatory region sequence-specific DNA binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAD6453934.1 hypothetical protein E3N88_08640 [Mikania micrantha]4.8e-18569.14Show/hide
Query:  DGIDCNFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDT
        D +  +F +++L++ DPKTYQEA+ SVD+++WKE IKSE+DS++ NH WEL DLP GNKPI  KWIFK+K +P+G++++YKARLV+ G+TQK G+DYFDT
Subjt:  DGIDCNFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDT

Query:  YSPVTTITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYS
        YSPVT ITTIR+LI++AAI+ LLIHQMDVKTAFLNGDL+EEIYM QPEGF +S  E+KVCKLRKSLYGLKQAPK+WYEKF+ TL  +G+ +N+SD CVYS
Subjt:  YSPVTTITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYS

Query:  KMVGADCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGD
        K      +LICLYVDDMLIFG +M  I+ TK FLSS FEMKDLGEADVILGVKI++    +SLCQSHY+E++L KFD F+++PV+TP+D S  LKKN  +
Subjt:  KMVGADCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGD

Query:  SVSQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGA
        SVSQ EYAKIIGSVM+LMNYTRPDIAY VS LSRYTHNP++ HW  +  L+RYL+G ++ CLH+NKFP  LEGYCDANWVTDNDEV+STSGYVF++ GGA
Subjt:  SVSQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGA

Query:  ISWKSTKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCG
        ISWKS+KQTCIARSTME EFIALELAGQEAEW++ L+ D+P  G
Subjt:  ISWKSTKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCG

KAG7551885.1 Ribonuclease H-like superfamily [Arabidopsis thaliana x Arabidopsis arenosa]1.2e-17064.96Show/hide
Query:  DGIDCNFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDT
        D I   F   +LI+EDPKT+ EA+ SVD+  WKE + +E DS++ NH WE+VDLP G K IRCKWI K+K K +GSI+++KARLV  G+ QKQGVDYFDT
Subjt:  DGIDCNFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDT

Query:  YSPVTTITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYS
        Y+PVT I +IR L+A+A+ H L++HQMDVKTAFLNGDL EEIYM QPEGF I  QENKVCKL KSLYGLKQAPKQW+EKF+NTL+ NGF  N  D CV+S
Subjt:  YSPVTTITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYS

Query:  KMVGADCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGD
        K+     ++ICLYVDDMLI GT++E++ DTK+FLSS F+MKDLGEADVILG+K+ K  +  SL QSHY+EKIL KF  +D    ++P+D+S HL +N+G+
Subjt:  KMVGADCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGD

Query:  SVSQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGA
        SV+Q EYAK+IGSVMYLMN TRPDIAY VS LSRYTHNP   HW  L  L+RYLKG ID+ L ++     LE YCDANW +DNDEVNSTSG+VF L GGA
Subjt:  SVSQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGA

Query:  ISWKSTKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVP
        I+WKSTKQTCIARSTME E IALELAGQEAEW++NL+ D P+ G   P
Subjt:  ISWKSTKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVP

KAG7571733.1 Integrase catalytic core [Arabidopsis suecica]4.0e-17165.18Show/hide
Query:  DGIDCNFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDT
        D I   F   +LI+EDPKT+ EA+ SVD+  WKE + +E DS++ NH WE+VDLP G K IRCKWI K+K K +GSI+++KARLV  G+ QKQGVDYFDT
Subjt:  DGIDCNFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDT

Query:  YSPVTTITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYS
        Y+PVT I +IR L+A+A+ H L++HQMDVKTAFLNGDL EEIYM QPEGF I  QENKVCKL KSLYGLKQAPKQW+EKF+NTL+ NGF  N  D CV+S
Subjt:  YSPVTTITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYS

Query:  KMVGADCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGD
        K+     ++ICLYVDDMLI GT++E++ DTK+FLSS F+MKDLGEADVILG+K+ K  +  SL QSHY+EKIL KF  +D    ++P+D+S HL +N+G+
Subjt:  KMVGADCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGD

Query:  SVSQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGA
        SV+Q EYAK+IGSVMYLMN TRPDIAYAVS LSRYTHNP   HW  L  L+RYLKG ID+ L ++     LE YCDANW +DNDEVNSTSG+VF L GGA
Subjt:  SVSQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGA

Query:  ISWKSTKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVP
        I+WKSTKQTCIARSTME E IALELAGQEAEW++NL+ D P+ G   P
Subjt:  ISWKSTKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVP

KAG7592238.1 Ribonuclease H-like superfamily [Arabidopsis thaliana x Arabidopsis arenosa]1.2e-17064.96Show/hide
Query:  DGIDCNFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDT
        D I   F   +LI+EDPKT+ EA+ SVD+  WKE + +E DS++ NH WE+VDLP G K IRCKWI K+K K +GSI+++KARLV  G+ QKQGVDYFDT
Subjt:  DGIDCNFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDT

Query:  YSPVTTITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYS
        Y+PVT I +IR L+A+A+ H L++HQMDVKTAFLNGDL EEIYM QPEGF I  QENKVCKL KSLYGLKQAPKQW+EKF+NTL+ NGF  N  D CV+S
Subjt:  YSPVTTITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYS

Query:  KMVGADCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGD
        K+     ++ICLYVDDMLI GT++E++ DTK+FLSS F+MKDLGEADVILG+K+ K  +  SL QSHY+EKIL KF  +D    ++P+D+S HL +N+G+
Subjt:  KMVGADCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGD

Query:  SVSQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGA
        SV+Q EYAK+IGSVMYLMN TRPDIAY VS LSRYTHNP   HW  L  L+RYLKG ID+ L ++     LE YCDANW +DNDEVNSTSG+VF L GGA
Subjt:  SVSQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGA

Query:  ISWKSTKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVP
        I+WKSTKQTCIARSTME E IALELAGQEAEW++NL+ D P+ G   P
Subjt:  ISWKSTKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVP

TYK06518.1 ty1-copia retrotransposon protein [Cucumis melo var. makuwa]4.7e-24994.47Show/hide
Query:  RDGIDCNFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFD
        RD IDCNFTNL+LIDEDPKTYQEALNSVDS MWKE IKSELDSL MNH WELVDLPMGNKPIRCKWIFKRKTKPNG IERYKARLVVVGYTQKQGVDYFD
Subjt:  RDGIDCNFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFD

Query:  TYSPVTTITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVY
        TYSPVT ITTIRALIALAAIHNLLIHQMDVKTAFLNG+LEEEIYMTQPEGFKIS QENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSD CVY
Subjt:  TYSPVTTITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVY

Query:  SKMVGADCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKG
        SKMVGADCILICLYVDDMLIFGTNMELI+DTK FLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKIL KFDSFDVSPVRTPFDASKHLKKNKG
Subjt:  SKMVGADCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKG

Query:  DSVSQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGG
        DSVSQPEYAKIIGSVMYLMNYTRPDIAYAVS LSRYTHNPNRYHWD LRHLLRYLKG IDYCLHF KFP  LEGYCDANWVTDNDEVNSTSGYVFLL GG
Subjt:  DSVSQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGG

Query:  AISWKSTKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVPVSI
        AISWKSTKQTCIARSTME EFIALELAGQEAEWIKNL+ DVPL GTSVPVSI
Subjt:  AISWKSTKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVPVSI

TrEMBL top hitse value%identityAlignment
A0A2N9EQT1 Integrase catalytic domain-containing protein2.3e-17766.82Show/hide
Query:  NFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDTYSPVT
        +F   +L ++DPKTYQEA+ SVD++ WK+ I SEL+S++ NH WELV+LP G K I  KW+FK+K K +GSIE++KARLV  GYTQK+G+DYFDTYSPVT
Subjt:  NFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDTYSPVT

Query:  TITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYSKMVGA
         +TTIR L+A+A+I+ L+IHQMDVKTAFLNGDL+EEIYM QPEGF +  QENKVCKLRKSLYGLKQAPKQW+EKF+ TL++NGF +N SD CVYSK  GA
Subjt:  TITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYSKMVGA

Query:  DCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGDSVSQP
          ++ICLYVDDMLIFGT+M  + +TK FLSS+F+MKDLGEAD+ILG++I +N   L+L QSHY+EK+L KF+ +D  PVRTP+D S HLKKN G  VSQ 
Subjt:  DCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGDSVSQP

Query:  EYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGAISWKS
        EYAKIIGSVM+LMN TRPDIAYAVS LSRYTHNP   HW+ +  LL+YLKG ++  L +   P  LEGYCDANW++DNDE NSTSGYVF L GGAISWKS
Subjt:  EYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGAISWKS

Query:  TKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVP
        +KQTC ARSTME EF+ALE AG EAEW++NL+ D+PL    +P
Subjt:  TKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVP

A0A2N9H4B0 Uncharacterized protein2.3e-17766.82Show/hide
Query:  NFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDTYSPVT
        +F   +L ++DPKTYQEA+ SVD++ WK+ I SEL+S++ NH WELV+LP G K I  KW+FK+K K +GSIE++KARLV  GYTQK+G+DYFDTYSPVT
Subjt:  NFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDTYSPVT

Query:  TITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYSKMVGA
         +TTIR L+A+A+I+ L+IHQMDVKTAFLNGDL+EEIYM QPEGF +  QENKVCKLRKSLYGLKQAPKQW+EKF+ TL++NGF +N SD CVYSK  GA
Subjt:  TITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYSKMVGA

Query:  DCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGDSVSQP
          ++ICLYVDDMLIFGT+M  + +TK FLSS+F+MKDLGEAD+ILG++I +N   L+L QSHY+EK+L KF+ +D  PVRTP+D S HLKKN G  VSQ 
Subjt:  DCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGDSVSQP

Query:  EYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGAISWKS
        EYAKIIGSVM+LMN TRPDIAYAVS LSRYTHNP   HW+ +  LL+YLKG ++  L +   P  LEGYCDANW++DNDE NSTSGYVF L GGAISWKS
Subjt:  EYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGAISWKS

Query:  TKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVP
        +KQTC ARSTME EF+ALE AG EAEW++NL+ D+PL    +P
Subjt:  TKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVP

A0A5D3C5T2 Ty1-copia retrotransposon protein2.3e-24994.47Show/hide
Query:  RDGIDCNFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFD
        RD IDCNFTNL+LIDEDPKTYQEALNSVDS MWKE IKSELDSL MNH WELVDLPMGNKPIRCKWIFKRKTKPNG IERYKARLVVVGYTQKQGVDYFD
Subjt:  RDGIDCNFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFD

Query:  TYSPVTTITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVY
        TYSPVT ITTIRALIALAAIHNLLIHQMDVKTAFLNG+LEEEIYMTQPEGFKIS QENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSD CVY
Subjt:  TYSPVTTITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVY

Query:  SKMVGADCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKG
        SKMVGADCILICLYVDDMLIFGTNMELI+DTK FLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKIL KFDSFDVSPVRTPFDASKHLKKNKG
Subjt:  SKMVGADCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKG

Query:  DSVSQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGG
        DSVSQPEYAKIIGSVMYLMNYTRPDIAYAVS LSRYTHNPNRYHWD LRHLLRYLKG IDYCLHF KFP  LEGYCDANWVTDNDEVNSTSGYVFLL GG
Subjt:  DSVSQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGG

Query:  AISWKSTKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVPVSI
        AISWKSTKQTCIARSTME EFIALELAGQEAEWIKNL+ DVPL GTSVPVSI
Subjt:  AISWKSTKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVPVSI

A0A5N6PGV2 Reverse transcriptase Ty1/copia-type domain-containing protein2.3e-18569.14Show/hide
Query:  DGIDCNFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDT
        D +  +F +++L++ DPKTYQEA+ SVD+++WKE IKSE+DS++ NH WEL DLP GNKPI  KWIFK+K +P+G++++YKARLV+ G+TQK G+DYFDT
Subjt:  DGIDCNFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDT

Query:  YSPVTTITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYS
        YSPVT ITTIR+LI++AAI+ LLIHQMDVKTAFLNGDL+EEIYM QPEGF +S  E+KVCKLRKSLYGLKQAPK+WYEKF+ TL  +G+ +N+SD CVYS
Subjt:  YSPVTTITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYS

Query:  KMVGADCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGD
        K      +LICLYVDDMLIFG +M  I+ TK FLSS FEMKDLGEADVILGVKI++    +SLCQSHY+E++L KFD F+++PV+TP+D S  LKKN  +
Subjt:  KMVGADCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGD

Query:  SVSQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGA
        SVSQ EYAKIIGSVM+LMNYTRPDIAY VS LSRYTHNP++ HW  +  L+RYL+G ++ CLH+NKFP  LEGYCDANWVTDNDEV+STSGYVF++ GGA
Subjt:  SVSQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGA

Query:  ISWKSTKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCG
        ISWKS+KQTCIARSTME EFIALELAGQEAEW++ L+ D+P  G
Subjt:  ISWKSTKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPLCG

A0A6A2Y4J7 Uncharacterized protein2.5e-16366.19Show/hide
Query:  DGIDCNFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDT
        D I      ++L+DEDPK ++EA+ S+++S WK  +  EL+S++ NH WELVDLP G KPI  KW+F++K +P+GSI+RYKARLVV G+TQ+ G+DYFDT
Subjt:  DGIDCNFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDT

Query:  YSPVTTITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYS
        YSPVT I+TIRAL ALA+IH L +HQMDVKTAFLNGDL+EEIYM QP GF+    E KV +L+KSLYGLKQAPKQWYEKF+ T+++ GF +N SD CVYS
Subjt:  YSPVTTITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYS

Query:  KMVGADCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGD
        KM   +C++I LYVDDMLIF +N+E I+  K FLS+ FEM  LGE DVILGV++ K +   SLCQ+HY++K+L KFDSFDV PVRTP+D S HL KNKG 
Subjt:  KMVGADCILICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGD

Query:  SVSQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGA
        SVSQ EYAK+IGS+M+LMNYTRPDIAYAVS LSRYTHNP+  HW  L+ LL+YLKG +D+ L F  FP  LEGYCDANWV+DNDEV+STSGYVF L G A
Subjt:  SVSQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGA

Query:  ISWKSTKQTCIARSTME
        ISWKS+KQTCIARSTME
Subjt:  ISWKSTKQTCIARSTME

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.4e-7838.16Show/hide
Query:  LIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDTYSPVTTITTIR
        + ++ P ++ E     D S W+E I +EL++  +N+ W +   P     +  +W+F  K    G+  RYKARLV  G+TQK  +DY +T++PV  I++ R
Subjt:  LIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDTYSPVTTITTIR

Query:  ALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYSKMVG--ADCIL
         +++L   +NL +HQMDVKTAFLNG L+EEIYM  P+G  IS   + VCKL K++YGLKQA + W+E F   L    F  +S D C+Y    G   + I 
Subjt:  ALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYSKMVG--ADCIL

Query:  ICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGDSVSQPEYAK
        + LYVDD++I   +M  +++ K +L   F M DL E    +G++I   +  + L QS YV+KIL+KF+  + + V TP  +  + +    D         
Subjt:  ICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGDSVSQPEYAK

Query:  IIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNK---FPVALEGYCDANWVTDNDEVNSTSGYVF-LLRGGAISWKS
        +IG +MY+M  TRPD+  AV+ILSRY+   N   W  L+ +LRYLKG ID  L F K   F   + GY D++W     +  ST+GY+F +     I W +
Subjt:  IIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNK---FPVALEGYCDANWVTDNDEVNSTSGYVF-LLRGGAISWKS

Query:  TKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDV
         +Q  +A S+ E E++AL  A +EA W+K L+  +
Subjt:  TKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.8e-10344.32Show/hide
Query:  DEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDTYSPVTTITTIRAL
        D +P++ +E L+  + +   + ++ E++SL  N  ++LV+LP G +P++CKW+FK K   +  + RYKARLVV G+ QK+G+D+ + +SPV  +T+IR +
Subjt:  DEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDTYSPVTTITTIRAL

Query:  IALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYSKMVGA-DCILICL
        ++LAA  +L + Q+DVKTAFL+GDLEEEIYM QPEGF+++ +++ VCKL KSLYGLKQAP+QWY KF++ + +  +    SD CVY K     + I++ L
Subjt:  IALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYSKMVGA-DCILICL

Query:  YVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTS--LSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKK--------NKGDSV
        YVDDMLI G +  LI+  K  LS  F+MKDLG A  ILG+KI + +TS  L L Q  Y+E++L +F+  +  PV TP      L K         KG+  
Subjt:  YVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTS--LSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKK--------NKGDSV

Query:  SQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGAIS
          P Y+  +GS+MY M  TRPDIA+AV ++SR+  NP + HW+ ++ +LRYL+G    CL F      L+GY DA+   D D   S++GY+F   GGAIS
Subjt:  SQPEYAKIIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGAIS

Query:  WKSTKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPL
        W+S  Q C+A ST E E+IA    G+E  W+K  ++++ L
Subjt:  WKSTKQTCIARSTMEFEFIALELAGQEAEWIKNLVEDVPL

P25600 Putative transposon Ty5-1 protein YCL074W3.4e-4035.21Show/hide
Query:  MDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYSKMVGADCILICLYVDDMLIFGTNMEL
        MDV TAFLN  ++E IY+ QP GF      + V +L   +YGLKQAP  W E  NNTL   GF  +  +  +Y +      I I +YVDD+L+   + ++
Subjt:  MDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYSKMVGADCILICLYVDDMLIFGTNMEL

Query:  ISDTKIFLSSHFEMKDLGEADVILGVKIRKNKT-SLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGDSVSQ-PEYAKIIGSVMYLMNYTRPD
            K  L+  + MKDLG+ D  LG+ I ++    ++L    Y+ K  ++ +       +TP   SK L +     +     Y  I+G +++  N  RPD
Subjt:  ISDTKIFLSSHFEMKDLGEADVILGVKIRKNKT-SLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGDSVSQ-PEYAKIIGSVMYLMNYTRPD

Query:  IAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNK-FPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGAISWKSTK
        I+Y VS+LSR+   P   H ++ R +LRYL      CL +     +AL  YCDA+    +D  +ST GYV LL G  ++W S K
Subjt:  IAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNK-FPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGAISWKSTK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.5e-8338.27Show/hide
Query:  DEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPI-RCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDTYSPVTTITTIRA
        + +P+T   A+ ++    W+  + SE+++ + NH W+LV  P  +  I  C+WIF +K   +GS+ RYKARLV  GY Q+ G+DY +T+SPV   T+IR 
Subjt:  DEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPI-RCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDTYSPVTTITTIRA

Query:  LIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYSKMVGADCILICL
        ++ +A   +  I Q+DV  AFL G L +++YM+QP GF   ++ N VCKLRK+LYGLKQAP+ WY +  N L+T GF  + SD  ++    G   + + +
Subjt:  LIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYSKMVGADCILICL

Query:  YVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGDSVSQP-EYAKII
        YVDD+LI G +  L+ +T   LS  F +KD  E    LG++ ++  T L L Q  Y+  +L + +     PV TP   S  L    G  ++ P EY  I+
Subjt:  YVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGDSVSQP-EYAKII

Query:  GSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNK-FPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGAISWKSTKQTC
        GS+ YL  +TRPDI+YAV+ LS++ H P   H   L+ +LRYL G  ++ +   K   ++L  Y DA+W  D D+  ST+GY+  L    ISW S KQ  
Subjt:  GSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNK-FPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGAISWKSTKQTC

Query:  IARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVPV
        + RS+ E E+ ++     E +WI +L+ ++ +  T  PV
Subjt:  IARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVPV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.0e-8137.36Show/hide
Query:  DEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPI-RCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDTYSPVTTITTIRA
        + +P+T   A+ ++    W++ + SE+++ + NH W+LV  P  +  I  C+WIF +K   +GS+ RYKARLV  GY Q+ G+DY +T+SPV   T+IR 
Subjt:  DEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPI-RCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDTYSPVTTITTIRA

Query:  LIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYSKMVGADCILICL
        ++ +A   +  I Q+DV  AFL G L +E+YM+QP GF   ++ + VC+LRK++YGLKQAP+ WY +    L+T GF  + SD  ++    G   I + +
Subjt:  LIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYSKMVGADCILICL

Query:  YVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGDSVSQP-EYAKII
        YVDD+LI G +  L+  T   LS  F +K+  +    LG++ ++    L L Q  Y   +L + +     PV TP   S  L  + G  +  P EY  I+
Subjt:  YVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGDSVSQP-EYAKII

Query:  GSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNK-FPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGAISWKSTKQTC
        GS+ YL  +TRPD++YAV+ LS+Y H P   HW+ L+ +LRYL G  D+ +   K   ++L  Y DA+W  D D+  ST+GY+  L    ISW S KQ  
Subjt:  GSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNK-FPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGAISWKSTKQTC

Query:  IARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVPV
        + RS+ E E+ ++     E +WI +L+ ++ +  +  PV
Subjt:  IARSTMEFEFIALELAGQEAEWIKNLVEDVPLCGTSVPV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.7e-8336.64Show/hide
Query:  EDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDTYSPVTTITTIRALI
        ++P TY EA   +   +W   +  E+ ++   H WE+  LP   KPI CKW++K K   +G+IERYKARLV  GYTQ++G+D+ +T+SPV  +T+++ ++
Subjt:  EDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDTYSPVTTITTIRALI

Query:  ALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQE----NKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYSKMVGADCILI
        A++AI+N  +HQ+D+  AFLNGDL+EEIYM  P G+   + +    N VC L+KS+YGLKQA +QW+ KF+ TLI  GF  + SD   + K+     + +
Subjt:  ALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQE----NKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYSKMVGADCILI

Query:  CLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNK-GDSVSQPEYAK
         +YVDD++I   N   + + K  L S F+++DLG     LG++I ++   +++CQ  Y   +L++       P   P D S     +  GD V    Y +
Subjt:  CLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNK-GDSVSQPEYAK

Query:  IIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHF-NKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGAISWKSTKQ
        +IG +MYL   TR DI++AV+ LS+++  P   H   +  +L Y+KG +   L + ++  + L+ + DA++ +  D   ST+GY   L    ISWKS KQ
Subjt:  IIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHF-NKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGAISWKSTKQ

Query:  TCIARSTMEFEFIALELAGQEAEWIKNLVEDVPL
          +++S+ E E+ AL  A  E  W+     ++ L
Subjt:  TCIARSTMEFEFIALELAGQEAEWIKNLVEDVPL

ATMG00810.1 DNA/RNA polymerases superfamily protein1.1e-3334.38Show/hide
Query:  ICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGDSVSQPEYAK
        + LYVDD+L+ G++  L++     LSS F MKDLG     LG++I+ + + L L Q+ Y E+ILN     D  P+ TP     +   +        ++  
Subjt:  ICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGDSVSQPEYAK

Query:  IIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNK-FPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGAISWKSTKQ
        I+G++ YL   TRPDI+YAV+I+ +  H P    +D L+ +LRY+KG I + L+ +K   + ++ +CD++W        ST+G+   L    ISW + +Q
Subjt:  IIGSVMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNK-FPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGAISWKSTKQ

Query:  TCIARSTMEFEFIALELAGQEAEW
          ++RS+ E E+ AL L   E  W
Subjt:  TCIARSTMEFEFIALELAGQEAEW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.7e-1538.1Show/hide
Query:  IDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDTYSPVTTITTIRA
        I ++PK+   AL       W + ++ ELD+L  N  W LV  P+    + CKW+FK K   +G+++R KARLV  G+ Q++G+ + +TYSPV    TIR 
Subjt:  IDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDTYSPVTTITTIRA

Query:  LIALA
        ++ +A
Subjt:  LIALA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTGATGGAATTGACTGTAACTTCACAAACTTGTACTTAATAGATGAGGATCCTAAAACTTACCAAGAAGCGCTAAACTCTGTAGATTCAAGTATGTGGAAA
GAGGGCATTAAAAGTGAATTGGACTCACTGGTCATGAATCATATATGGGAACTGGTGGACCTTCCTATGGGAAACAAGCCAATTAGATGTAAGTGGATCTTTAAA
AGAAAAACAAAACCAAATGGATCAATAGAAAGATACAAGGCTAGATTAGTGGTAGTAGGATATACCCAAAAACAAGGTGTTGACTATTTTGACACATATTCCCCT
GTAACTACGATAACCACAATTAGGGCCTTAATTGCTTTGGCCGCAATACATAACCTTCTTATTCACCAAATGGACGTAAAAACAGCCTTTTTAAATGGTGACTTA
GAAGAAGAAATTTATATGACACAACCAGAAGGCTTTAAAATCTCTGAGCAAGAAAACAAAGTGTGTAAACTGAGAAAGTCCCTATACGGTCTCAAGCAAGCTCCC
AAGCAGTGGTACGAAAAATTTAACAATACGTTGATAACCAATGGATTCAAAATAAATTCCTCTGACATGTGTGTTTATTCAAAGATGGTTGGAGCTGATTGCATA
TTAATATGTCTATATGTTGATGACATGTTAATCTTTGGAACAAACATGGAGTTAATAAGTGATACTAAGATTTTCCTCTCGTCACACTTTGAAATGAAAGACCTG
GGAGAAGCAGACGTAATCCTAGGTGTTAAAATCAGGAAAAACAAAACCAGTTTGTCTCTATGTCAATCTCATTACGTAGAGAAAATACTAAACAAGTTTGATTCC
TTTGATGTTTCTCCTGTGAGAACTCCATTTGACGCTAGTAAACATCTTAAGAAGAATAAAGGAGATAGTGTGTCTCAACCCGAATATGCAAAGATCATAGGTAGT
GTGATGTATTTAATGAATTACACTAGACCAGATATTGCATATGCTGTCAGTATATTAAGTAGATATACACACAATCCTAATAGATACCACTGGGATACCTTACGC
CATCTGTTGAGATATCTTAAAGGGATGATAGATTACTGTCTACACTTCAACAAATTTCCTGTCGCACTAGAAGGATATTGTGATGCAAACTGGGTTACAGATAAT
GATGAGGTTAACTCTACTAGTGGGTATGTATTTTTACTCAGAGGTGGAGCTATATCTTGGAAGTCTACAAAACAGACTTGTATAGCCAGATCCACGATGGAATTC
GAGTTCATAGCACTAGAGTTGGCAGGACAGGAGGCTGAGTGGATCAAAAATCTAGTAGAAGATGTACCATTATGTGGGACATCTGTACCTGTGTCCATACTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGTGATGGAATTGACTGTAACTTCACAAACTTGTACTTAATAGATGAGGATCCTAAAACTTACCAAGAAGCGCTAAACTCTGTAGATTCAAGTATGTGGAAA
GAGGGCATTAAAAGTGAATTGGACTCACTGGTCATGAATCATATATGGGAACTGGTGGACCTTCCTATGGGAAACAAGCCAATTAGATGTAAGTGGATCTTTAAA
AGAAAAACAAAACCAAATGGATCAATAGAAAGATACAAGGCTAGATTAGTGGTAGTAGGATATACCCAAAAACAAGGTGTTGACTATTTTGACACATATTCCCCT
GTAACTACGATAACCACAATTAGGGCCTTAATTGCTTTGGCCGCAATACATAACCTTCTTATTCACCAAATGGACGTAAAAACAGCCTTTTTAAATGGTGACTTA
GAAGAAGAAATTTATATGACACAACCAGAAGGCTTTAAAATCTCTGAGCAAGAAAACAAAGTGTGTAAACTGAGAAAGTCCCTATACGGTCTCAAGCAAGCTCCC
AAGCAGTGGTACGAAAAATTTAACAATACGTTGATAACCAATGGATTCAAAATAAATTCCTCTGACATGTGTGTTTATTCAAAGATGGTTGGAGCTGATTGCATA
TTAATATGTCTATATGTTGATGACATGTTAATCTTTGGAACAAACATGGAGTTAATAAGTGATACTAAGATTTTCCTCTCGTCACACTTTGAAATGAAAGACCTG
GGAGAAGCAGACGTAATCCTAGGTGTTAAAATCAGGAAAAACAAAACCAGTTTGTCTCTATGTCAATCTCATTACGTAGAGAAAATACTAAACAAGTTTGATTCC
TTTGATGTTTCTCCTGTGAGAACTCCATTTGACGCTAGTAAACATCTTAAGAAGAATAAAGGAGATAGTGTGTCTCAACCCGAATATGCAAAGATCATAGGTAGT
GTGATGTATTTAATGAATTACACTAGACCAGATATTGCATATGCTGTCAGTATATTAAGTAGATATACACACAATCCTAATAGATACCACTGGGATACCTTACGC
CATCTGTTGAGATATCTTAAAGGGATGATAGATTACTGTCTACACTTCAACAAATTTCCTGTCGCACTAGAAGGATATTGTGATGCAAACTGGGTTACAGATAAT
GATGAGGTTAACTCTACTAGTGGGTATGTATTTTTACTCAGAGGTGGAGCTATATCTTGGAAGTCTACAAAACAGACTTGTATAGCCAGATCCACGATGGAATTC
GAGTTCATAGCACTAGAGTTGGCAGGACAGGAGGCTGAGTGGATCAAAAATCTAGTAGAAGATGTACCATTATGTGGGACATCTGTACCTGTGTCCATACTGTGA
Protein sequenceShow/hide protein sequence
MRDGIDCNFTNLYLIDEDPKTYQEALNSVDSSMWKEGIKSELDSLVMNHIWELVDLPMGNKPIRCKWIFKRKTKPNGSIERYKARLVVVGYTQKQGVDYFDTYSP
VTTITTIRALIALAAIHNLLIHQMDVKTAFLNGDLEEEIYMTQPEGFKISEQENKVCKLRKSLYGLKQAPKQWYEKFNNTLITNGFKINSSDMCVYSKMVGADCI
LICLYVDDMLIFGTNMELISDTKIFLSSHFEMKDLGEADVILGVKIRKNKTSLSLCQSHYVEKILNKFDSFDVSPVRTPFDASKHLKKNKGDSVSQPEYAKIIGS
VMYLMNYTRPDIAYAVSILSRYTHNPNRYHWDTLRHLLRYLKGMIDYCLHFNKFPVALEGYCDANWVTDNDEVNSTSGYVFLLRGGAISWKSTKQTCIARSTMEF
EFIALELAGQEAEWIKNLVEDVPLCGTSVPVSIL