CuGenDBv2

Cmc01g0020251 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc01g0020251
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Ty3/gypsy retrotransposon protein
Genome location: CMiso1.1chr01:18406533..18407498
RNA-Seq Expression: Cmc01g0020251
Synteny: Cmc01g0020251
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004601 - peroxidase activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
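The genome location in this record follows a sequence:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the string can be parsed and the span measured as follows. The names `Location` and `parse_location` are hypothetical and are not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates, so both endpoints count.
        return self.end - self.start + 1


# Matches strings of the form "<sequence id>:<start>..<end>".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Split a location string into sequence id, start, and end."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("CMiso1.1chr01:18406533..18407498")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive assumption the span is 18407498 - 18406533 + 1 = 966 bp.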


Homology
GenBank top hits (e value, % identity, alignment)
KAA0062448.1 disease resistance protein [Cucumis melo var. makuwa] (e value: 1.3e-157; % identity: 85.53)
Query:  HWPSMKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVD
        HWPSM  VFM G++QV+LKGDPSLTK ECSLKTISKTWEEEDQGFL+EFQ +EI   +E E+EKEEEGE+S LPM+RNLLA+ R IFE PKGLPPKRAVD
Subjt:  HWPSMKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVD

Query:  HRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKL
        HRI+  EGQ PINVRPYKYGYIQK EIE+LVS+MLQA IIRPSRSPYS+PVLLVKK+DGGWRFCVDYRKLNQ++IADKFPIPVIEELLDELH AEVFSKL
Subjt:  HRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKL

Query:  DLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKC
        DLRSGYHQIRMK+ED+EKT+FRTHEGHYEFLVMPFGLTNAPATFQSLMNQ+FRPFLRRFVLVFFDDILIYSK+ TEHE+HLGVVFNVMKDNQLFANEKKC
Subjt:  DLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKC

Query:  VIGQSRISYLGHWISKKG
        +IG SRI+YLGHWISKKG
Subjt:  VIGQSRISYLGHWISKKG

KAE8637561.1 hypothetical protein CSA_017659 [Cucumis sativus] (e value: 2.5e-135; % identity: 73.27)
Query:  HWPSMKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVD
        HWPS+   F +G  +++LKGDPSL + EC LKTI KTWE EDQGFLLEFQ  E  IDT+ E ++  +G++ ++PM++NLL +Y +IFE PK LPPKR +D
Subjt:  HWPSMKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVD

Query:  HRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKL
        HRI+ L  Q+PINVRPYKYGY+QKEEIEKLV +MLQAG+IRPS SPYS+PVLLVKK+DGGWRFCVDYRKLNQ++I+DKFPIPVIEELLDELH A VFSKL
Subjt:  HRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKL

Query:  DLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKC
        D++S YHQIRM++EDVEKT+FRTHEGHYEFLVMPFGLTNAPATFQSLMNQVF+PFLRR VLVFFDDIL+YSKD +EHE+HLG+VF V++DN LFAN+KKC
Subjt:  DLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKC

Query:  VIGQSRISYLGHWISKKG
        VI  S+I YLGH IS KG
Subjt:  VIGQSRISYLGHWISKKG

TYJ98046.1 peroxidase 64 [Cucumis melo var. makuwa] (e value: 1.1e-156; % identity: 84.91)
Query:  HWPSMKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVD
        HWPSM  VFM G++QV+LKGDPSL K ECSLKTISKTWEEEDQGFL+EFQ +EI   +E E+EKEEEGE+S LPM+RNLLA+ R IFE PKGLPPKRAVD
Subjt:  HWPSMKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVD

Query:  HRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKL
        HRI+  EGQ PINVRPYKYGYIQK EIE+LVS+MLQA IIRPSRSPYS+PVLLVKKRDGGWRFCVDY KLNQ+++ADKFPIPVIEELLDELH AEVFSKL
Subjt:  HRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKL

Query:  DLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKC
        DLRSGYHQIRMK+ED+EKT+FRTHEGHYEFLVMPFGLTN PATFQSLMNQ+FRPFLRRFVLVFFDDILIYSK+ TEHE+HLGVVFNVMKDNQLFANEKKC
Subjt:  DLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKC

Query:  VIGQSRISYLGHWISKKG
        VIG SRI+YLGHWISKKG
Subjt:  VIGQSRISYLGHWISKKG

TYK22651.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 1.3e-136; % identity: 100)
Query:  MKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVDHRIV
        MKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVDHRIV
Subjt:  MKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVDHRIV

Query:  TLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKLDLRS
        TLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKLDLRS
Subjt:  TLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKLDLRS

Query:  GYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQ
        GYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQ
Subjt:  GYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQ

TYK27437.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 1.3e-157; % identity: 85.53)
Query:  HWPSMKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVD
        HWPSM  VFM G++QV+LKGDPSLTK ECSLKTISKTWEEEDQGFL+EFQ +EI   +E E+EKEEEGE+S LPM+RNLLA+ R IFE PKGLPPKRAVD
Subjt:  HWPSMKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVD

Query:  HRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKL
        HRI+  EGQ PINVRPYKYGYIQK EIE+LVS+MLQA IIRPSRSPYS+PVLLVKK+DGGWRFCVDYRKLNQ++IADKFPIPVIEELLDELH AEVFSKL
Subjt:  HRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKL

Query:  DLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKC
        DLRSGYHQIRMK+ED+EKT+FRTHEGHYEFLVMPFGLTNAPATFQSLMNQ+FRPFLRRFVLVFFDDILIYSK+ TEHE+HLGVVFNVMKDNQLFANEKKC
Subjt:  DLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKC

Query:  VIGQSRISYLGHWISKKG
        +IG SRI+YLGHWISKKG
Subjt:  VIGQSRISYLGHWISKKG

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UYM1 Ty3/gypsy retrotransposon protein (e value: 2.7e-135; % identity: 72.64)
Query:  HWPSMKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVD
        +WPS+   F  G  Q+VLKGDPSL + ECSL+T+ KTW+EEDQGFLLE+  +E+  D   + +++EEG+++ +PM+R LL +Y+ IF TPKGLPPKR +D
Subjt:  HWPSMKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVD

Query:  HRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKL
        HRI+TL  QKPINVRPYKYG+IQK EIEKLV++MLQ G+IRPSRSPYS+PVLLVKK+DGGWRFCVDYRKLNQ +I+DKFPIPVIEELLDEL+ A VFSKL
Subjt:  HRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKL

Query:  DLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKC
        DL+SGYHQIRMK+EDVEKT+FRTHEGHYEFLVMPFGLTNAPATFQSLMNQVF+PFLRR VLVFFDDIL+YS+D  E E+HLG+VF V++DNQL+AN KKC
Subjt:  DLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKC

Query:  VIGQSRISYLGHWISKKG
        V   S+I YLGH ISK G
Subjt:  VIGQSRISYLGHWISKKG

A0A5A7V4D8 Disease resistance protein (e value: 6.5e-158; % identity: 85.53)
Query:  HWPSMKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVD
        HWPSM  VFM G++QV+LKGDPSLTK ECSLKTISKTWEEEDQGFL+EFQ +EI   +E E+EKEEEGE+S LPM+RNLLA+ R IFE PKGLPPKRAVD
Subjt:  HWPSMKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVD

Query:  HRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKL
        HRI+  EGQ PINVRPYKYGYIQK EIE+LVS+MLQA IIRPSRSPYS+PVLLVKK+DGGWRFCVDYRKLNQ++IADKFPIPVIEELLDELH AEVFSKL
Subjt:  HRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKL

Query:  DLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKC
        DLRSGYHQIRMK+ED+EKT+FRTHEGHYEFLVMPFGLTNAPATFQSLMNQ+FRPFLRRFVLVFFDDILIYSK+ TEHE+HLGVVFNVMKDNQLFANEKKC
Subjt:  DLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKC

Query:  VIGQSRISYLGHWISKKG
        +IG SRI+YLGHWISKKG
Subjt:  VIGQSRISYLGHWISKKG

A0A5D3BG24 Peroxidase 64 (e value: 5.5e-157; % identity: 84.91)
Query:  HWPSMKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVD
        HWPSM  VFM G++QV+LKGDPSL K ECSLKTISKTWEEEDQGFL+EFQ +EI   +E E+EKEEEGE+S LPM+RNLLA+ R IFE PKGLPPKRAVD
Subjt:  HWPSMKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVD

Query:  HRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKL
        HRI+  EGQ PINVRPYKYGYIQK EIE+LVS+MLQA IIRPSRSPYS+PVLLVKKRDGGWRFCVDY KLNQ+++ADKFPIPVIEELLDELH AEVFSKL
Subjt:  HRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKL

Query:  DLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKC
        DLRSGYHQIRMK+ED+EKT+FRTHEGHYEFLVMPFGLTN PATFQSLMNQ+FRPFLRRFVLVFFDDILIYSK+ TEHE+HLGVVFNVMKDNQLFANEKKC
Subjt:  DLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKC

Query:  VIGQSRISYLGHWISKKG
        VIG SRI+YLGHWISKKG
Subjt:  VIGQSRISYLGHWISKKG

A0A5D3DG25 Reverse transcriptase (e value: 6.4e-137; % identity: 100)
Query:  MKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVDHRIV
        MKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVDHRIV
Subjt:  MKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVDHRIV

Query:  TLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKLDLRS
        TLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKLDLRS
Subjt:  TLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKLDLRS

Query:  GYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQ
        GYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQ
Subjt:  GYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQ

A0A5D3DUJ0 Ty3/gypsy retrotransposon protein (e value: 6.5e-158; % identity: 85.53)
Query:  HWPSMKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVD
        HWPSM  VFM G++QV+LKGDPSLTK ECSLKTISKTWEEEDQGFL+EFQ +EI   +E E+EKEEEGE+S LPM+RNLLA+ R IFE PKGLPPKRAVD
Subjt:  HWPSMKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVD

Query:  HRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKL
        HRI+  EGQ PINVRPYKYGYIQK EIE+LVS+MLQA IIRPSRSPYS+PVLLVKK+DGGWRFCVDYRKLNQ++IADKFPIPVIEELLDELH AEVFSKL
Subjt:  HRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKL

Query:  DLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKC
        DLRSGYHQIRMK+ED+EKT+FRTHEGHYEFLVMPFGLTNAPATFQSLMNQ+FRPFLRRFVLVFFDDILIYSK+ TEHE+HLGVVFNVMKDNQLFANEKKC
Subjt:  DLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKC

Query:  VIGQSRISYLGHWISKKG
        +IG SRI+YLGHWISKKG
Subjt:  VIGQSRISYLGHWISKKG

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 9.4e-45; % identity: 42.51)
Query:  YKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLV-KKRDGG----WRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKLDLRSGYHQIRM
        Y Y    ++E+E  +  ML  GIIR S SPY++P+ +V KK+D      +R  +DYRKLN++++ D+ PIP ++E+L +L     F+ +DL  G+HQI M
Subjt:  YKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLV-KKRDGG----WRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKLDLRSGYHQIRM

Query:  KDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKCVIGQSRISYLG
          E V KT+F T  GHYE+L MPFGL NAPATFQ  MN + RP L +  LV+ DDI+++S    EH + LG+VF  +    L     KC   +   ++LG
Subjt:  KDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKCVIGQSRISYLG

Query:  HWISKKG
        H ++  G
Subjt:  HWISKKG

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 1.7e-46; % identity: 37.35)
Query:  VRNLLARYRSI-FETPKGLPPKRAVDHRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGG-----WRFCVDYRK
        ++ LL ++R++ ++  + L     + H ++      PI  + Y      + E+E  V +ML  G+IR S SPY++P  +V K+        +R  +DYRK
Subjt:  VRNLLARYRSI-FETPKGLPPKRAVDHRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGG-----WRFCVDYRK

Query:  LNQLSIADKFPIPVIEELLDELHEAEVFSKLDLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILI
        LN+++I D++PIP ++E+L +L + + F+ +DL  G+HQI M +E + KT+F T  GHYE+L MPFGL NAPATFQ  MN + RP L +  LV+ DDI+I
Subjt:  LNQLSIADKFPIPVIEELLDELHEAEVFSKLDLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILI

Query:  YSKDFTEHERHLGVVFNVMKDNQLFANEKKCVIGQSRISYLGHWISKKG
        +S   TEH   + +VF  + D  L     KC   +   ++LGH ++  G
Subjt:  YSKDFTEHERHLGVVFNVMKDNQLFANEKKCVIGQSRISYLGHWISKKG

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e value: 8.2e-49; % identity: 37.42)
Query:  KTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLE---FQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRA----
        KT+ +A TT   +    +L     +  T ++    E+   L E   +  V   I + + N  +   + +   +   L  +YR I      LPP+ A    
Subjt:  KTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLE---FQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRA----

Query:  --VDHRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEV
          V H I    G +   ++PY      ++EI K+V K+L    I PS+SP S+PV+LV K+DG +R CVDYR LN+ +I+D FP+P I+ LL  +  A++
Subjt:  --VDHRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEV

Query:  FSKLDLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFAN
        F+ LDL SGYHQI M+ +D  KT+F T  G YE+ VMPFGL NAP+TF   M   FR    RFV V+ DDILI+S+   EH +HL  V   +K+  L   
Subjt:  FSKLDLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFAN

Query:  EKKCVIGQSRISYLGHWI
        +KKC        +LG+ I
Subjt:  EKKCVIGQSRISYLGHWI

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus (e value: 6.1e-44; % identity: 37.2)
Query:  MVRNLLARYRSIFETP-KGLPPKRAVDHRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKR-----DGGWRFCVDYR
        ++ +LL  +  IFE P  G+  + AV   I T   Q PI  + Y Y    + E+E+ + ++LQ GIIRPS SPY++P+ +V K+     +  +R  VD++
Subjt:  MVRNLLARYRSIFETP-KGLPPKRAVDHRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKR-----DGGWRFCVDYR

Query:  KLNQLSIADKFPIPVIEELLDELHEAEVFSKLDLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDIL
        +LN ++I D +PIP I   L  L  A+ F+ LDL SG+HQI MK+ D+ KT+F T  G YEFL +PFGL NAPA FQ +++ + R  + +   V+ DDI+
Subjt:  KLNQLSIADKFPIPVIEELLDELHEAEVFSKLDLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDIL

Query:  IYSKDFTEHERHLGVVFNVMKDNQLFANEKKCVIGQSRISYLGHWISKKG
        ++S+D+  H ++L +V   +    L  N +K     +++ +LG+ ++  G
Subjt:  IYSKDFTEHERHLGVVFNVMKDNQLFANEKKCVIGQSRISYLGHWISKKG

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e value: 8.2e-49; % identity: 37.42)
Query:  KTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLE---FQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRA----
        KT+ +A TT   +    +L     +  T ++    E+   L E   +  V   I + + N  +   + +   +   L  +YR I      LPP+ A    
Subjt:  KTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLE---FQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRA----

Query:  --VDHRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEV
          V H I    G +   ++PY      ++EI K+V K+L    I PS+SP S+PV+LV K+DG +R CVDYR LN+ +I+D FP+P I+ LL  +  A++
Subjt:  --VDHRIVTLEGQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEV

Query:  FSKLDLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFAN
        F+ LDL SGYHQI M+ +D  KT+F T  G YE+ VMPFGL NAP+TF   M   FR    RFV V+ DDILI+S+   EH +HL  V   +K+  L   
Subjt:  FSKLDLRSGYHQIRMKDEDVEKTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFAN

Query:  EKKCVIGQSRISYLGHWI
        +KKC        +LG+ I
Subjt:  EKKCVIGQSRISYLGHWI

Arabidopsis top hits (e value, % identity, alignment)
ATMG00850.1 DNA/RNA polymerases superfamily protein (e value: 1.0e-06; % identity: 52.5)
Query:  IQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGW
        +++  ++  + +ML+A II+PS SPYS+PVLLV+K+DGGW
Subjt:  IQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGW
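In BLAST-style pairwise output, the middle line of each alignment block shows a letter where query and subject residues are identical, a `+` for a conservative substitution, and a space for other mismatches. As a rough sketch, assuming the reported % identity is identical positions divided by alignment length, the value can be recomputed from that middle line. The function name `percent_identity` is hypothetical, and the example uses the middle line of the ATMG00850.1 alignment in this record.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style alignment midline.

    Letters in the midline mark identical positions; '+' and spaces
    are not counted as identities.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Midline of the ATMG00850.1 alignment (40 aligned columns).
midline = "+++  ++  + +ML+A II+PS SPYS+PVLLV+K+DGGW"
print(percent_identity(midline, len(midline)))
```

For alignments that span several blocks, the midlines would be concatenated before counting, and gap columns count toward the alignment length.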


Sequences
CDS sequence
ATGGGGAATCACTGGCCATCCATGAAGACGGTGTTTATGGCTGGCACAACTCAAGTAGTGTTAAAAGGAGACCCATCATTGACAAAAATAGAGTGTTCCCTGAAAACAAT
ATCCAAAACATGGGAAGAGGAGGACCAGGGATTTTTGTTAGAATTTCAAGGGGTTGAAATTAATATAGACACTGAAGACGAGAATGAAAAAGAGGAGGAAGGGGAGAAAT
CGAAACTACCAATGGTCAGAAACTTGTTAGCACGATACAGAAGCATATTCGAAACGCCGAAAGGATTACCTCCAAAAAGGGCAGTGGACCACCGAATAGTGACCTTAGAG
GGGCAAAAACCCATCAACGTGCGGCCCTACAAGTATGGGTACATACAAAAAGAGGAGATAGAAAAGCTGGTTTCAAAAATGCTTCAAGCTGGTATCATCCGACCCAGTCG
AAGTCCTTATTCCAACCCGGTATTATTAGTAAAGAAACGTGATGGAGGGTGGCGTTTCTGTGTCGACTACCGTAAGTTGAATCAATTAAGTATTGCCGATAAATTTCCCA
TTCCAGTCATCGAAGAACTCCTGGACGAATTGCACGAGGCTGAGGTCTTTTCGAAGTTAGATTTACGATCGGGTTACCATCAAATTCGAATGAAGGATGAGGACGTTGAG
AAAACATCGTTTCGCACCCACGAGGGACACTATGAATTTTTGGTTATGCCATTTGGACTGACCAACGCGCCAGCAACTTTTCAATCCCTAATGAACCAGGTTTTCAGGCC
TTTCCTCAGACGCTTTGTATTAGTTTTCTTTGACGACATACTGATTTACAGCAAGGATTTTACAGAGCATGAGAGACATTTGGGAGTAGTATTTAATGTGATGAAGGACA
ATCAGTTGTTTGCTAACGAAAAGAAATGTGTGATCGGACAGTCAAGGATAAGTTACTTAGGCCATTGGATATCAAAAAAGGGGTAG
mRNA sequence
ATGGGGAATCACTGGCCATCCATGAAGACGGTGTTTATGGCTGGCACAACTCAAGTAGTGTTAAAAGGAGACCCATCATTGACAAAAATAGAGTGTTCCCTGAAAACAAT
ATCCAAAACATGGGAAGAGGAGGACCAGGGATTTTTGTTAGAATTTCAAGGGGTTGAAATTAATATAGACACTGAAGACGAGAATGAAAAAGAGGAGGAAGGGGAGAAAT
CGAAACTACCAATGGTCAGAAACTTGTTAGCACGATACAGAAGCATATTCGAAACGCCGAAAGGATTACCTCCAAAAAGGGCAGTGGACCACCGAATAGTGACCTTAGAG
GGGCAAAAACCCATCAACGTGCGGCCCTACAAGTATGGGTACATACAAAAAGAGGAGATAGAAAAGCTGGTTTCAAAAATGCTTCAAGCTGGTATCATCCGACCCAGTCG
AAGTCCTTATTCCAACCCGGTATTATTAGTAAAGAAACGTGATGGAGGGTGGCGTTTCTGTGTCGACTACCGTAAGTTGAATCAATTAAGTATTGCCGATAAATTTCCCA
TTCCAGTCATCGAAGAACTCCTGGACGAATTGCACGAGGCTGAGGTCTTTTCGAAGTTAGATTTACGATCGGGTTACCATCAAATTCGAATGAAGGATGAGGACGTTGAG
AAAACATCGTTTCGCACCCACGAGGGACACTATGAATTTTTGGTTATGCCATTTGGACTGACCAACGCGCCAGCAACTTTTCAATCCCTAATGAACCAGGTTTTCAGGCC
TTTCCTCAGACGCTTTGTATTAGTTTTCTTTGACGACATACTGATTTACAGCAAGGATTTTACAGAGCATGAGAGACATTTGGGAGTAGTATTTAATGTGATGAAGGACA
ATCAGTTGTTTGCTAACGAAAAGAAATGTGTGATCGGACAGTCAAGGATAAGTTACTTAGGCCATTGGATATCAAAAAAGGGGTAG
Protein sequence
MGNHWPSMKTVFMAGTTQVVLKGDPSLTKIECSLKTISKTWEEEDQGFLLEFQGVEINIDTEDENEKEEEGEKSKLPMVRNLLARYRSIFETPKGLPPKRAVDHRIVTLE
GQKPINVRPYKYGYIQKEEIEKLVSKMLQAGIIRPSRSPYSNPVLLVKKRDGGWRFCVDYRKLNQLSIADKFPIPVIEELLDELHEAEVFSKLDLRSGYHQIRMKDEDVE
KTSFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFRPFLRRFVLVFFDDILIYSKDFTEHERHLGVVFNVMKDNQLFANEKKCVIGQSRISYLGHWISKKG
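
The protein sequence corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame, the translation can be reproduced as follows. The function name `translate` is hypothetical and is not a CuGenDB tool; the example input is the first 30 nucleotides of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each of the three positions.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence copied
    as wrapped lines can be passed in directly. Codons with ambiguous
    bases are rendered as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS above.
print(translate("ATGGGGAATCACTGGCCATCCATGAAGACG"))
```

Passing the full CDS, with its line breaks, should yield the protein sequence shown above, with the terminal TAG stop codon omitted.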