; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0018941 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0018941
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRibonuclease H
Genome locationCMiso1.1chr01:16605426..16606610
RNA-Seq ExpressionCmc01g0018941
SyntenyCmc01g0018941
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033221.1 uncharacterized protein E6C27_scaffold845G00100 [Cucumis melo var. makuwa]7.4e-16678.68Show/hide
Query:  MPYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAG
        M YQETL VIN GT EEVK+VRIGTLAS+QDQ +LVTLLHEFKDIFAWSYQDM GLDT+IV HRLPLK ECKPIRQKLRKLKPEM IKIKEEVKKQFDAG
Subjt:  MPYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAG

Query:  FLAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGL
        FLAV KY IWVANIVPVPKKD KVRMCVDYRDLNRAS KDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLW  FCYKVMPFGL
Subjt:  FLAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGL

Query:  KNTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKT
        KN GAT QRAMVTLFHDLMHKEI+VY+DDMIAK RLEEKHVVTLRKLFERL+KFQLKLNP KC FGVSSGKLLGFIVSREGIKVDPDK+K I++LR PKT
Subjt:  KNTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKT

Query:  QKEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK
        QKE                                               IKDYLQSP ILV PTPGR LILYL VKEGSMGCVLGQ DSTGKK
Subjt:  QKEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK

KAA0065293.1 uncharacterized protein E6C27_scaffold1023G00060 [Cucumis melo var. makuwa]2.8e-17380.96Show/hide
Query:  MPYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAG
        MPYQETLKVINLGT EEVK+VRIGTLAS+QDQ +LVTLLHEFKDIFAWSYQDMPGLDT+IV HRLPLK ECKPIRQKLRKLKPEM IKIKEEVKKQFDAG
Subjt:  MPYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAG

Query:  FLAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGL
        FLAV KY IWVANIVPVPKKD KVRMCVDYRDLNRA                                YNQIKMAPEDQEKTTFITLW TFCYKVMPFGL
Subjt:  FLAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGL

Query:  KNTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKT
        KN GAT QRAMVTLFHDLMHKEI+VY+DDM+AK RLEEKHVVTLRKLFE L+KFQLKLNP KC FGVSS KLLGFIVSREGIKVDPDK++ I++LRPPKT
Subjt:  KNTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKT

Query:  QKEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK
        QKEVRSFL RLNY  RFISHLTQTCE ILKL  KNEI  WNEDCQKAFDKIKDYLQSP ILV PTPGR LI  L VKEGSMGCVLGQHDSTGKK
Subjt:  QKEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK

TYK00092.1 protein NYNRIN-like [Cucumis melo var. makuwa]2.6e-19588.58Show/hide
Query:  MPYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAG
        MPYQETLKVINLGT EEVK+VRIGTLAS+QDQ +LV LLHEFKDIFAWSYQDMP LDT+IV HRLPLK ECKPIRQKLRKLKP M IKIKEEVKKQFDAG
Subjt:  MPYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAG

Query:  FLAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGL
        FLAV KY IWVANIVPVPKKD KVRMCVDYRDLNRAS KDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLW TFCYKVMPFGL
Subjt:  FLAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGL

Query:  KNTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKT
        KN GAT QRAMVTLFHDLMHKEI+VY+DDMIAK RLEEKHVVTL KLFERL+KFQLKLNP KCIFGVSSGKLLGFIVS EGIKVD DK+K I++LRPPKT
Subjt:  KNTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKT

Query:  QKEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK
        +KEVRSFL RLNYI RFISHLTQTCE ILKL  KNEI  WNEDCQKAFDKIKDYLQSP ILV PTP R LILYL V EGSMGCVLGQHDSTGKK
Subjt:  QKEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK

XP_022155098.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022231, partial [Momordica charantia]1.1e-16673.54Show/hide
Query:  PYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAGF
        P+ E+ +++NLG+  EVK+V+IGT  S + +  L+ LLHEF D+FAWSYQDMP LDTDIV H+LP+  ECKP+RQKLRK++P+M IK+K+EV+KQ DAGF
Subjt:  PYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAGF

Query:  LAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGLK
        L V  Y  WVANIVPVPKK+ +VRMCVDYRDLNRAS KDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYN+IKMAPED+EKTTFITLW TFCYKVM FGLK
Subjt:  LAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGLK

Query:  NTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKTQ
        N GAT QRAMVTLFHDLMHKEI+VY+DDMIAK R  E+H + LRKLFERL+KFQLKLNP KC FGV+SGKLLGF+V +EGIKVDPDK+K I+ + PPKT 
Subjt:  NTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKTQ

Query:  KEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK
        KEVR FL RLNYI RFISHLT TCE I KL  KN   +W EDCQ AFDKIK Y QSP ILV PT GR LILYL + E SMGCVLGQHD +G+K
Subjt:  KEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK

XP_031739260.1 LOW QUALITY PROTEIN: uncharacterized protein LOC116402917 [Cucumis sativus]5.7e-16673.28Show/hide
Query:  PYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAGF
        P+QE ++VINLG+ EE K+V+IGT  + + +  ++ LL E+ DIFAWSYQDMPGL+TDIV HR+PLK EC P+RQKLRK+KP++ IKIKEEV+KQ +AGF
Subjt:  PYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAGF

Query:  LAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGLK
        L V KY  WVANIVPVPKKD KVRMCVDYRDLNRAS KDNFPLPHID+LVDNTAG+STFSFMDGFSGYNQIKMA ED+EKTTFITLW TFCYKVMPFGLK
Subjt:  LAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGLK

Query:  NTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKTQ
        N GAT QRAMVTLFHD+MHKEI+VY+DDMIAK +  E H   L+KLF+RL+K+QLKLNP+KC FG +SGKLLGFIVS EGIKVDPDK+K IM++  P+T+
Subjt:  NTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKTQ

Query:  KEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK
        KEVR+FL RLNYI RFISHLT TCE I KL  KN    WNEDC++AF+KIK YLQSP +L+ P PGR LILYL V EGSMG VLGQHD +GKK
Subjt:  KEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK

TrEMBL top hitse value%identityAlignment
A0A2K3NP54 Ribonuclease H1.8e-16572.01Show/hide
Query:  PYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAGF
        P+QE+++VINLG  E+ K++++G       +  LV LL E+ D+FAWSYQDMPGLDTDIV HRLPLK EC P++QK+R+ +P+M++KI+EEVKKQFDAGF
Subjt:  PYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAGF

Query:  LAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGLK
        LAV KY  WVANIVPVPKKD KVRMCVDYRDLNRAS KD+FPLPHIDVLVDNTA FS FSFMDGFSGYNQIKMAPED EKTTFIT W TFCYKVMPFGLK
Subjt:  LAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGLK

Query:  NTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKTQ
        N GAT QRAMVTLFHD++HKEI+VY+DDMIAK + EE H+V L+KLFERL+KF+L+LNP KC FG  SGKLLGF+VS+ GI+VDPDK++ I N+  P+T+
Subjt:  NTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKTQ

Query:  KEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK
        KEVR FL RLNYI RFISHLT TCE I KL  K++   WNEDCQKAF+KIK+YLQ P ILV P PGR LI+YL V +GSMGCVLGQHD TG+K
Subjt:  KEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK

A0A5A7SUT0 Reverse transcriptase3.6e-16678.68Show/hide
Query:  MPYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAG
        M YQETL VIN GT EEVK+VRIGTLAS+QDQ +LVTLLHEFKDIFAWSYQDM GLDT+IV HRLPLK ECKPIRQKLRKLKPEM IKIKEEVKKQFDAG
Subjt:  MPYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAG

Query:  FLAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGL
        FLAV KY IWVANIVPVPKKD KVRMCVDYRDLNRAS KDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLW  FCYKVMPFGL
Subjt:  FLAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGL

Query:  KNTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKT
        KN GAT QRAMVTLFHDLMHKEI+VY+DDMIAK RLEEKHVVTLRKLFERL+KFQLKLNP KC FGVSSGKLLGFIVSREGIKVDPDK+K I++LR PKT
Subjt:  KNTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKT

Query:  QKEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK
        QKE                                               IKDYLQSP ILV PTPGR LILYL VKEGSMGCVLGQ DSTGKK
Subjt:  QKEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK

A0A5A7VAU5 Uncharacterized protein1.4e-17380.96Show/hide
Query:  MPYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAG
        MPYQETLKVINLGT EEVK+VRIGTLAS+QDQ +LVTLLHEFKDIFAWSYQDMPGLDT+IV HRLPLK ECKPIRQKLRKLKPEM IKIKEEVKKQFDAG
Subjt:  MPYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAG

Query:  FLAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGL
        FLAV KY IWVANIVPVPKKD KVRMCVDYRDLNRA                                YNQIKMAPEDQEKTTFITLW TFCYKVMPFGL
Subjt:  FLAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGL

Query:  KNTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKT
        KN GAT QRAMVTLFHDLMHKEI+VY+DDM+AK RLEEKHVVTLRKLFE L+KFQLKLNP KC FGVSS KLLGFIVSREGIKVDPDK++ I++LRPPKT
Subjt:  KNTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKT

Query:  QKEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK
        QKEVRSFL RLNY  RFISHLTQTCE ILKL  KNEI  WNEDCQKAFDKIKDYLQSP ILV PTPGR LI  L VKEGSMGCVLGQHDSTGKK
Subjt:  QKEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK

A0A5D3BMZ2 Protein NYNRIN-like1.3e-19588.58Show/hide
Query:  MPYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAG
        MPYQETLKVINLGT EEVK+VRIGTLAS+QDQ +LV LLHEFKDIFAWSYQDMP LDT+IV HRLPLK ECKPIRQKLRKLKP M IKIKEEVKKQFDAG
Subjt:  MPYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAG

Query:  FLAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGL
        FLAV KY IWVANIVPVPKKD KVRMCVDYRDLNRAS KDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLW TFCYKVMPFGL
Subjt:  FLAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGL

Query:  KNTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKT
        KN GAT QRAMVTLFHDLMHKEI+VY+DDMIAK RLEEKHVVTL KLFERL+KFQLKLNP KCIFGVSSGKLLGFIVS EGIKVD DK+K I++LRPPKT
Subjt:  KNTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKT

Query:  QKEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK
        +KEVRSFL RLNYI RFISHLTQTCE ILKL  KNEI  WNEDCQKAFDKIKDYLQSP ILV PTP R LILYL V EGSMGCVLGQHDSTGKK
Subjt:  QKEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK

A0A6J1DM29 LOW QUALITY PROTEIN: uncharacterized protein LOC1110222315.5e-16773.54Show/hide
Query:  PYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAGF
        P+ E+ +++NLG+  EVK+V+IGT  S + +  L+ LLHEF D+FAWSYQDMP LDTDIV H+LP+  ECKP+RQKLRK++P+M IK+K+EV+KQ DAGF
Subjt:  PYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAGF

Query:  LAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGLK
        L V  Y  WVANIVPVPKK+ +VRMCVDYRDLNRAS KDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYN+IKMAPED+EKTTFITLW TFCYKVM FGLK
Subjt:  LAVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGLK

Query:  NTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKTQ
        N GAT QRAMVTLFHDLMHKEI+VY+DDMIAK R  E+H + LRKLFERL+KFQLKLNP KC FGV+SGKLLGF+V +EGIKVDPDK+K I+ + PPKT 
Subjt:  NTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKTQ

Query:  KEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK
        KEVR FL RLNYI RFISHLT TCE I KL  KN   +W EDCQ AFDKIK Y QSP ILV PT GR LILYL + E SMGCVLGQHD +G+K
Subjt:  KEVRSFLRRLNYIERFISHLTQTCEQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.69.8e-3633.33Show/hide
Query:  IWVANIVPVPKKD-----RKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGLKNT
        IWV     VPKK      +K R+ +DYR LN  ++ D  P+P++D ++      + F+ +D   G++QI+M PE   KT F T    + Y  MPFGLKN 
Subjt:  IWVANIVPVPKKD-----RKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGLKNT

Query:  GATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKTQKE
         AT QR M  +   L++K   VY+DD+I      ++H+ +L  +FE+L K  LKL   KC F       LG +++ +GIK +P+K++ I     P   KE
Subjt:  GATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKTQKE

Query:  VRSFLRRLNYIERFISHLTQTCEQILKLFCKN-EIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQ
        +++FL    Y  +FI +     + + K   KN +I   N +   AF K+K  +    IL +P   +   L     + ++G VL Q
Subjt:  VRSFLRRLNYIERFISHLTQTCEQILKLFCKN-EIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQ

P10394 Retrovirus-related Pol polyprotein from transposon 4122.0e-3329.2Show/hide
Query:  LVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAGFL--AVVKYLIWVANIVPVPKKD------RKVRM
        L  +  E+ DIFA   +  P    ++ + +L LK + +P+  K  +       +I+ +V+K      +  +V +Y    + ++ VPKK       +K R+
Subjt:  LVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAGFL--AVVKYLIWVANIVPVPKKD------RKVRM

Query:  CVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGLKNTGATNQRAMVTLFHDLMHKEIKVY
         +DYR +N+  L D FPLP ID ++D       FS +D  SG++QI++    ++ T+F T   ++ +  +PFGLK    + QR M   F  +   +  +Y
Subjt:  CVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGLKNTGATNQRAMVTLFHDLMHKEIKVY

Query:  IDDMIAKFRLEEKHVV-TLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKTQKEVRSFLRRLNYIERFISHLTQTC
        +DD+I      EKH++  L ++F + +++ LKL+P KC F +     LG   + +GI  D  K   I N   P      R F+   NY  RFI +     
Subjt:  IDDMIAKFRLEEKHVV-TLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKTQKEVRSFLRRLNYIERFISHLTQTC

Query:  EQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQH
          I +L  KN  + W ++CQKAF  +K  L +P +L  P   +   +     + + G VL Q+
Subjt:  EQILKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQH

P20825 Retrovirus-related Pol polyprotein from transposon 2975.4e-3427.85Show/hide
Query:  QETLKVINLGTLEEVKKVRIGTL----ASKQDQLDLVTLLHEFKDIFAWSYQDMPGLD-TDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFD
        Q T + I     E +KK+          ++++   L  LL++F+++    Y++   L  T+ ++H L       PI  K   L     I+++ +V++  +
Subjt:  QETLKVINLGTLEEVKKVRIGTL----ASKQDQLDLVTLLHEFKDIFAWSYQDMPGLD-TDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFD

Query:  AGFL----AVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYK
         G +    +      WV    P      K R+ +DYR LN  ++ D +P+P++D ++        F+ +D   G++QI+M  E   KT F T    + Y 
Subjt:  AGFL----AVVKYLIWVANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYK

Query:  VMPFGLKNTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMN
         MPFGL+N  AT QR M  +   L++K   VY+DD+I       +H+ +++ +F +L    LKL   KC F       LG IV+ +GIK +P K+K I++
Subjt:  VMPFGLKNTGATNQRAMVTLFHDLMHKEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMN

Query:  LRPPKTQKEVRSFLRRLNYIERFISHLTQTCEQILKLFCK-NEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQH
           P   KE+R+FL    Y  +FI +     + +     K  +I     +  +AF+K+K  +    IL LP   +  +L       ++G VL Q+
Subjt:  LRPPKTQKEVRSFLRRLNYIERFISHLTQTCEQILKLFCK-NEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQH

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein9.4e-3932.6Show/hide
Query:  VTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLK--LECKPIRQKLRKLKP-----EMSIKIKEEVKKQFDAGFLAVVKYLIWVANIVPVPKKDRKVRMCV
        V L  ++++I      D+P    DI  + +P+K  +E KP   +L +L+P     +   +I + V+K  D  F+   K     + +V VPKKD   R+CV
Subjt:  VTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLK--LECKPIRQKLRKLKP-----EMSIKIKEEVKKQFDAGFLAVVKYLIWVANIVPVPKKDRKVRMCV

Query:  DYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGLKNTGATNQRAMVTLFHDLMHKEIKVYID
        DYR LN+A++ D FPLP ID L+        F+ +D  SGY+QI M P+D+ KT F+T    + Y VMPFGL N  +T  R M   F DL  + + VY+D
Subjt:  DYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGLKNTGATNQRAMVTLFHDLMHKEIKVYID

Query:  DMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKTQKEVRSFLRRLNYIERFISHLTQTCEQI
        D++      E+H   L  + ERL+   L +   KC F     + LG+ +  + I     K   I +   PKT K+ + FL  +NY  RFI + ++  + I
Subjt:  DMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKTQKEVRSFLRRLNYIERFISHLTQTCEQI

Query:  LKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGK
            C      W E   KA +K+K  L +  +LV         L     +  +G VL + D+  K
Subjt:  LKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGK

Q99315 Transposon Ty3-G Gag-Pol polyprotein3.8e-4033.15Show/hide
Query:  VTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLK--LECKPIRQKLRKLKP-----EMSIKIKEEVKKQFDAGFLAVVKYLIWVANIVPVPKKDRKVRMCV
        V L  ++++I      D+P    DI  + +P+K  +E KP   +L +L+P     +   +I + V+K  D  F+   K     + +V VPKKD   R+CV
Subjt:  VTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLK--LECKPIRQKLRKLKP-----EMSIKIKEEVKKQFDAGFLAVVKYLIWVANIVPVPKKDRKVRMCV

Query:  DYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGLKNTGATNQRAMVTLFHDLMHKEIKVYID
        DYR LN+A++ D FPLP ID L+        F+ +D  SGY+QI M P+D+ KT F+T    + Y VMPFGL N  +T  R M   F DL  + + VY+D
Subjt:  DYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGLKNTGATNQRAMVTLFHDLMHKEIKVYID

Query:  DMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKTQKEVRSFLRRLNYIERFISHLTQTCEQI
        D++      E+H   L  + ERL+   L +   KC F     + LG+ +  + I     K   I +   PKT K+ + FL  +NY  RFI + ++  + I
Subjt:  DMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKTQKEVRSFLRRLNYIERFISHLTQTCEQI

Query:  LKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGK
            C      W E   KA DK+KD L +  +LV         L     +  +G VL + D+  K
Subjt:  LKLFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATATCAAGAAACTTTGAAGGTTATCAATTTGGGAACACTAGAAGAAGTGAAAAAAGTGCGAATTGGCACTTTGGCTTCGAAACAGGATCAATTAGATCTGGTGAC
CCTACTTCATGAGTTCAAAGATATATTTGCATGGTCCTACCAAGATATGCCCGGTTTGGATACAGATATTGTAAGACATCGATTACCACTCAAGCTAGAATGTAAGCCTA
TACGACAAAAGCTTCGCAAATTGAAACCTGAAATGTCGATCAAGATCAAGGAGGAAGTTAAAAAGCAGTTTGATGCTGGATTCTTAGCAGTAGTAAAATACCTAATTTGG
GTTGCAAACATTGTCCCAGTTCCAAAGAAGGACAGAAAGGTTAGAATGTGCGTAGACTATAGAGATCTTAATCGGGCAAGTCTTAAAGATAACTTTCCTCTTCCTCACAT
CGACGTGTTGGTAGATAATACTGCTGGGTTTTCCACCTTCTCATTTATGGATGGATTTTCAGGATACAACCAGATCAAAATGGCTCCAGAAGATCAAGAAAAAACGACAT
TCATCACCTTGTGGGAAACGTTTTGTTATAAAGTAATGCCTTTTGGGTTAAAAAATACAGGAGCAACTAATCAGAGAGCGATGGTTACATTATTCCATGACTTGATGCAT
AAAGAAATTAAAGTTTATATTGATGATATGATCGCCAAGTTTAGACTCGAAGAAAAGCACGTGGTTACCCTTCGTAAGCTTTTTGAGAGATTGCAAAAATTCCAATTAAA
GTTGAATCCAACAAAATGCATATTTGGAGTTTCTTCTGGGAAGTTATTGGGTTTTATTGTTAGTCGTGAAGGCATCAAGGTGGACCCAGACAAAATGAAGCCGATAATGA
ACTTAAGGCCACCAAAAACACAAAAGGAGGTTAGAAGTTTTTTAAGAAGGTTGAATTATATTGAACGATTTATTTCACACCTTACTCAAACTTGTGAGCAAATTTTAAAA
TTATTTTGCAAAAATGAGATTTATCTTTGGAATGAAGATTGCCAAAAAGCTTTTGACAAAATCAAAGATTACTTGCAAAGCCCTCATATCCTCGTCCTACCAACTCCAGG
ACGACTTTTAATCTTATACTTGATAGTGAAGGAGGGGTCAATGGGATGTGTGCTAGGACAGCATGACTCTACTGGAAAGAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCCATATCAAGAAACTTTGAAGGTTATCAATTTGGGAACACTAGAAGAAGTGAAAAAAGTGCGAATTGGCACTTTGGCTTCGAAACAGGATCAATTAGATCTGGTGAC
CCTACTTCATGAGTTCAAAGATATATTTGCATGGTCCTACCAAGATATGCCCGGTTTGGATACAGATATTGTAAGACATCGATTACCACTCAAGCTAGAATGTAAGCCTA
TACGACAAAAGCTTCGCAAATTGAAACCTGAAATGTCGATCAAGATCAAGGAGGAAGTTAAAAAGCAGTTTGATGCTGGATTCTTAGCAGTAGTAAAATACCTAATTTGG
GTTGCAAACATTGTCCCAGTTCCAAAGAAGGACAGAAAGGTTAGAATGTGCGTAGACTATAGAGATCTTAATCGGGCAAGTCTTAAAGATAACTTTCCTCTTCCTCACAT
CGACGTGTTGGTAGATAATACTGCTGGGTTTTCCACCTTCTCATTTATGGATGGATTTTCAGGATACAACCAGATCAAAATGGCTCCAGAAGATCAAGAAAAAACGACAT
TCATCACCTTGTGGGAAACGTTTTGTTATAAAGTAATGCCTTTTGGGTTAAAAAATACAGGAGCAACTAATCAGAGAGCGATGGTTACATTATTCCATGACTTGATGCAT
AAAGAAATTAAAGTTTATATTGATGATATGATCGCCAAGTTTAGACTCGAAGAAAAGCACGTGGTTACCCTTCGTAAGCTTTTTGAGAGATTGCAAAAATTCCAATTAAA
GTTGAATCCAACAAAATGCATATTTGGAGTTTCTTCTGGGAAGTTATTGGGTTTTATTGTTAGTCGTGAAGGCATCAAGGTGGACCCAGACAAAATGAAGCCGATAATGA
ACTTAAGGCCACCAAAAACACAAAAGGAGGTTAGAAGTTTTTTAAGAAGGTTGAATTATATTGAACGATTTATTTCACACCTTACTCAAACTTGTGAGCAAATTTTAAAA
TTATTTTGCAAAAATGAGATTTATCTTTGGAATGAAGATTGCCAAAAAGCTTTTGACAAAATCAAAGATTACTTGCAAAGCCCTCATATCCTCGTCCTACCAACTCCAGG
ACGACTTTTAATCTTATACTTGATAGTGAAGGAGGGGTCAATGGGATGTGTGCTAGGACAGCATGACTCTACTGGAAAGAAATAG
Protein sequenceShow/hide protein sequence
MPYQETLKVINLGTLEEVKKVRIGTLASKQDQLDLVTLLHEFKDIFAWSYQDMPGLDTDIVRHRLPLKLECKPIRQKLRKLKPEMSIKIKEEVKKQFDAGFLAVVKYLIW
VANIVPVPKKDRKVRMCVDYRDLNRASLKDNFPLPHIDVLVDNTAGFSTFSFMDGFSGYNQIKMAPEDQEKTTFITLWETFCYKVMPFGLKNTGATNQRAMVTLFHDLMH
KEIKVYIDDMIAKFRLEEKHVVTLRKLFERLQKFQLKLNPTKCIFGVSSGKLLGFIVSREGIKVDPDKMKPIMNLRPPKTQKEVRSFLRRLNYIERFISHLTQTCEQILK
LFCKNEIYLWNEDCQKAFDKIKDYLQSPHILVLPTPGRLLILYLIVKEGSMGCVLGQHDSTGKK