; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0002925 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0002925
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon 17.6
Genome locationchr05:14858124..14858648
RNA-Seq ExpressionPI0002925
SyntenyPI0002925
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008445 - D-aspartate oxidase activity (molecular function)
InterPro domainsIPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040850.1 Retrovirus-related Pol polyprotein from transposon 17.6 [Cucumis melo var. makuwa]1.3e-6170.18Show/hide
Query:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD
        MVNEGIVLG+K+S+  LEVDPAKIDVVSKLP P+++KPL  FLGHAGFYRRF+KGFSQIAKPLSNLLC +QP+ FDE++  +   LKD L SAPIL TL+
Subjt:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD

Query:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKV
        W QPFE +CD SDVA  AMLGQKK+KVIHPIYY  K     QENYTTT+KELLA+VFAIEK R+ ++ SK+
Subjt:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKV

PNY04892.1 hypothetical protein L195_g001324 [Trifolium pratense]8.3e-6167.82Show/hide
Query:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD
        MV EGIVLGHK+SS G+EVD AK++V+ KLPPP ++K +RSFLGHAGFYRRF+K FS+IAKPLSNLL  ++P+ FD+S   AF  LK+ LT+API+T  D
Subjt:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD

Query:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVTAF
        WS  FELMCDASD AVGA+LGQ+KNK  H I+YASK L D Q NY TTEKELLAIV+A+EKFRSY+IGSK+  +
Subjt:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVTAF

PNY12480.1 hypothetical protein L195_g009111 [Trifolium pratense]3.7e-6167.82Show/hide
Query:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD
        MV EGIVLGHK+SS G+EVD AK++V+ KLPPP +VK +RSFLGHAGFYRRF+K FS+IAKPLSNLL  ++P+ FD +   AF  LK+ LT+API+T  D
Subjt:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD

Query:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVTAF
        WS  FELMCDASD AVGA+LGQ+KNK+ H I+YASK L D Q NY TTEKELLAIV+A+EKFRSY+IGSK+  +
Subjt:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVTAF

XP_020990048.1 uncharacterized protein LOC110277177 [Arachis duranensis]6.4e-6166.67Show/hide
Query:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD
        MV  GIVLGHK+S+ G+EVD AK++++ KLPPP+DVK +RSFLGHAGFY+RF+K FS+IAKPLSNLL ++ P+ FDE+   AF+ LK+ L+SAPI++  D
Subjt:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD

Query:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVTAF
        W+ PFELMCDASD AVGA+LGQ+K+ ++H IYYASK L D Q NYTTTEKELLAIVFA +KFRSY+IG+KV  F
Subjt:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVTAF

XP_023521407.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785222 [Cucurbita pepo subsp. pepo]1.1e-6067.82Show/hide
Query:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD
        MVNEGIVLGHK+S  G+EVD AKI+ + +L PP  VK +RSFLGHAGFYRRF+K FS+IAKPL +LL +N+ + FD+    AFQ LK ALT+API++T D
Subjt:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD

Query:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVTAF
        W+ PFELMCDAS  AVGAMLGQKK KV+HPIYYASKTL   Q NYTTTEKELLA+VFA+EKFR+Y+ G+KV  +
Subjt:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVTAF

TrEMBL top hitse value%identityAlignment
A0A2K3NPD0 Reverse transcriptase4.0e-6167.82Show/hide
Query:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD
        MV EGIVLGHK+SS G+EVD AK++V+ KLPPP ++K +RSFLGHAGFYRRF+K FS+IAKPLSNLL  ++P+ FD+S   AF  LK+ LT+API+T  D
Subjt:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD

Query:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVTAF
        WS  FELMCDASD AVGA+LGQ+KNK  H I+YASK L D Q NY TTEKELLAIV+A+EKFRSY+IGSK+  +
Subjt:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVTAF

A0A2K3PB15 Reverse transcriptase1.8e-6167.82Show/hide
Query:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD
        MV EGIVLGHK+SS G+EVD AK++V+ KLPPP +VK +RSFLGHAGFYRRF+K FS+IAKPLSNLL  ++P+ FD +   AF  LK+ LT+API+T  D
Subjt:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD

Query:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVTAF
        WS  FELMCDASD AVGA+LGQ+KNK+ H I+YASK L D Q NY TTEKELLAIV+A+EKFRSY+IGSK+  +
Subjt:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVTAF

A0A5D3D3M4 Retrovirus-related Pol polyprotein from transposon 17.66.2e-6270.18Show/hide
Query:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD
        MVNEGIVLG+K+S+  LEVDPAKIDVVSKLP P+++KPL  FLGHAGFYRRF+KGFSQIAKPLSNLLC +QP+ FDE++  +   LKD L SAPIL TL+
Subjt:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD

Query:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKV
        W QPFE +CD SDVA  AMLGQKK+KVIHPIYY  K     QENYTTT+KELLA+VFAIEK R+ ++ SK+
Subjt:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKV

A0A6P4CQW0 uncharacterized protein LOC1074792645.3e-6166.09Show/hide
Query:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD
        MV EGIVLGHKVS+ G+EVD AK++++ KLPPP++VK +RSFLGHAGFYRRF++ FS+IAKPLSNLL ++ P+ FDE+   A++ LK  L+SAPI+   D
Subjt:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD

Query:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVTAF
        W+ PFELMCDASD+A+GA+LGQ+K+ ++H IYYASK L D Q NYTTTEKELLAIVF+ +KFRSY+IGSKV  F
Subjt:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVTAF

A0A6P5N4L4 uncharacterized protein LOC1102771773.1e-6166.67Show/hide
Query:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD
        MV  GIVLGHK+S+ G+EVD AK++++ KLPPP+DVK +RSFLGHAGFY+RF+K FS+IAKPLSNLL ++ P+ FDE+   AF+ LK+ L+SAPI++  D
Subjt:  MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLD

Query:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVTAF
        W+ PFELMCDASD AVGA+LGQ+K+ ++H IYYASK L D Q NYTTTEKELLAIVFA +KFRSY+IG+KV  F
Subjt:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVTAF

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.63.0e-2941.57Show/hide
Query:  EGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQ-FDESYHQAFQTLKDALTSAPILTTLDWS
        E   LGH ++  G++ +P KI+ + K P P   K +++FLG  G+YR+F+  F+ IAKP++  L  N      +  Y  AF+ LK  ++  PIL   D++
Subjt:  EGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQ-FDESYHQAFQTLKDALTSAPILTTLDWS

Query:  QPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIG
        + F L  DASDVA+GA+L Q      HP+ Y S+TL + + NY+T EKELLAIV+A + FR Y++G
Subjt:  QPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIG

P10394 Retrovirus-related Pol polyprotein from transposon 4128.4e-2436.26Show/hide
Query:  VNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLDW
        ++E   LGHK +  G+  D  K DV+   P P+D    R F+    +YRRF+K F+  ++ ++ L   N P+++ +   +AF  LK  L +  +L   D+
Subjt:  VNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLDW

Query:  SQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVT
        S+ F +  DAS  A GA+L Q  N    P+ YAS+     + N +TTE+EL AI +AI  FR YI G   T
Subjt:  SQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVT

P10401 Retrovirus-related Pol polyprotein from transposon gypsy2.0e-2538.29Show/hide
Query:  LGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQ-----------PYQFDESYHQAFQTLKDALTSAP-I
        LG  VS  G + DP K+  + + P P+ V  +RSFLG A +YR F+K F+ IA+P++++L               P +F+E+   AFQ L++ L S   I
Subjt:  LGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQ-----------PYQFDESYHQAFQTLKDALTSAP-I

Query:  LTTLDWSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSK
        L   D+ +PF+L  DAS   +GA+L Q+      PI   S+TL   ++NY T E+ELLAIV+A+ K ++++ GS+
Subjt:  LTTLDWSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSK

P20825 Retrovirus-related Pol polyprotein from transposon 2975.3e-2639.29Show/hide
Query:  EGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDE---SYHQAFQTLKDALTSAPILTTLD
        E   LGH V+  G++ +P K+  +   P P   K +R+FLG  G+YR+F+  ++ IAKP+++  C  +  + D     Y +AF+ LK  +   PIL   D
Subjt:  EGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDE---SYHQAFQTLKDALTSAPILTTLD

Query:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIG
        + + F L  DAS++A+GA+L Q      HPI + S+TL D + NY+  EKELLAIV+A + FR Y++G
Subjt:  WSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIG

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus2.7e-3039.88Show/hide
Query:  LGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNL---LCAN--------QPYQFDESYHQAFQTLKDALTSAPIL
        LG+ V++ G++ DP K+  +S++PPP  VK L+ FLG   +YR+F++ ++++AKPL+NL   L AN         P   DE+  Q+F  LK  L S+ IL
Subjt:  LGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNL---LCAN--------QPYQFDESYHQAFQTLKDALTSAPIL

Query:  TTLDWSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGS
            +++PF L  DAS+ A+GA+L Q       PI Y S++L   +ENY T EKE+LAI+++++  R+Y+ G+
Subjt:  TTLDWSQPFELMCDASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGS

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.3e-1440Show/hide
Query:  LGHK--VSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLDWSQPF
        LGH+  +S  G+  DPAK++ +   P P +   LR FLG  G+YRRF+K + +I +PL+ LL  N   ++ E    AF+ LK A+T+ P+L   D   PF
Subjt:  LGHK--VSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLDWSQPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTAATGAAGGAATTGTGTTGGGGCACAAGGTCTCCAGCATAGGGTTAGAAGTGGACCCCGCGAAGATTGATGTGGTTAGCAAGTTGCCACCGCCAAATGATGTCAA
ACCTTTGAGAAGCTTCTTAGGCCACGCTGGGTTCTATAGGCGATTCCTCAAGGGATTCTCTCAAATTGCGAAGCCACTCAGTAACCTACTCTGTGCTAATCAACCTTATC
AGTTTGATGAAAGCTATCATCAGGCGTTCCAGACCTTGAAAGACGCGCTGACCTCAGCGCCTATCCTCACCACTCTTGATTGGTCACAACCATTTGAACTAATGTGTGAC
GCGAGTGATGTTGCGGTAGGGGCTATGCTGGGTCAAAAGAAAAATAAAGTGATCCACCCTATATACTACGCGAGCAAGACTCTTATGGACGTTCAAGAGAACTACACTAC
TACGGAAAAGGAGCTGCTTGCGATAGTATTTGCGATAGAAAAGTTCAGGAGTTACATAATTGGCTCCAAAGTTACGGCATTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTAATGAAGGAATTGTGTTGGGGCACAAGGTCTCCAGCATAGGGTTAGAAGTGGACCCCGCGAAGATTGATGTGGTTAGCAAGTTGCCACCGCCAAATGATGTCAA
ACCTTTGAGAAGCTTCTTAGGCCACGCTGGGTTCTATAGGCGATTCCTCAAGGGATTCTCTCAAATTGCGAAGCCACTCAGTAACCTACTCTGTGCTAATCAACCTTATC
AGTTTGATGAAAGCTATCATCAGGCGTTCCAGACCTTGAAAGACGCGCTGACCTCAGCGCCTATCCTCACCACTCTTGATTGGTCACAACCATTTGAACTAATGTGTGAC
GCGAGTGATGTTGCGGTAGGGGCTATGCTGGGTCAAAAGAAAAATAAAGTGATCCACCCTATATACTACGCGAGCAAGACTCTTATGGACGTTCAAGAGAACTACACTAC
TACGGAAAAGGAGCTGCTTGCGATAGTATTTGCGATAGAAAAGTTCAGGAGTTACATAATTGGCTCCAAAGTTACGGCATTCTAA
Protein sequenceShow/hide protein sequence
MVNEGIVLGHKVSSIGLEVDPAKIDVVSKLPPPNDVKPLRSFLGHAGFYRRFLKGFSQIAKPLSNLLCANQPYQFDESYHQAFQTLKDALTSAPILTTLDWSQPFELMCD
ASDVAVGAMLGQKKNKVIHPIYYASKTLMDVQENYTTTEKELLAIVFAIEKFRSYIIGSKVTAF