; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0021953 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0021953
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr07:12607643..12617010
RNA-Seq ExpressionPI0021953
SyntenyPI0021953
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053418.1 hypothetical protein E6C27_scaffold428G00880 [Cucumis melo var. makuwa]3.6e-2069.62Show/hide
Query:  MCDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI-SHGKERRQAVVN
        M DASDVA+  MLGQKK KVIHPIYY SKTL +A+ENYTTTEK+ L VVFA+EKFRSYI+GSKVT+ S+  E R  + N
Subjt:  MCDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI-SHGKERRQAVVN

KAA0062295.1 hypothetical protein E6C27_scaffold154G00380 [Cucumis melo var. makuwa]4.7e-2080.3Show/hide
Query:  MCDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI
        MCDASDV +GAMLGQK  KVI+PIY  SKTL +AQENYTTTEKELLAVVFA+EK+ SYI+GSKVTI
Subjt:  MCDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI

XP_012832904.1 PREDICTED: uncharacterized protein LOC105953771 [Erythranthe guttata]7.9e-2078.79Show/hide
Query:  MCDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI
        MCDASD AVGA+LGQ+K+K+ H IYYASKTL DAQ NYTTTEKELLAVVFA EKFRSY+IG+KV +
Subjt:  MCDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI

XP_022152366.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 [Momordica charantia]1.2e-2077.27Show/hide
Query:  MCDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI
        MCDASD A+GAMLGQ+K+K++HP+YYASKTLT AQ NYTTTEKELLAVVFA +KFRSY+IG+KV +
Subjt:  MCDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI

XP_022158979.1 uncharacterized protein LOC111025424 [Momordica charantia]6.1e-2076.92Show/hide
Query:  CDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI
        CDASD A+GAMLGQ+K+K++HP+YYASKTLT AQ NYTTTEKELLAVVFA  KFRSY+IG+KV +
Subjt:  CDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI

TrEMBL top hitse value%identityAlignment
A0A5A7V8R9 RT_RNaseH domain-containing protein2.3e-2080.3Show/hide
Query:  MCDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI
        MCDASDV +GAMLGQK  KVI+PIY  SKTL +AQENYTTTEKELLAVVFA+EK+ SYI+GSKVTI
Subjt:  MCDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI

A0A5D3CVE7 Uncharacterized protein1.7e-2069.62Show/hide
Query:  MCDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI-SHGKERRQAVVN
        M DASDVA+  MLGQKK KVIHPIYY SKTL +A+ENYTTTEK+ L VVFA+EKFRSYI+GSKVT+ S+  E R  + N
Subjt:  MCDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI-SHGKERRQAVVN

A0A6J1DHH9 LOW QUALITY PROTEIN: uncharacterized protein LOC1110201105.9e-2177.27Show/hide
Query:  MCDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI
        MCDASD A+GAMLGQ+K+K++HP+YYASKTLT AQ NYTTTEKELLAVVFA +KFRSY+IG+KV +
Subjt:  MCDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI

A0A6J1E110 uncharacterized protein LOC1110254242.9e-2076.92Show/hide
Query:  CDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI
        CDASD A+GAMLGQ+K+K++HP+YYASKTLT AQ NYTTTEKELLAVVFA  KFRSY+IG+KV +
Subjt:  CDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI

A0A6J1E3L7 uncharacterized protein LOC1110257545.0e-2075.76Show/hide
Query:  MCDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI
        MCDASD A+GAMLGQ+K+K++HP+YYASKTLT AQ NYTTTEKELLAVVFA +K RSY+IG+KV +
Subjt:  MCDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.2e-0950.77Show/hide
Query:  DASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTIS
        DASDVA+GA+L Q      HP+ Y S+TL + + NY+T EKELLA+V+A + FR Y++G    IS
Subjt:  DASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTIS

P10394 Retrovirus-related Pol polyprotein from transposon 4121.5e-0542.19Show/hide
Query:  DASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI
        DAS  A GA+L Q  +    P+ YAS+  T  + N +TTE+EL A+ +A+  FR YI G   T+
Subjt:  DASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTI

P20825 Retrovirus-related Pol polyprotein from transposon 2974.2e-0846.15Show/hide
Query:  DASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTIS
        DAS++A+GA+L Q      HPI + S+TL D + NY+  EKELLA+V+A + FR Y++G +  I+
Subjt:  DASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTIS

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus3.6e-0741.67Show/hide
Query:  DASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGS
        DAS+ A+GA+L Q       PI Y S++L   +ENY T EKE+LA++++++  R+Y+ G+
Subjt:  DASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGS

Q9UR07 Transposon Tf2-11 polyprotein2.0e-0543.1Show/hide
Query:  DASDVAVGAMLGQK-KSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYI
        DASDVAVGA+L QK      +P+ Y S  ++ AQ NY+ ++KE+LA++ +++ +R Y+
Subjt:  DASDVAVGAMLGQK-KSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYI

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTGACGCAAGTGATGTTGCGGTAGGGGCTATGCTGGGGCAAAAGAAAAGCAAAGTGATCCATCCTATATATTACGCGAGCAAGACTCTTACGGACGCTCAAGAAAA
CTACACTACTACAGAAAAGGAACTGCTCGCGGTAGTATTTGCGGTAGAAAAGTTCAGGAGTTATATAATTGGCTCCAAAGTTACGATATCTCATGGCAAAGAAAGACGTC
AAGCCGTGGTTAATCATTTGGGTATTATTGCTCCAAGAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGTGACGCAAGTGATGTTGCGGTAGGGGCTATGCTGGGGCAAAAGAAAAGCAAAGTGATCCATCCTATATATTACGCGAGCAAGACTCTTACGGACGCTCAAGAAAA
CTACACTACTACAGAAAAGGAACTGCTCGCGGTAGTATTTGCGGTAGAAAAGTTCAGGAGTTATATAATTGGCTCCAAAGTTACGATATCTCATGGCAAAGAAAGACGTC
AAGCCGTGGTTAATCATTTGGGTATTATTGCTCCAAGAATTTGA
Protein sequenceShow/hide protein sequence
MCDASDVAVGAMLGQKKSKVIHPIYYASKTLTDAQENYTTTEKELLAVVFAVEKFRSYIIGSKVTISHGKERRQAVVNHLGIIAPRI