CuGenDBv2

IVF0009698 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0009698
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr12:22490596..22491348
RNA-Seq Expression: IVF0009698
Synteny: IVF0009698
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0055390.1 uncharacterized protein E6C27_scaffold80G002350 [Cucumis melo var. makuwa] | e-value: 9.86e-137 | identity: 96.1%
Query:  MPEIRGIMPPRISHWSRSRKGHETTAPPDPEEQQRQRGNDTEIATSLKQRNDLSSSGSSGICFPEPDLKIIPTSGHQTKQPPNPDAKIQTPTKTTSPPPS
        MPEIRGIMPPRISHWSRSRKGHETTAPPDPEEQQRQRGNDTEIATSLKQRNDLSSSGSSGICFPEPDLKIIPTSGHQTKQPPNPDAKIQTPTKTTSPPPS
Subjt:  MPEIRGIMPPRISHWSRSRKGHETTAPPDPEEQQRQRGNDTEIATSLKQRNDLSSSGSSGICFPEPDLKIIPTSGHQTKQPPNPDAKIQTPTKTTSPPPS

Query:  RSNLKAGLNLTKHVGSPNGSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLV
        RSNLKAGLNLTKHVGSPNGSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREK++  + 
Subjt:  RSNLKAGLNLTKHVGSPNGSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLV

Query:  GLNDS
          NDS
Subjt:  GLNDS

TYJ99320.1 hypothetical protein E5676_scaffold248G005340 [Cucumis melo var. makuwa] | e-value: 2.80e-176 | identity: 99.6%
Query:  MPEIRGIMPPRISHWSRSRKGHETTAPPDPEEQQRQRGNDTEIATSLKQRNDLSSSGSSGICFPEPDLKIIPTSGHQTKQPPNPDAKIQTPTKTTSPPPS
        MPEIRGIMPPRISHWSRSRKGHETTAPPDPEEQQRQRGNDTEIATSLKQRNDLSSSGSSGICFPEPDLKIIPTSGHQTKQPPNPD KIQTPTKTTSPPPS
Subjt:  MPEIRGIMPPRISHWSRSRKGHETTAPPDPEEQQRQRGNDTEIATSLKQRNDLSSSGSSGICFPEPDLKIIPTSGHQTKQPPNPDAKIQTPTKTTSPPPS

Query:  RSNLKAGLNLTKHVGSPNGSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLV
        RSNLKAGLNLTKHVGSPNGSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLV
Subjt:  RSNLKAGLNLTKHVGSPNGSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLV

Query:  GLNDSYSSTCSKILFMRPFPTVDTAFSVIIRKENRRKLVALSQYNDEEND
        GLNDSYSSTCSKILFMRPFPTVDTAFSVIIRKENRRKLVALSQYNDEEND
Subjt:  GLNDSYSSTCSKILFMRPFPTVDTAFSVIIRKENRRKLVALSQYNDEEND

XP_038895285.1 GATA zinc finger domain-containing protein 11-like isoform X1 [Benincasa hispida] | e-value: 4.38e-33 | identity: 57.85%
Query:  GSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLVGLNDSYSSTCSKILFMRP
        GS  ++  SSQSN   +  +IY+EIAFHRQ NSSITSYFTKL+ LWD+LA    DL       A   LSE+MEREKV+QFLVGLNDSYS  C++IL   P
Subjt:  GSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLVGLNDSYSSTCSKILFMRP

Query:  FPTVDTAFSVIIRKENRRKLV
        FPT++ A+S +IR+E  R+LV
Subjt:  FPTVDTAFSVIIRKENRRKLV

XP_038895286.1 hybrid signal transduction histidine kinase L-like isoform X2 [Benincasa hispida] | e-value: 4.08e-33 | identity: 57.85%
Query:  GSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLVGLNDSYSSTCSKILFMRP
        GS  ++  SSQSN   +  +IY+EIAFHRQ NSSITSYFTKL+ LWD+LA    DL       A   LSE+MEREKV+QFLVGLNDSYS  C++IL   P
Subjt:  GSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLVGLNDSYSSTCSKILFMRP

Query:  FPTVDTAFSVIIRKENRRKLV
        FPT++ A+S +IR+E  R+LV
Subjt:  FPTVDTAFSVIIRKENRRKLV

XP_038895287.1 hybrid signal transduction histidine kinase L-like isoform X3 [Benincasa hispida] | e-value: 3.93e-33 | identity: 57.85%
Query:  GSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLVGLNDSYSSTCSKILFMRP
        GS  ++  SSQSN   +  +IY+EIAFHRQ NSSITSYFTKL+ LWD+LA    DL       A   LSE+MEREKV+QFLVGLNDSYS  C++IL   P
Subjt:  GSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLVGLNDSYSSTCSKILFMRP

Query:  FPTVDTAFSVIIRKENRRKLV
        FPT++ A+S +IR+E  R+LV
Subjt:  FPTVDTAFSVIIRKENRRKLV

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7UJM8 Uncharacterized protein | e-value: 3.5e-106 | identity: 96.1%
Query:  MPEIRGIMPPRISHWSRSRKGHETTAPPDPEEQQRQRGNDTEIATSLKQRNDLSSSGSSGICFPEPDLKIIPTSGHQTKQPPNPDAKIQTPTKTTSPPPS
        MPEIRGIMPPRISHWSRSRKGHETTAPPDPEEQQRQRGNDTEIATSLKQRNDLSSSGSSGICFPEPDLKIIPTSGHQTKQPPNPDAKIQTPTKTTSPPPS
Subjt:  MPEIRGIMPPRISHWSRSRKGHETTAPPDPEEQQRQRGNDTEIATSLKQRNDLSSSGSSGICFPEPDLKIIPTSGHQTKQPPNPDAKIQTPTKTTSPPPS

Query:  RSNLKAGLNLTKHVGSPNGSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLV
        RSNLKAGLNLTKHVGSPNGSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREK++  + 
Subjt:  RSNLKAGLNLTKHVGSPNGSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLV

Query:  GLNDS
          NDS
Subjt:  GLNDS

A0A5D3BHR0 Uncharacterized protein | e-value: 1.9e-136 | identity: 99.6%
Query:  MPEIRGIMPPRISHWSRSRKGHETTAPPDPEEQQRQRGNDTEIATSLKQRNDLSSSGSSGICFPEPDLKIIPTSGHQTKQPPNPDAKIQTPTKTTSPPPS
        MPEIRGIMPPRISHWSRSRKGHETTAPPDPEEQQRQRGNDTEIATSLKQRNDLSSSGSSGICFPEPDLKIIPTSGHQTKQPPNPD KIQTPTKTTSPPPS
Subjt:  MPEIRGIMPPRISHWSRSRKGHETTAPPDPEEQQRQRGNDTEIATSLKQRNDLSSSGSSGICFPEPDLKIIPTSGHQTKQPPNPDAKIQTPTKTTSPPPS

Query:  RSNLKAGLNLTKHVGSPNGSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLV
        RSNLKAGLNLTKHVGSPNGSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLV
Subjt:  RSNLKAGLNLTKHVGSPNGSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLV

Query:  GLNDSYSSTCSKILFMRPFPTVDTAFSVIIRKENRRKLVALSQYNDEEND
        GLNDSYSSTCSKILFMRPFPTVDTAFSVIIRKENRRKLVALSQYNDEEND
Subjt:  GLNDSYSSTCSKILFMRPFPTVDTAFSVIIRKENRRKLVALSQYNDEEND

A0A6J1C6T8 uncharacterized protein LOC111008934 isoform X2 | e-value: 1.2e-26 | identity: 62.39%
Query:  SSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLVGLNDSYSSTCSKILFMRPFPTVDTAF
        SSQSN P I  +IY++IA HRQGNSSITSYFT+LK LWD+L + YNDL+   SS       EH+EREKV+QFLVGLND YS+ C +IL +RPFPTV+ A+
Subjt:  SSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLVGLNDSYSSTCSKILFMRPFPTVDTAF

Query:  SVIIRKENR
        S++IR+E R
Subjt:  SVIIRKENR

A0A6J1C6U3 uncharacterized protein LOC111008934 isoform X1 | e-value: 1.2e-26 | identity: 62.39%
Query:  SSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLVGLNDSYSSTCSKILFMRPFPTVDTAF
        SSQSN P I  +IY++IA HRQGNSSITSYFT+LK LWD+L + YNDL+   SS       EH+EREKV+QFLVGLND YS+ C +IL +RPFPTV+ A+
Subjt:  SSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLVGLNDSYSSTCSKILFMRPFPTVDTAF

Query:  SVIIRKENR
        S++IR+E R
Subjt:  SVIIRKENR

A0A6J1GTG4 serine/arginine repetitive matrix protein 1-like | e-value: 2.3e-25 | identity: 56.78%
Query:  KQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLVGLNDSYSSTCSKILFMRPFPT
        K++  SSQ N    + +IY+EIA H QGNSSITSY TKLK LWD+L +AY D  P+ S  +    SE +EREKV+QFL+GLNDSYS+ C++IL M+PFPT
Subjt:  KQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLVGLNDSYSSTCSKILFMRPFPT

Query:  VDTAFSVIIRKENRRKLV
        V+ A   I+R+E RR+LV
Subjt:  VDTAFSVIIRKENRRKLV

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | e-value: 8.1e-07 | identity: 29.36%
Query:  LHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAIN-----MLSEHMEREKVIQFLVG--LNDSYSSTCSKILFMRPFPTVDTAFSV
        ++++ R +A  RQG  S+  YF KL ++W +L++ Y  + P+      N        E  E+E+  +FL+G  LN  + +  +KI+F +P P++  AF++
Subjt:  LHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAIN-----MLSEHMEREKVIQFLVG--LNDSYSSTCSKILFMRPFPTVDTAFSV

Query:  IIRKENRRK
        +   E+  K
Subjt:  IIRKENRRK


Sequences
CDS sequence
ATGCCGGAAATAAGAGGAATTATGCCTCCCAGAATATCCCACTGGTCTAGAAGCCGAAAGGGACATGAGACGACAGCACCACCCGATCCGGAGGAGCAACAACGTCAAAG
AGGAAACGACACCGAAATAGCAACCAGCTTAAAACAAAGAAATGACCTTAGTTCTTCAGGTTCTTCAGGTATCTGCTTCCCAGAACCCGATTTAAAGATCATCCCAACCT
CAGGCCATCAAACAAAACAACCCCCGAATCCAGACGCTAAAATTCAAACCCCCACAAAAACAACTTCACCTCCGCCTTCAAGAAGCAATTTAAAAGCTGGGTTGAACTTA
ACTAAACATGTTGGTTCTCCAAATGGTTCTCCCAAACAAAAATTTGGTTCTTCTCAAAGCAACGCTCCAATAATATTACATAAAATTTACAGGGAAATTGCATTTCATCG
TCAAGGTAACTCATCTATTACATCTTACTTCACAAAGCTCAAGGAATTATGGGATCAACTTGCAAAAGCCTACAATGATTTGGCGCCTCAATATTCATCTGATGCAATTA
ATATGCTGAGTGAGCATATGGAAAGGGAAAAGGTAATCCAATTTCTTGTTGGATTAAATGATTCTTATTCCTCCACCTGTTCTAAAATACTTTTTATGAGGCCATTTCCT
ACCGTGGATACGGCTTTTTCTGTAATAATTCGTAAAGAGAACCGTAGGAAATTAGTAGCTTTGTCACAGTATAATGATGAGGAAAATGATTAG
mRNA sequence
ATGCCGGAAATAAGAGGAATTATGCCTCCCAGAATATCCCACTGGTCTAGAAGCCGAAAGGGACATGAGACGACAGCACCACCCGATCCGGAGGAGCAACAACGTCAAAG
AGGAAACGACACCGAAATAGCAACCAGCTTAAAACAAAGAAATGACCTTAGTTCTTCAGGTTCTTCAGGTATCTGCTTCCCAGAACCCGATTTAAAGATCATCCCAACCT
CAGGCCATCAAACAAAACAACCCCCGAATCCAGACGCTAAAATTCAAACCCCCACAAAAACAACTTCACCTCCGCCTTCAAGAAGCAATTTAAAAGCTGGGTTGAACTTA
ACTAAACATGTTGGTTCTCCAAATGGTTCTCCCAAACAAAAATTTGGTTCTTCTCAAAGCAACGCTCCAATAATATTACATAAAATTTACAGGGAAATTGCATTTCATCG
TCAAGGTAACTCATCTATTACATCTTACTTCACAAAGCTCAAGGAATTATGGGATCAACTTGCAAAAGCCTACAATGATTTGGCGCCTCAATATTCATCTGATGCAATTA
ATATGCTGAGTGAGCATATGGAAAGGGAAAAGGTAATCCAATTTCTTGTTGGATTAAATGATTCTTATTCCTCCACCTGTTCTAAAATACTTTTTATGAGGCCATTTCCT
ACCGTGGATACGGCTTTTTCTGTAATAATTCGTAAAGAGAACCGTAGGAAATTAGTAGCTTTGTCACAGTATAATGATGAGGAAAATGATTAG
Protein sequence
MPEIRGIMPPRISHWSRSRKGHETTAPPDPEEQQRQRGNDTEIATSLKQRNDLSSSGSSGICFPEPDLKIIPTSGHQTKQPPNPDAKIQTPTKTTSPPPSRSNLKAGLNL
TKHVGSPNGSPKQKFGSSQSNAPIILHKIYREIAFHRQGNSSITSYFTKLKELWDQLAKAYNDLAPQYSSDAINMLSEHMEREKVIQFLVGLNDSYSSTCSKILFMRPFP
TVDTAFSVIIRKENRRKLVALSQYNDEEND
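
The genome location, CDS, and protein sequence listed in this record can be cross-checked against one another. The Python sketch below is one way to do that, not part of CuGenDB itself. It assumes the location string uses 1-based inclusive coordinates and that the CDS ends with a single stop codon. The helper names `span_length` and `cds_length_from_protein` are hypothetical.

```python
import re


def span_length(location: str) -> int:
    """Length in bp of a 'chrom:start..end' span (assumed 1-based, inclusive)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start, end = int(match.group(2)), int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return end - start + 1


def cds_length_from_protein(n_residues: int) -> int:
    """Expected CDS length in nt: three per residue plus one stop codon (assumed)."""
    return 3 * n_residues + 3


if __name__ == "__main__":
    # Values taken from this record: the span on chr12 and the 250-residue protein.
    genomic = span_length("chr12:22490596..22491348")
    expected = cds_length_from_protein(250)
    print(genomic, expected, genomic == expected)
```

If the two numbers agree, as they do for the values in this record, the gene model is consistent with a single uninterrupted exon, which would also explain why the CDS and mRNA sequences listed here are identical.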