; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0028366 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0028366
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr04:22153912..22155438
RNA-Seq ExpressionPI0028366
SyntenyPI0028366
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
WP_217833177.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]2.8e-6947.97Show/hide
Query:  FMSDSEQP-FELDPEIKRTFQGNRRRARQRQIRR-MENNRNA-----PPPQADPEPNA------------AYIAHDLDRPIRSYAAPNLYNFNPGIAYHV
        +MS+ + P F++DPEI+RTF+   R+ +QR+  + +E N +A       PQA    NA              +AHD +RP+R YA+PNLYNF PGI    
Subjt:  FMSDSEQP-FELDPEIKRTFQGNRRRARQRQIRR-MENNRNA-----PPPQADPEPNA------------AYIAHDLDRPIRSYAAPNLYNFNPGIAYHV

Query:  FGENARFEIKPVMFQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKKWANALEDGEVGTWDQLIEKFMKNFFPPHKN
        F  N RFE+KPVM QM+Q AGQFGG  GEDPH H++SF  IC++F M+G+  + +R  LFP +LRDEA++WA + E GE+ TW +++EKFM+ +FPP  +
Subjt:  FGENARFEIKPVMFQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKKWANALEDGEVGTWDQLIEKFMKNFFPPHKN

Query:  ARRRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLKSTY
        A+RR+++++F+Q+D E   +AW+RFKR+V+ CPHNGIP C+ ME+FY GLNK +Q  ADA    G++  TY
Subjt:  ARRRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLKSTY

XP_017216983.1 PREDICTED: uncharacterized protein LOC108194534 [Daucus carota subsp. sativus]2.3e-5246.15Show/hide
Query:  FELDPEIKRTFQGNRRRARQRQIRRMENNRNAPPPQAD--PEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMFQMIQNAGQFGG
        F  DP I+RTF  NRRR  QR+I++ +          D    P  A+I  D DR IR YAAP     N GI      +  +FE+KPVMFQM+Q  GQF G
Subjt:  FELDPEIKRTFQGNRRRARQRQIRRMENNRNAPPPQAD--PEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMFQMIQNAGQFGG

Query:  HPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELMSFQQRDRENLHDAWSRF
         P EDPH H+R F  I  SF   G++ + LR  LFP  +RD A+ W N+L  G V  W+ L EKF+  +FPP+ NA+ R E+ SFQQ+D E+L+DAW RF
Subjt:  HPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELMSFQQRDRENLHDAWSRF

Query:  KRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLKSTYN
        K +++ CPH+GI  CI ME FY GLN  T+   DA     +L  +YN
Subjt:  KRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLKSTYN

XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]6.6e-5547.2Show/hide
Query:  FELDPEIKRTFQGNRRRARQRQIRRM-----ENNRNAPPPQADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMFQMIQNAGQ
        F  DPEI+RTF  NRRR  QR+I++      +N  N   P     P  A+I  D DR IR YAAP     N GI      +  +FE+KPVMFQM+Q  GQ
Subjt:  FELDPEIKRTFQGNRRRARQRQIRRM-----ENNRNAPPPQADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMFQMIQNAGQ

Query:  FGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELMSFQQRDRENLHDAW
        F G P EDPH H+R F  I  SF   G+  + LR  LFP ++RD A+ W N+L  G V TW+ L EKF+  +FPP+ NA+ R E+ SFQQ+D E+L+DAW
Subjt:  FGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELMSFQQRDRENLHDAW

Query:  SRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLKSTYN
         RFK +++ CPH+GI  CI ME FY GLN  T+   DA     +L  +YN
Subjt:  SRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLKSTYN

XP_030508936.1 uncharacterized protein LOC115723589 [Cannabis sativa]4.3e-5445.88Show/hide
Query:  MSDSEQPFEL---DPEIKRTFQGNRRRARQRQIRRMENNRNAPPPQADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMFQMI
        M++ E+  EL   DPEI+RTF+  +RR  Q+  +R              E N   +A D  R IR YAAP     NPGI      +   FE+KPVMFQM+
Subjt:  MSDSEQPFEL---DPEIKRTFQGNRRRARQRQIRRMENNRNAPPPQADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMFQMI

Query:  QNAGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELMSFQQRDREN
        Q  GQFGG P EDPH HIRSF  +  SF + G+S E LR  LFP +LRD A+ W N L    V  W+ L EKF++ +FPP +NA+ R E+MSFQQ + E 
Subjt:  QNAGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELMSFQQRDREN

Query:  LHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLKSTYN
          DAW RFK +++ CPH+GIP CI +E FY GLN A +   DA     +L  +YN
Subjt:  LHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLKSTYN

XP_038887458.1 uncharacterized protein LOC120077591 [Benincasa hispida]1.4e-6048.82Show/hide
Query:  SEQPFELDPEIKRTF--QGNRRRARQRQIRRMENNRNAPPPQAD----PEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMFQMI
        ++  FE +PEI  TF  + ++ RA +R+I   +NN N  P        P  +  ++A D + PIR+YAAPNLY+F+PGI+  +  ENARFEIKPVM QMI
Subjt:  SEQPFELDPEIKRTF--QGNRRRARQRQIRRMENNRNAPPPQAD----PEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMFQMI

Query:  QNAGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELMSFQQRDREN
        QN  QF     E+PH H+  F  +C++F + GI+P  +R  LFP TLRD+AK+WA++LE  E+ + DQL+E FMK FFPP  N RRRK +++F++ D E 
Subjt:  QNAGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELMSFQQRDREN

Query:  LHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLKSTY
        L  AW RF+R+VK CPH GI  C+LME+FY GLN++TQ  ADA  V+  +  TY
Subjt:  LHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLKSTY

TrEMBL top hitse value%identityAlignment
A0A6J1EEI2 uncharacterized protein LOC1114333942.6e-4941.47Show/hide
Query:  RENKFMSDSE-QPFELDPEIKRTFQGNRRRARQRQIRRMENNRNAPPPQADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMF
        +E K M++   Q  EL  ++ R F+     A Q +I                  NA ++A D +R IR+YA P +   NP I      +   FE+KPVMF
Subjt:  RENKFMSDSE-QPFELDPEIKRTFQGNRRRARQRQIRRMENNRNAPPPQADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMF

Query:  QMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELMSFQQRD
        QM+Q  GQF G P EDPH H++SF  +  SF    +  + +R +LFP +LRD AK W N L  G + +W+ L+EKF+  +FPP +NAR R E++ FQQ +
Subjt:  QMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELMSFQQRD

Query:  RENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLKSTYN
         + L +AW RFK M++ CPH+G+P CI ME FY GLN AT+Q  DA     +L  TYN
Subjt:  RENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLKSTYN

A0A6J1EQ90 uncharacterized protein LOC1114364111.0e-4540.91Show/hide
Query:  FELDPEIKRTFQ---GNRRRARQRQIRRME----NNRNAPPP-----QADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMFQ
        F LDPEI+RTF+     +++  ++ I+++E     NR    P     Q     N  ++A D +R IR+YA P +   NP I      +   FE+KPVMFQ
Subjt:  FELDPEIKRTFQ---GNRRRARQRQIRRME----NNRNAPPP-----QADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMFQ

Query:  MIQNAGQFGGHPGEDPHEHIRSFYSI-------CASFHMSGISPEELRFALFPLTLRDEAKKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELM
        M+Q  GQF G P EDPH H++SF  +         SF   G+  + +R +LFP  LRD AK W N L  G + +W+ L E F+  +FPP +NAR + E++
Subjt:  MIQNAGQFGGHPGEDPHEHIRSFYSI-------CASFHMSGISPEELRFALFPLTLRDEAKKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELM

Query:  SFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLKSTYN
        +FQQ + E L +A  RFK M++ CPH+G+P CI ME FY GLN  T+Q  DA     +L  TYN
Subjt:  SFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLKSTYN

A0A6J1G7Q6 uncharacterized protein LOC1114515981.3e-4544.12Show/hide
Query:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMFQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEA
        NA ++A D +R IR+YA P +   NP I      +   FE+KPVMFQM+Q  GQF G   +DPH H++SF  +  SF   G+  + +R + F  +LRD A
Subjt:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMFQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEA

Query:  KKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLK
        K W N L  G + +W+ L EKF+  +FPP ++AR R E+++FQ+ + E L +AW RFK  ++ CPH+G+P CI +E FY GLN AT+Q  DA     +L 
Subjt:  KKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLK

Query:  STYN
         TYN
Subjt:  STYN

A0A6J1H7E4 uncharacterized protein LOC1114611689.9e-4944.74Show/hide
Query:  RQIRRMENNRNAPPPQADPE---PNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMFQMIQNAGQFGGHPGEDPHEHIRSFYSICAS
        +Q+ ++      P   A+ E    NA  +A D +R IR+YA P +   NP I      +   FE+KPVMFQM+Q  GQF G P EDPH H++SF  +  S
Subjt:  RQIRRMENNRNAPPPQADPE---PNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMFQMIQNAGQFGGHPGEDPHEHIRSFYSICAS

Query:  FHMSGISPEELRFALFPLTLRDEAKKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILME
        F   G+  + +R +LFP +LRD AK W N L    + +W+ L EKF+  +FPP +NAR R E+++FQQ + E L +AW RFK M++ CPH+G+P CI ME
Subjt:  FHMSGISPEELRFALFPLTLRDEAKKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILME

Query:  VFYFGLNKATQQTADAMFVDGMLKSTYN
         FY GLN AT+Q  DA     ML  TYN
Subjt:  VFYFGLNKATQQTADAMFVDGMLKSTYN

U5CUI2 Retrotrans_gag domain-containing protein2.1e-5149.02Show/hide
Query:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMFQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEA
        N   +A D  R IR YAAP     NPGI      +  +FE+KPVMFQM+Q  GQF G P EDPH H+RSF  +  SF + G+S E LR  LFP +LRD A
Subjt:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYHVFGENARFEIKPVMFQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEA

Query:  KKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLK
        + W N L    V  W+ L EKF++ +FPP +NA+ R E+MSFQQ + E+  DAW RFK +++ CPH+GIP CI ME FY GLN A++   DA     +L 
Subjt:  KKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLK

Query:  STYN
         +YN
Subjt:  STYN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGAAATATGTGGCGGAAGCACCGAGAGGCGCTAATCTTTTTGCAGAAACTCAGTTTCAGTTGAATCGCACGTTCAAACTCGAAGAACGTGAGAACAAGTTTATGAG
TGACAGCGAACAGCCATTCGAACTTGACCCTGAGATTAAGCGAACATTTCAGGGTAATCGGCGAAGAGCAAGGCAAAGACAAATTCGTAGAATGGAAAATAACAGAAATG
CTCCTCCGCCGCAAGCTGACCCAGAACCCAATGCCGCCTATATCGCACATGACTTGGACAGGCCAATTAGATCTTATGCGGCACCAAACCTTTATAACTTCAATCCAGGA
ATCGCCTACCATGTATTTGGCGAGAACGCCAGGTTTGAAATCAAACCTGTTATGTTTCAGATGATTCAGAACGCCGGACAATTCGGCGGACATCCTGGGGAAGATCCACA
CGAGCATATAAGGAGTTTCTACTCCATCTGCGCTTCTTTCCACATGTCAGGCATCTCACCTGAAGAATTAAGATTTGCCCTCTTCCCGTTAACTCTGAGGGATGAGGCGA
AGAAGTGGGCAAATGCCCTGGAAGATGGCGAGGTGGGAACATGGGATCAATTAATAGAAAAATTTATGAAGAATTTCTTCCCACCTCATAAAAATGCAAGAAGAAGGAAG
GAGCTTATGAGCTTCCAGCAGAGGGATAGAGAAAACCTACATGATGCGTGGAGTAGGTTTAAAAGGATGGTCAAAGCATGCCCCCACAATGGCATTCCTAAATGCATATT
AATGGAGGTGTTCTATTTTGGACTGAACAAGGCAACACAGCAGACTGCTGATGCTATGTTTGTAGACGGTATGTTGAAAAGTACATACAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGACGAAATATGTGGCGGAAGCACCGAGAGGCGCTAATCTTTTTGCAGAAACTCAGTTTCAGTTGAATCGCACGTTCAAACTCGAAGAACGTGAGAACAAGTTTATGAG
TGACAGCGAACAGCCATTCGAACTTGACCCTGAGATTAAGCGAACATTTCAGGGTAATCGGCGAAGAGCAAGGCAAAGACAAATTCGTAGAATGGAAAATAACAGAAATG
CTCCTCCGCCGCAAGCTGACCCAGAACCCAATGCCGCCTATATCGCACATGACTTGGACAGGCCAATTAGATCTTATGCGGCACCAAACCTTTATAACTTCAATCCAGGA
ATCGCCTACCATGTATTTGGCGAGAACGCCAGGTTTGAAATCAAACCTGTTATGTTTCAGATGATTCAGAACGCCGGACAATTCGGCGGACATCCTGGGGAAGATCCACA
CGAGCATATAAGGAGTTTCTACTCCATCTGCGCTTCTTTCCACATGTCAGGCATCTCACCTGAAGAATTAAGATTTGCCCTCTTCCCGTTAACTCTGAGGGATGAGGCGA
AGAAGTGGGCAAATGCCCTGGAAGATGGCGAGGTGGGAACATGGGATCAATTAATAGAAAAATTTATGAAGAATTTCTTCCCACCTCATAAAAATGCAAGAAGAAGGAAG
GAGCTTATGAGCTTCCAGCAGAGGGATAGAGAAAACCTACATGATGCGTGGAGTAGGTTTAAAAGGATGGTCAAAGCATGCCCCCACAATGGCATTCCTAAATGCATATT
AATGGAGGTGTTCTATTTTGGACTGAACAAGGCAACACAGCAGACTGCTGATGCTATGTTTGTAGACGGTATGTTGAAAAGTACATACAACTAG
Protein sequenceShow/hide protein sequence
MTKYVAEAPRGANLFAETQFQLNRTFKLEERENKFMSDSEQPFELDPEIKRTFQGNRRRARQRQIRRMENNRNAPPPQADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPG
IAYHVFGENARFEIKPVMFQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKKWANALEDGEVGTWDQLIEKFMKNFFPPHKNARRRK
ELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAMFVDGMLKSTYN