; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0029049 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0029049
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr04:22507917..22508696
RNA-Seq ExpressionPI0029049
SyntenyPI0029049
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
WP_217833177.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]6.5e-7047.08Show/hide
Query:  MSDGEQPQFELDPEIERTFRRNRQRARQRQARRMENNNRNA-----------------PPPHAALEPNAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVF
        MS+G+ P F++DPEIERTFRR  ++ +QR++ +    N +A                    HA  + N   +AHD +RP+R YA+PNLYNF PGI  P F
Subjt:  MSDGEQPQFELDPEIERTFRRNRQRARQRQARRMENNNRNA-----------------PPPHAALEPNAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVF

Query:  GENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRNFYSICASFHMPGISPEELRFALFLLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPSHKNA
          N RFE+KPVMLQM+Q  GQF G  GEDPH H+++F  IC++F M G+  + +R  LF  +LRDEA++WA + E GE+ TW +++EKFM+K+FP   +A
Subjt:  GENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRNFYSICASFHMPGISPEELRFALFLLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPSHKNA

Query:  RRRKELMGFQQKDRENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLKSSYNQIK
        +RR++++ F+QKD E   +AW+RFKR+V+  PHNGIP C+ ME+FY GLNK +Q  ADA   G ++  +Y Q K
Subjt:  RRRKELMGFQQKDRENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLKSSYNQIK

XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]6.6e-5445.95Show/hide
Query:  MSDGEQPQFELDPEIERTFRRNRQRARQRQARR----MENNNRNAPPPHAALEPNAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVML
        MS      F  DPEIERTF  NR+R  QR+ ++    M++N  N   P   + P  A++  D DR IR YAAP     N GI  P   +  +FE+KPVM 
Subjt:  MSDGEQPQFELDPEIERTFRRNRQRARQRQARR----MENNNRNAPPPHAALEPNAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVML

Query:  QMIQNVGQFDGHPGEDPHEHIRNFYSICASFHMPGISPEELRFALFLLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKELMGFQQKD
        QM+Q +GQF G P EDPH H+R F  I  SF   G+  + LR  LF  ++RD A+ W N+L  G V TW+ L EKF+ K+FP + NA+ R E+  FQQ+D
Subjt:  QMIQNVGQFDGHPGEDPHEHIRNFYSICASFHMPGISPEELRFALFLLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKELMGFQQKD

Query:  RENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLKSSYNQ
         E+L+DAW RFK +++  PH+GI  CI ME FY GLN  T+   DA   G++L  SYNQ
Subjt:  RENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLKSSYNQ

XP_017239618.1 PREDICTED: uncharacterized protein LOC108212402 [Daucus carota subsp. sativus]6.8e-5145.17Show/hide
Query:  MSDGEQPQFELDPEIERTFRRNRQRARQRQARR----MENNNRNAPPPHAALEPNAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVML
        MS+     F  DPEIERTF  NR+R  QR+ ++    M +N  N   P   + P  A++  D DR IR YAAP     N GI  P   +  +FE+KPVM 
Subjt:  MSDGEQPQFELDPEIERTFRRNRQRARQRQARR----MENNNRNAPPPHAALEPNAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVML

Query:  QMIQNVGQFDGHPGEDPHEHIRNFYSICASFHMPGISPEELRFALFLLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKELMGFQQKD
        QM+Q +GQF G P EDPH H+R F  I  SF   G+  + LR  LF  ++RD A+ W N+L  G V TW+ L EKF+ K+FP + NA+   E+  FQQ+D
Subjt:  QMIQNVGQFDGHPGEDPHEHIRNFYSICASFHMPGISPEELRFALFLLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKELMGFQQKD

Query:  RENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLKSSYNQ
         E+L+DAW RFK +++  PH+GI   I ME FY GLN  T+   DA   G++L  SYNQ
Subjt:  RENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLKSSYNQ

XP_030508936.1 uncharacterized protein LOC115723589 [Cannabis sativa]2.8e-5246.12Show/hide
Query:  LDPEIERTFRRNRQRARQRQARRMENNNRNAPPPHAALEPNAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFDGHPG
        +DPEIERTF   RQR ++++A++  N            E N   +A D  R IR YAAP     NPGI  P   +   FE+KPVM QM+Q VGQF G P 
Subjt:  LDPEIERTFRRNRQRARQRQARRMENNNRNAPPPHAALEPNAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFDGHPG

Query:  EDPHEHIRNFYSICASFHMPGISPEELRFALFLLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKELMGFQQKDRENLHDAWSRFKRM
        EDPH HIR+F  +  SF + G+S E LR  LF  +LRD A+ W N L    V  W+ L EKF++K+FP  +NA+ R E+M FQQ + E   DAW RFK +
Subjt:  EDPHEHIRNFYSICASFHMPGISPEELRFALFLLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKELMGFQQKDRENLHDAWSRFKRM

Query:  VKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLKSSYNQ
        ++  PH+GIP CI +E FY GLN A +   DA   G++L  SYN+
Subjt:  VKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLKSSYNQ

XP_038887458.1 uncharacterized protein LOC120077591 [Benincasa hispida]2.3e-6248.85Show/hide
Query:  MSDGEQPQFELDPEIERTF--RRNRQRARQRQARRMENNNRNAPPPHAALE---PNAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVM
        MS    P+FE +PEI+ TF  R ++ RA +R+    +NNN  AP  +  +     +  ++A D + PIR YAAPNLY+F+PGI+ P+  ENARFEIKPVM
Subjt:  MSDGEQPQFELDPEIERTF--RRNRQRARQRQARRMENNNRNAPPPHAALE---PNAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVM

Query:  LQMIQNVGQFDGHPGEDPHEHIRNFYSICASFHMPGISPEELRFALFLLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKELMGFQQK
        +QMIQN+ QF+    E+PH H+  F  +C++F +PGI+P  +R  LF  TLRD+AKRWA++LE  E+ + DQL+E FMKKFFP   N RRRK ++ F++ 
Subjt:  LQMIQNVGQFDGHPGEDPHEHIRNFYSICASFHMPGISPEELRFALFLLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKELMGFQQK

Query:  DRENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLKSSYNQIK
        D E L  AW RF+R+VK  PH GI +C+LME+FY GLN++TQ  ADA  V   +  +Y + K
Subjt:  DRENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLKSSYNQIK

TrEMBL top hitse value%identityAlignment
A0A5A7V1F3 Retrotrans_gag domain-containing protein1.7e-4748.15Show/hide
Query:  PPHAALEPNAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRNFYSICASFHMPGISPEELRFALF
        P + A+     Y+AH+L RPIR+YA P+LY FNPGIAYP FGENA +E K                                                  
Subjt:  PPHAALEPNAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRNFYSICASFHMPGISPEELRFALF

Query:  LLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKELMGFQQKDRENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADA
             D+AKRWAN++E GEV TW+ LIEKFMKKFFP+ K A+RR++L+ F+Q+DR+NLHDAWS FKRMVKA  H+GI + +LME FYFGL+K T+Q+AD+
Subjt:  LLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKELMGFQQKDRENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADA

Query:  VFVGSMLKSSYNQIKA
        +F+G +L+SSYNQIKA
Subjt:  VFVGSMLKSSYNQIKA

A0A6J1EEI2 uncharacterized protein LOC1114333942.9e-4745.37Show/hide
Query:  NAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRNFYSICASFHMPGISPEELRFALFLLTLRDEA
        NA ++A D +R IR YA P +   NP I  P   +   FE+KPVM QM+Q +GQF G P EDPH H+++F  +  SF    +  + +R +LF  +LRD A
Subjt:  NAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRNFYSICASFHMPGISPEELRFALFLLTLRDEA

Query:  KRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKELMGFQQKDRENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLK
        K W N L  G + +W+ L+EKF+ K+FP  +NAR R E++ FQQ + + L +AW RFK M++  PH+G+P CI ME FY GLN AT+Q  DA   G++L 
Subjt:  KRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKELMGFQQKDRENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLK

Query:  SSYNQ
         +YN+
Subjt:  SSYNQ

A0A6J1EQ90 uncharacterized protein LOC1114364111.0e-4439.47Show/hide
Query:  QFELDPEIERTFRRNRQRARQRQARRME------NNNRNAPPP-----HAALEPNAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVML
        +F LDPEIERTFRR  ++ ++   + ++        NR    P        +  N  ++A D +R IR YA P +   NP I  P   +   FE+KPVM 
Subjt:  QFELDPEIERTFRRNRQRARQRQARRME------NNNRNAPPP-----HAALEPNAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVML

Query:  QMIQNVGQFDGHPGEDPHEHIRNFYSI-------CASFHMPGISPEELRFALFLLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKEL
        QM+Q +GQF G P EDPH H+++F  +         SF   G+  + +R +LF   LRD AK W N L  G + +W+ L E F+ K+FP  +NAR + E+
Subjt:  QMIQNVGQFDGHPGEDPHEHIRNFYSI-------CASFHMPGISPEELRFALFLLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKEL

Query:  MGFQQKDRENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLKSSYNQ
        + FQQ + E L +A  RFK M++  PH+G+P CI ME FY GLN  T+Q  DA   G++L  +YN+
Subjt:  MGFQQKDRENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLKSSYNQ

A0A6J1H7E4 uncharacterized protein LOC1114611687.6e-4846.34Show/hide
Query:  NAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRNFYSICASFHMPGISPEELRFALFLLTLRDEA
        NA  +A D +R IR YA P +   NP I  P   +   FE+KPVM QM+Q +GQF G P EDPH H+++F  +  SF   G+  + +R +LF  +LRD A
Subjt:  NAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRNFYSICASFHMPGISPEELRFALFLLTLRDEA

Query:  KRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKELMGFQQKDRENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLK
        K W N L    + +W+ L EKF+ K+FP  +NAR R E++ FQQ + E L +AW RFK M++  PH+G+P CI ME FY GLN AT+Q  DA   G+ML 
Subjt:  KRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKELMGFQQKDRENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLK

Query:  SSYNQ
         +YN+
Subjt:  SSYNQ

U5CUI2 Retrotrans_gag domain-containing protein1.1e-4948.29Show/hide
Query:  NAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRNFYSICASFHMPGISPEELRFALFLLTLRDEA
        N   +A D  R IR YAAP     NPGI  P   +  +FE+KPVM QM+Q VGQF G P EDPH H+R+F  +  SF + G+S E LR  LF  +LRD A
Subjt:  NAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRNFYSICASFHMPGISPEELRFALFLLTLRDEA

Query:  KRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKELMGFQQKDRENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLK
        + W N L    V  W+ L EKF++K+FP  +NA+ R E+M FQQ + E+  DAW RFK +++  PH+GIP CI ME FY GLN A++   DA   G++L 
Subjt:  KRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKELMGFQQKDRENLHDAWSRFKRMVKASPHNGIPECILMEVFYFGLNKATQQTADAVFVGSMLK

Query:  SSYNQ
         SYN+
Subjt:  SSYNQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGACGGTGAACAACCGCAATTCGAACTTGACCCTGAAATTGAGCGAACTTTTCGGCGTAATAGGCAAAGAGCTAGGCAGAGACAAGCAAGAAGAATGGAAAACAA
TAACAGAAATGCCCCTCCGCCGCATGCTGCCCTAGAACCAAATGCCGCCTACATGGCACATGACTTGGATAGGCCGATTAGAACTTATGCGGCACCCAACCTCTACAACT
TCAACCCAGGGATCGCTTACCCTGTGTTCGGCGAAAACGCTAGGTTTGAAATCAAACCTGTCATGCTTCAGATGATTCAGAATGTCGGACAATTCGACGGTCACCCTGGG
GAGGATCCACACGAGCATATTAGGAATTTTTACTCCATCTGCGCTTCCTTCCATATGCCAGGCATCTCACCTGAAGAATTGAGGTTCGCTCTCTTCCTGTTAACTCTGAG
GGATGAGGCGAAGAGGTGGGCCAACGCCTTGGAAGATGGTGAGGTGGGAACGTGGGACCAATTAATAGAGAAATTTATGAAGAAATTTTTCCCATCTCACAAAAATGCCA
GAAGAAGGAAGGAGCTTATGGGCTTCCAGCAGAAGGATAGAGAAAACCTACATGACGCGTGGAGTAGGTTTAAGAGGATGGTGAAAGCATCCCCCCACAATGGCATTCCT
GAATGCATATTGATGGAGGTTTTCTATTTTGGGCTAAACAAGGCTACACAGCAGACTGCTGATGCTGTGTTTGTAGGCAGTATGCTAAAGAGCTCCTACAACCAGATTAA
GGCGACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGACGGTGAACAACCGCAATTCGAACTTGACCCTGAAATTGAGCGAACTTTTCGGCGTAATAGGCAAAGAGCTAGGCAGAGACAAGCAAGAAGAATGGAAAACAA
TAACAGAAATGCCCCTCCGCCGCATGCTGCCCTAGAACCAAATGCCGCCTACATGGCACATGACTTGGATAGGCCGATTAGAACTTATGCGGCACCCAACCTCTACAACT
TCAACCCAGGGATCGCTTACCCTGTGTTCGGCGAAAACGCTAGGTTTGAAATCAAACCTGTCATGCTTCAGATGATTCAGAATGTCGGACAATTCGACGGTCACCCTGGG
GAGGATCCACACGAGCATATTAGGAATTTTTACTCCATCTGCGCTTCCTTCCATATGCCAGGCATCTCACCTGAAGAATTGAGGTTCGCTCTCTTCCTGTTAACTCTGAG
GGATGAGGCGAAGAGGTGGGCCAACGCCTTGGAAGATGGTGAGGTGGGAACGTGGGACCAATTAATAGAGAAATTTATGAAGAAATTTTTCCCATCTCACAAAAATGCCA
GAAGAAGGAAGGAGCTTATGGGCTTCCAGCAGAAGGATAGAGAAAACCTACATGACGCGTGGAGTAGGTTTAAGAGGATGGTGAAAGCATCCCCCCACAATGGCATTCCT
GAATGCATATTGATGGAGGTTTTCTATTTTGGGCTAAACAAGGCTACACAGCAGACTGCTGATGCTGTGTTTGTAGGCAGTATGCTAAAGAGCTCCTACAACCAGATTAA
GGCGACCTGA
Protein sequenceShow/hide protein sequence
MSDGEQPQFELDPEIERTFRRNRQRARQRQARRMENNNRNAPPPHAALEPNAAYMAHDLDRPIRTYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFDGHPG
EDPHEHIRNFYSICASFHMPGISPEELRFALFLLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPSHKNARRRKELMGFQQKDRENLHDAWSRFKRMVKASPHNGIP
ECILMEVFYFGLNKATQQTADAVFVGSMLKSSYNQIKAT