CuGenDBv2

PI0016228 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0016228
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr11:16285444..16286253
RNA-Seq Expression: PI0016228
Synteny: PI0016228
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
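
The genome location above can be converted into numeric coordinates for use in downstream tools. The Python sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and the code assumes that the `chrom:start..end` string uses 1-based, inclusive coordinates, which this page does not state.

```python
import re

# Hypothetical helper; assumes the "chrom:start..end" layout shown on this page.
_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location):
    """Split a location string into (chromosome, start, end)."""
    match = _LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start must not exceed end")
    return match["chrom"], start, end


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


chrom, start, end = parse_location("chr11:16285444..16286253")
print(chrom, start, end, span_length(start, end))  # chr11 16285444 16286253 810
```

Under the inclusive-coordinate assumption, the annotated span is 810 bp.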


Homology
GenBank top hits (e value, % identity, alignment)
WP_217833177.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002] (e value: 4.1e-75; % identity: 47.55)
Query:  MSDSEQPPFELDPEIERTFQSNRRRARQRQARRMENNNRNA-----------------PPPHADSEPNAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAF
        MS+ + P F++DPEIERTF+   R+ +QR++ +    N +A                    HA+ + N   +AHD +RP+R YA+PNLYNF  GI  P F
Subjt:  MSDSEQPPFELDPEIERTFQSNRRRARQRQARRMENNNRNA-----------------PPPHADSEPNAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAF

Query:  GENSRFEIKPVMLQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENT
          N RFE+KPVMLQM+Q AGQFGG  GEDPH H++SF  IC++F M G+  + +R  LFP +LRDEA++WA + E GE+ TW +++EKFM+K+FPP  + 
Subjt:  GENSRFEIKPVMLQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENT

Query:  RRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNNEE
        +RR+++++F+QKD E   +AW+RFKR+V+ CPHNGIP C+ ME+FY GLNK +Q  ADA  A G++  TY Q K +LD ++ N  +
Subjt:  RRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNNEE

XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus] (e value: 3.5e-58; % identity: 46.49)
Query:  MSDSEQPPFELDPEIERTFQSNRRRARQRQARR----MENNNRNAPPPHADSEPNAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVML
        MS      F  DPEIERTF  NRRR  QR+ ++    M++N  N   P     P  A+I  D DR IR YAAP     N GI  P   + ++FE+KPVM 
Subjt:  MSDSEQPPFELDPEIERTFQSNRRRARQRQARR----MENNNRNAPPPHADSEPNAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVML

Query:  QMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELMSFQQKD
        QM+Q  GQF G P EDPH H+R F  I  SF   G+  + LR  LFP ++RD A+ W N+L  G V TW+ L EKF+ K+FPP+ N + R E+ SFQQ+D
Subjt:  QMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELMSFQQKD

Query:  RENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNN
         E+L+DAW RFK +++ CPH+GI  CI ME FY GLN  T+   DA     +L  +YNQ   +L+T+A+ N
Subjt:  RENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNN

XP_017239618.1 PREDICTED: uncharacterized protein LOC108212402 [Daucus carota subsp. sativus] (e value: 5.6e-56; % identity: 46.13)
Query:  MSDSEQPPFELDPEIERTFQSNRRRARQRQARR----MENNNRNAPPPHADSEPNAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVML
        MS+     F  DPEIERTF  NRRR  QR+ ++    M +N  N   P     P  A+I  D DR IR YAAP     N GI  P   + ++FE+KPVM 
Subjt:  MSDSEQPPFELDPEIERTFQSNRRRARQRQARR----MENNNRNAPPPHADSEPNAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVML

Query:  QMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELMSFQQKD
        QM+Q  GQF G P EDPH H+R F  I  SF   G+  + LR  LFP ++RD A+ W N+L  G V TW+ L EKF+ K+FPP+ N +   E+ SFQQ+D
Subjt:  QMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELMSFQQKD

Query:  RENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNN
         E+L+DAW RFK +++ CPH+GI   I ME FY GLN  T+   DA     +L  +YNQ   +L+T+A+NN
Subjt:  RENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNN

XP_030508936.1 uncharacterized protein LOC115723589 [Cannabis sativa] (e value: 4.3e-56; % identity: 45.91)
Query:  LDPEIERTFQSNRRRARQRQARRMENNNRNAPPPHADSEPNAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVMLQMIQNAGQFGGHPG
        +DPEIERTF   R+R ++++A++  N           +E N   +A D  R IR YAAP     N GI  P   +   FE+KPVM QM+Q  GQFGG P 
Subjt:  LDPEIERTFQSNRRRARQRQARRMENNNRNAPPPHADSEPNAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVMLQMIQNAGQFGGHPG

Query:  EDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELMSFQQKDRENLHDAWSRFKRM
        EDPH HIRSF  +  SF + G+S E LR  LFP +LRD A+ W N L    V  W+ L EKF++K+FPP  N + R E+MSFQQ + E   DAW RFK +
Subjt:  EDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELMSFQQKDRENLHDAWSRFKRM

Query:  VKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNN
        ++ CPH+GIP CI +E FY GLN A +   DA     +L  +YN+   +L+ +ASNN
Subjt:  VKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNN

XP_038887458.1 uncharacterized protein LOC120077591 [Benincasa hispida] (e value: 1.1e-64; % identity: 48.18)
Query:  MSDSEQPPFELDPEIERTF--QSNRRRARQRQARRMENNNRNAPPPHADSE---PNAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVM
        MS    P FE +PEI+ TF  ++++ RA +R+    +NNN  AP  +        +  ++A D + PIR+YAAPNLY+F+ GI+ P   EN+RFEIKPVM
Subjt:  MSDSEQPPFELDPEIERTF--QSNRRRARQRQARRMENNNRNAPPPHADSE---PNAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVM

Query:  LQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELMSFQQK
        +QMIQN  QF     E+PH H+  F  +C++F +PGI+P  +R  LFP TLRD+AKRWA++LE  E+ + DQL+E FMKKFFPP  NTRRRK +++F++ 
Subjt:  LQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELMSFQQK

Query:  DRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNNEE
        D E L  AW RF+R+VK CPH GI  C+LME+FY GLN++TQ  ADA   +  +  TY + K +LD ++ N ++
Subjt:  DRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNNEE

TrEMBL top hits (e value, % identity, alignment)
A0A5A7V1F3 Retrotrans_gag domain-containing protein (e value: 3.2e-49; % identity: 48.61)
Query:  YIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVMLQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRW
        Y+AH+L RPIRSYA P+LY FN GIAYP FGEN+ +E K                                                       D+AKRW
Subjt:  YIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVMLQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRW

Query:  ANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTY
        AN++E GEV TW+ LIEKFMKKFFP  +  +RR++L+ F+Q+DR+NLHDAWS FKRMVKAC H+GI K +LME FYFGL+K T+Q+AD++F  G+L+S+Y
Subjt:  ANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTY

Query:  NQIKTMLDTMASNNEE
        NQIK MLD+MA+N+++
Subjt:  NQIKTMLDTMASNNEE

A0A6J1EEI2 uncharacterized protein LOC111433394 (e value: 5.8e-51; % identity: 46.08)
Query:  NAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVMLQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEA
        NA ++A D +R IR+YA P +   N  I  P   + + FE+KPVM QM+Q  GQF G P EDPH H++SF  +  SF    +  + +R +LFP +LRD A
Subjt:  NAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVMLQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEA

Query:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLK
        K W N L  G + +W+ L+EKF+ K+FPP  N R R E++ FQQ + + L +AW RFK M++ CPH+G+P CI ME FY GLN AT+Q  DA     +L 
Subjt:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLK

Query:  STYNQIKTMLDTMASNN
         TYN+   +L+ +ASNN
Subjt:  STYNQIKTMLDTMASNN

A0A6J1EQ90 uncharacterized protein LOC111436411 (e value: 6.0e-48; % identity: 40.07)
Query:  FELDPEIERTFQSNRRRARQRQARRME--------NNNRNAPPPHADSE---PNAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVMLQ
        F LDPEIERTF+   ++ ++   + ++        N     P   A+ E    N  ++A D +R IR+YA P +   N  I  P   + + FE+KPVM Q
Subjt:  FELDPEIERTFQSNRRRARQRQARRME--------NNNRNAPPPHADSE---PNAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVMLQ

Query:  MIQNAGQFGGHPGEDPHEHIRSFYSI-------CASFHMPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELM
        M+Q  GQF G P EDPH H++SF  +         SF   G+  + +R +LFP  LRD AK W N L  G + +W+ L E F+ K+FPP  N R + E++
Subjt:  MIQNAGQFGGHPGEDPHEHIRSFYSI-------CASFHMPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELM

Query:  SFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNN
        +FQQ + E L +A  RFK M++ CPH+G+P CI ME FY GLN  T+Q  DA     +L  TYN+   +L+ +ASNN
Subjt:  SFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNN

A0A6J1H7E4 uncharacterized protein LOC111461168 (e value: 1.5e-51; % identity: 47)
Query:  NAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVMLQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEA
        NA  +A D +R IR+YA P +   N  I  P   + + FE+KPVM QM+Q  GQF G P EDPH H++SF  +  SF   G+  + +R +LFP +LRD A
Subjt:  NAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVMLQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEA

Query:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLK
        K W N L    + +W+ L EKF+ K+FPP  N R R E+++FQQ + E L +AW RFK M++ CPH+G+P CI ME FY GLN AT+Q  DA     ML 
Subjt:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLK

Query:  STYNQIKTMLDTMASNN
         TYN+   +L+ +ASNN
Subjt:  STYNQIKTMLDTMASNN

U5CUI2 Retrotrans_gag domain-containing protein (e value: 1.6e-53; % identity: 48.39)
Query:  NAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVMLQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEA
        N   +A D  R IR YAAP     N GI  P   +  +FE+KPVM QM+Q  GQF G P EDPH H+RSF  +  SF + G+S E LR  LFP +LRD A
Subjt:  NAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVMLQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEA

Query:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLK
        + W N L    V  W+ L EKF++K+FPP  N + R E+MSFQQ + E+  DAW RFK +++ CPH+GIP CI ME FY GLN A++   DA     +L 
Subjt:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLK

Query:  STYNQIKTMLDTMASNN
         +YN+   +L+T+ASNN
Subjt:  STYNQIKTMLDTMASNN

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
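
The % identity values listed for each hit summarise the corresponding alignment. In BLAST-style protein alignments, the middle line between the query and subject shows a letter where the residues are identical, `+` where they are similar, and a space for a mismatch or gap. The sketch below shows how identity and positives percentages could be derived from such a middle line. The function name `midline_stats` is hypothetical, and the input is assumed to be the middle line trimmed to the aligned columns, with all blocks of one hit concatenated. The values on this page were computed by the search tool over the full alignment, so they may not be reproduced exactly from the text shown here.

```python
def midline_stats(midline):
    """Return (% identity, % positives) for a BLAST-style middle line."""
    length = len(midline)
    if length == 0:
        raise ValueError("empty midline")
    # Letters mark identical residues; '+' marks similar (positive) residues.
    identical = sum(1 for ch in midline if ch.isalpha())
    similar = midline.count("+")
    identity = 100.0 * identical / length
    positives = 100.0 * (identical + similar) / length
    return round(identity, 2), round(positives, 2)


# Toy middle line, not taken from this page: 3 identical, 2 similar, 1 mismatch.
print(midline_stats("MS+ +Q"))  # (50.0, 83.33)
```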

Sequences
CDS sequence
ATGAGTGACAGCGAACAACCACCATTCGAACTTGACCCTGAAATTGAGCGAACATTTCAAAGTAACCGGCGAAGAGCAAGGCAGAGACAAGCTAGAAGGATGGAAAATAA
CAATAGAAATGCCCCTCCGCCGCATGCTGACTCAGAACCAAATGCTGCTTACATAGCACACGACTTGGATAGGCCGATTAGATCTTATGCAGCACCCAACCTCTACAACT
TTAACCTAGGAATCGCCTACCCTGCATTCGGCGAGAATTCCAGATTTGAAATCAAACCTGTTATGCTTCAGATGATTCAGAACGCCGGACAATTCGGCGGGCATCCTGGG
GAAGATCCACATGAACATATAAGGAGTTTCTATTCCATCTGCGCTTCCTTCCACATGCCAGGCATTTCACCTGAAGAATTGAGATTCGCCCTCTTCCCGTTAACTCTGAG
GGATGAGGCAAAGAGGTGGGCAAATGCTCTGGAAGATGGCGAGGTGGGAACATGGGACCAACTAATAGAGAAATTTATGAAGAAATTTTTTCCACCTCACGAAAACACTA
GAAGAAGGAAGGAGCTTATGAGCTTCCAGCAGAAGGATAGAGAAAACCTACATGACGCGTGGAGTAGGTTCAAAAGGATGGTCAAAGCATGTCCCCACAATGGCATTCCT
AAATGCATATTGATGGAAGTTTTCTATTTCGGACTAAACAAGGCTACACAGCAGACTGCTGACGCTGTGTTTGCAGACGGTATGCTAAAAAGCACATACAACCAAATTAA
GACGATGCTGGATACGATGGCCAGCAACAATGAGGAATGA
mRNA sequence
ATGAGTGACAGCGAACAACCACCATTCGAACTTGACCCTGAAATTGAGCGAACATTTCAAAGTAACCGGCGAAGAGCAAGGCAGAGACAAGCTAGAAGGATGGAAAATAA
CAATAGAAATGCCCCTCCGCCGCATGCTGACTCAGAACCAAATGCTGCTTACATAGCACACGACTTGGATAGGCCGATTAGATCTTATGCAGCACCCAACCTCTACAACT
TTAACCTAGGAATCGCCTACCCTGCATTCGGCGAGAATTCCAGATTTGAAATCAAACCTGTTATGCTTCAGATGATTCAGAACGCCGGACAATTCGGCGGGCATCCTGGG
GAAGATCCACATGAACATATAAGGAGTTTCTATTCCATCTGCGCTTCCTTCCACATGCCAGGCATTTCACCTGAAGAATTGAGATTCGCCCTCTTCCCGTTAACTCTGAG
GGATGAGGCAAAGAGGTGGGCAAATGCTCTGGAAGATGGCGAGGTGGGAACATGGGACCAACTAATAGAGAAATTTATGAAGAAATTTTTTCCACCTCACGAAAACACTA
GAAGAAGGAAGGAGCTTATGAGCTTCCAGCAGAAGGATAGAGAAAACCTACATGACGCGTGGAGTAGGTTCAAAAGGATGGTCAAAGCATGTCCCCACAATGGCATTCCT
AAATGCATATTGATGGAAGTTTTCTATTTCGGACTAAACAAGGCTACACAGCAGACTGCTGACGCTGTGTTTGCAGACGGTATGCTAAAAAGCACATACAACCAAATTAA
GACGATGCTGGATACGATGGCCAGCAACAATGAGGAATGA
Protein sequence
MSDSEQPPFELDPEIERTFQSNRRRARQRQARRMENNNRNAPPPHADSEPNAAYIAHDLDRPIRSYAAPNLYNFNLGIAYPAFGENSRFEIKPVMLQMIQNAGQFGGHPG
EDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENTRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIP
KCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNNEE
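
The protein sequence is expected to be the conceptual translation of the CDS. The Python sketch below checks this for the first 60 and the last 12 nucleotides of the CDS listed above. It assumes the standard genetic code, which this page does not state; `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard genetic code (an assumption), with codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame nucleotide sequence; '*' marks a stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    # Codons containing ambiguous bases are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First 60 nt and last 12 nt of the CDS shown on this page.
cds_start = "ATGAGTGACAGCGAACAACCACCATTCGAACTTGACCCTGAAATTGAGCGAACATTTCAA"
cds_end = "AATGAGGAATGA"
print(translate(cds_start))  # MSDSEQPPFELDPEIERTFQ
print(translate(cds_end))    # NEE*
```

Both fragments agree with the start (`MSDSEQPPFELDPEIERTFQ`) and end (`NEE`, followed by the stop codon) of the protein sequence above.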