CuGenDBv2

PI0003495 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0003495
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr09:9210177..9211061
RNA-Seq Expression: PI0003495
Synteny: PI0003495
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
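The genome location chr09:9210177..9211061 can be checked against the sequence lengths in this record. Read as 1-based, inclusive coordinates (an assumption, since the page does not state its convention), the interval spans 885 positions, which corresponds to 294 codons plus a stop codon (294 × 3 + 3 = 885). A minimal Python sketch of that arithmetic follows; `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr09:9210177..9211061")
print(chrom, span_length(start, end))  # chr09 885
```

If the coordinates were 0-based and half-open instead, the `+ 1` would be dropped and the span would be 884, which is not a multiple of three; this is one reason the inclusive reading seems the more plausible one here.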


Homology
GenBank top hits (e value, % identity, alignment)
WP_217833177.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002] (e value: 1.1e-70; % identity: 47.55)
Query:  MSDSEQP-FELDLEIERTFRGNRRRARQRQVRRMENNNRNA-----LPPQAHPEPNA------------AYIAHDLDRPIRSYAAPNLYNFNPGIAYPVF
        MS+ + P F++D EIERTFR   R+ +QR+  +    N +A       PQA    NA              +AHD +RP+R YA+PNLYNF PGI  P F
Subjt:  MSDSEQP-FELDLEIERTFRGNRRRARQRQVRRMENNNRNA-----LPPQAHPEPNA------------AYIAHDLDRPIRSYAAPNLYNFNPGIAYPVF

Query:  GENARFEIKPVMLQMIQNVGQFGGHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWANALKDGEVGTWDQLIEKFMKKFFPPHENA
          N RFE+KPVMLQM+Q  GQFGG  GEDPH H++SF  IC++F M G+  + +R TLFP +LRDEA++WA + + GE+ TW +++EKFM+K+FPP  +A
Subjt:  GENARFEIKPVMLQMIQNVGQFGGHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWANALKDGEVGTWDQLIEKFMKKFFPPHENA

Query:  -RRKELMSFQQKDRENLHDTWSRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMANNNEE
         RR+++++F+QKD E   + W+RFKR+V+ CP+NGIP C+ ME+FY GLNK +Q  ADA    G++  TY Q K  LD ++ N  +
Subjt:  -RRKELMSFQQKDRENLHDTWSRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMANNNEE

XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus] (e value: 2.5e-57; % identity: 46.24)
Query:  FELDLEIERTFRGNRRRARQRQVRR----MENNNRNALPPQAHPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQ
        F  D EIERTF  NRRR  QR++++    M++N  N   P     P  A+I  D DR IR YAAP     N GI  P   +  +FE+KPVM QM+Q +GQ
Subjt:  FELDLEIERTFRGNRRRARQRQVRR----MENNNRNALPPQAHPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQ

Query:  FGGHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWANALKDGEVGTWDQLIEKFMKKFFPPHENAR-RKELMSFQQKDRENLHDTW
        F G P EDPH H+R F  I  SF   G+  + LR  LFP ++RD A+ W N+L  G V TW+ L EKF+ K+FPP+ NA+ R E+ SFQQ+D E+L+D W
Subjt:  FGGHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWANALKDGEVGTWDQLIEKFMKKFFPPHENAR-RKELMSFQQKDRENLHDTW

Query:  SRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMANNNEEW
         RFK +++ CP++GI  CI ME FY GLN  T+   DA     +L  +YNQ    L+T+A  N +W
Subjt:  SRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMANNNEEW

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa] (e value: 2.3e-55; % identity: 48.44)
Query:  AHPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFGGHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLT
        AH E N   +A D  R IR YAAP     NPGI  P   +   FE+KPVM QM+Q VGQFGG P EDPH HIRSF  +  SF + G+S E LR  LFP +
Subjt:  AHPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFGGHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLT

Query:  LRDEAKRWANALKDGEVGTWDQLIEKFMKKFFPPHENAR-RKELMSFQQKDRENLHDTWSRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFV
        LRD A+ W N L    V  W+ L EKF++K+FPP  NA+ R E+MSFQQ + E   D W RFK +++ CP++GIP CI +E FY GLN A++   DA   
Subjt:  LRDEAKRWANALKDGEVGTWDQLIEKFMKKFFPPHENAR-RKELMSFQQKDRENLHDTWSRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFV

Query:  DGMLKSTYNQIKTTLDTMANNNEEW
          +L  +YN+    L+ +A+NN +W
Subjt:  DGMLKSTYNQIKTTLDTMANNNEEW

XP_030508936.1 uncharacterized protein LOC115723589 [Cannabis sativa] (e value: 9.4e-57; % identity: 44.85)
Query:  MSDSEQPFEL---DLEIERTFRGNRRRARQRQVRRMENNNRNALPPQAHPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQM
        M++ E+  EL   D EIERTF   R+R ++++ ++  N          H E N   +A D  R IR YAAP     NPGI  P   +   FE+KPVM QM
Subjt:  MSDSEQPFEL---DLEIERTFRGNRRRARQRQVRRMENNNRNALPPQAHPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQM

Query:  IQNVGQFGGHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWANALKDGEVGTWDQLIEKFMKKFFPPHENAR-RKELMSFQQKDRE
        +Q VGQFGG P EDPH HIRSF  +  SF + G+S E LR  LFP +LRD A+ W N L    V  W+ L EKF++K+FPP  NA+ R E+MSFQQ + E
Subjt:  IQNVGQFGGHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWANALKDGEVGTWDQLIEKFMKKFFPPHENAR-RKELMSFQQKDRE

Query:  NLHDTWSRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMANNNEEW
           D W RFK +++ CP++GIP CI +E FY GLN A +   DA     +L  +YN+    L+ +A+NN +W
Subjt:  NLHDTWSRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMANNNEEW

XP_038887458.1 uncharacterized protein LOC120077591 [Benincasa hispida] (e value: 2.1e-64; % identity: 46.02)
Query:  SEQPFELDLEIERTF--RGNRRRARQRQVRRMENNNRNALPPQAH-----PEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQ
        ++  FE + EI+ TF  R ++ RA +R++   +NNN  A  P+ +     P  +  ++A D + PIR+YAAPNLY+F+PGI+ P+  ENARFEIKPVM+Q
Subjt:  SEQPFELDLEIERTF--RGNRRRARQRQVRRMENNNRNALPPQAH-----PEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQ

Query:  MIQNVGQFGGHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWANALKDGEVGTWDQLIEKFMKKFFPPHENA-RRKELMSFQQKDR
        MIQN+ QF     E+PH H+  F  +C++F +PGI+P  +R  LFP TLRD+AKRWA++L+  E+ + DQL+E FMKKFFPP  N  RRK +++F++ D 
Subjt:  MIQNVGQFGGHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWANALKDGEVGTWDQLIEKFMKKFFPPHENA-RRKELMSFQQKDR

Query:  ENLHDTWSRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMANNNEEWDEDDFGNRRGGRAKED
        E L   W RF+R+VK CP+ GI  C+LME+FY GLN++TQ  ADA  V+  +  TY + K  LD ++ N ++W +D +  R   R + D
Subjt:  ENLHDTWSRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMANNNEEWDEDDFGNRRGGRAKED

TrEMBL top hits (e value, % identity, alignment)
A0A5A7V1F3 Retrotrans_gag domain-containing protein (e value: 3.9e-48; % identity: 48.61)
Query:  YIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFGGHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRW
        Y+AH+L RPIRSYA P+LY FNPGIAYP FGENA +E K                                                       D+AKRW
Subjt:  YIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFGGHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRW

Query:  ANALKDGEVGTWDQLIEKFMKKFFPPHENA-RRKELMSFQQKDRENLHDTWSRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTY
        AN+++ GEV TW+ LIEKFMKKFFP  + A RR++L+ F+Q+DR+NLHD WS FKRMVKAC ++GI K +LME FYFGL+K T+Q+AD++F+ G+L+S+Y
Subjt:  ANALKDGEVGTWDQLIEKFMKKFFPPHENA-RRKELMSFQQKDRENLHDTWSRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTY

Query:  NQIKTTLDTMANNNEE
        NQIK  LD+MANN+++
Subjt:  NQIKTTLDTMANNNEE

A0A6J1EEI2 uncharacterized protein LOC111433394 (e value: 1.9e-50; % identity: 40.53)
Query:  QPFELDLEIERTFRGNRRRARQRQVRRMENNNRNALPPQAHPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFG
        Q  EL  ++ R F      A Q ++                   NA ++A D +R IR+YA P +   NP I  P   +   FE+KPVM QM+Q +GQF 
Subjt:  QPFELDLEIERTFRGNRRRARQRQVRRMENNNRNALPPQAHPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFG

Query:  GHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWANALKDGEVGTWDQLIEKFMKKFFPPHENAR-RKELMSFQQKDRENLHDTWSR
        G P EDPH H++SF  +  SF    +  + +R +LFP +LRD AK W N L  G + +W+ L+EKF+ K+FPP  NAR R E++ FQQ + + L + W R
Subjt:  GHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWANALKDGEVGTWDQLIEKFMKKFFPPHENAR-RKELMSFQQKDRENLHDTWSR

Query:  FKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMANNNEEW
        FK M++ CP++G+P CI ME FY GLN AT+Q  DA     +L  TYN+    L+ +A+NN +W
Subjt:  FKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMANNNEEW

A0A6J1EQ90 uncharacterized protein LOC111436411 (e value: 8.6e-48; % identity: 39.93)
Query:  FELDLEIERTFRGNRRRARQRQVRRMENNNRNALPPQAHPE--------------PNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPV
        F LD EIERTF   RRR ++++    +N  +  L  Q + E               N  ++A D +R IR+YA P +   NP I  P   +   FE+KPV
Subjt:  FELDLEIERTFRGNRRRARQRQVRRMENNNRNALPPQAHPE--------------PNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPV

Query:  MLQMIQNVGQFGGHPGEDPHKHIRSFYSI-------CASFHMPGISPEELRFTLFPLTLRDEAKRWANALKDGEVGTWDQLIEKFMKKFFPPHENAR-RK
        M QM+Q +GQF G P EDPH H++SF  +         SF   G+  + +R +LFP  LRD AK W N L  G + +W+ L E F+ K+FPP  NAR + 
Subjt:  MLQMIQNVGQFGGHPGEDPHKHIRSFYSI-------CASFHMPGISPEELRFTLFPLTLRDEAKRWANALKDGEVGTWDQLIEKFMKKFFPPHENAR-RK

Query:  ELMSFQQKDRENLHDTWSRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMANNNEEW
        E+++FQQ + E L +   RFK M++ CP++G+P CI ME FY GLN  T+Q  DA     +L  TYN+    L+ +A+NN +W
Subjt:  ELMSFQQKDRENLHDTWSRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMANNNEEW

A0A6J1H7E4 uncharacterized protein LOC111461168 (e value: 9.8e-52; % identity: 46.36)
Query:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFGGHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEA
        NA  +A D +R IR+YA P +   NP I  P   +   FE+KPVM QM+Q +GQF G P EDPH H++SF  +  SF   G+  + +R +LFP +LRD A
Subjt:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFGGHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEA

Query:  KRWANALKDGEVGTWDQLIEKFMKKFFPPHENAR-RKELMSFQQKDRENLHDTWSRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK
        K W N L    + +W+ L EKF+ K+FPP  NAR R E+++FQQ + E L + W RFK M++ CP++G+P CI ME FY GLN AT+Q  DA     ML 
Subjt:  KRWANALKDGEVGTWDQLIEKFMKKFFPPHENAR-RKELMSFQQKDRENLHDTWSRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK

Query:  STYNQIKTTLDTMANNNEEW
         TYN+    L+ +A+NN +W
Subjt:  STYNQIKTTLDTMANNNEEW

U5CUI2 Retrotrans_gag domain-containing protein (e value: 4.7e-54; % identity: 48.18)
Query:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFGGHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEA
        N   +A D  R IR YAAP     NPGI  P   +  +FE+KPVM QM+Q VGQF G P EDPH H+RSF  +  SF + G+S E LR  LFP +LRD A
Subjt:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFGGHPGEDPHKHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEA

Query:  KRWANALKDGEVGTWDQLIEKFMKKFFPPHENAR-RKELMSFQQKDRENLHDTWSRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK
        + W N L    V  W+ L EKF++K+FPP  NA+ R E+MSFQQ + E+  D W RFK +++ CP++GIP CI ME FY GLN A++   DA     +L 
Subjt:  KRWANALKDGEVGTWDQLIEKFMKKFFPPHENAR-RKELMSFQQKDRENLHDTWSRFKRMVKACPYNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK

Query:  STYNQIKTTLDTMANNNEEW
         +YN+    L+T+A+NN +W
Subjt:  STYNQIKTTLDTMANNNEEW

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAGTGACAGTGAACAACCATTCGAACTCGACCTTGAAATTGAGCGAACATTTCGGGGTAATCGACGAAGAGCAAGGCAGAGACAAGTTCGCAGGATGGAAAATAACAA
CAGAAATGCCCTTCCGCCGCAAGCTCACCCAGAACCGAACGCCGCCTATATAGCACATGACTTGGATAGGCCGATTAGATCTTATGCGGCGCCCAACCTTTATAACTTCA
ACCCAGGAATCGCCTACCCTGTATTCGGCGAGAACGCAAGGTTTGAAATCAAACCTGTTATGCTTCAAATGATTCAGAACGTCGGACAATTCGGCGGACATCCTGGGGAA
GATCCACACAAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCATATGCCAGGCATCTCACCTGAAGAATTAAGATTCACCCTCTTCCCGTTAACTCTGAGGGA
TGAGGCGAAGAGGTGGGCGAATGCCCTGAAAGATGGCGAGGTGGGAACATGGGATCAATTAATAGAAAAATTTATGAAGAAATTTTTCCCACCTCACGAAAATGCTAGAA
GGAAGGAGCTTATGAGCTTCCAGCAGAAGGATAGAGAAAACCTACATGACACGTGGAGTAGGTTCAAACGGATGGTCAAAGCATGCCCCTACAATGGCATTCCTAAATGC
ATACTGATGGAGGTTTTCTATTTTGGACTAAACAAGGCTACACAGCAGACTGCTGATGCTGTGTTTGTAGACGGTATGCTGAAAAGTACATACAACCAGATTAAGACGAC
GCTGGACACGATGGCCAACAACAATGAAGAGTGGGATGAAGATGATTTCGGCAATCGCCGAGGAGGACGAGCAAAAGAGGATGGCATGGATAAGAGCGCCGTGGTGGCAT
TGTAG
mRNA sequence
ATGAGTGACAGTGAACAACCATTCGAACTCGACCTTGAAATTGAGCGAACATTTCGGGGTAATCGACGAAGAGCAAGGCAGAGACAAGTTCGCAGGATGGAAAATAACAA
CAGAAATGCCCTTCCGCCGCAAGCTCACCCAGAACCGAACGCCGCCTATATAGCACATGACTTGGATAGGCCGATTAGATCTTATGCGGCGCCCAACCTTTATAACTTCA
ACCCAGGAATCGCCTACCCTGTATTCGGCGAGAACGCAAGGTTTGAAATCAAACCTGTTATGCTTCAAATGATTCAGAACGTCGGACAATTCGGCGGACATCCTGGGGAA
GATCCACACAAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCATATGCCAGGCATCTCACCTGAAGAATTAAGATTCACCCTCTTCCCGTTAACTCTGAGGGA
TGAGGCGAAGAGGTGGGCGAATGCCCTGAAAGATGGCGAGGTGGGAACATGGGATCAATTAATAGAAAAATTTATGAAGAAATTTTTCCCACCTCACGAAAATGCTAGAA
GGAAGGAGCTTATGAGCTTCCAGCAGAAGGATAGAGAAAACCTACATGACACGTGGAGTAGGTTCAAACGGATGGTCAAAGCATGCCCCTACAATGGCATTCCTAAATGC
ATACTGATGGAGGTTTTCTATTTTGGACTAAACAAGGCTACACAGCAGACTGCTGATGCTGTGTTTGTAGACGGTATGCTGAAAAGTACATACAACCAGATTAAGACGAC
GCTGGACACGATGGCCAACAACAATGAAGAGTGGGATGAAGATGATTTCGGCAATCGCCGAGGAGGACGAGCAAAAGAGGATGGCATGGATAAGAGCGCCGTGGTGGCAT
TGTAG
Protein sequence
MSDSEQPFELDLEIERTFRGNRRRARQRQVRRMENNNRNALPPQAHPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNVGQFGGHPGE
DPHKHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWANALKDGEVGTWDQLIEKFMKKFFPPHENARRKELMSFQQKDRENLHDTWSRFKRMVKACPYNGIPKC
ILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMANNNEEWDEDDFGNRRGGRAKEDGMDKSAVVAL
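The protein sequence above should follow from the CDS by reading it codon by codon. The sketch below illustrates this in Python, assuming the standard genetic code (NCBI translation table 1), which the page does not state explicitly. `translate` is a hypothetical helper written for this illustration, and only the first 30 and last 15 nucleotides of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (assumed), listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3].upper()]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGAGTGACAGTGAACAACCATTCGAACTC"))  # MSDSEQPFEL
# Last 15 nucleotides of the CDS, ending in the TAG stop codon.
print(translate("GTGGTGGCATTGTAG"))  # VVAL
```

Both fragments agree with the start (MSDSEQPFEL) and end (VVAL) of the protein sequence listed above. Passing the full 885-nucleotide CDS, with line breaks removed, to the same function would be the natural way to check the entire record.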