; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0010073 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0010073
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr01:18776086..18776980
RNA-Seq ExpressionPI0010073
SyntenyPI0010073
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
WP_217833177.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]5.9e-6749.81Show/hide
Query:  MSDSEQP-FELDPEIERTFRGNRRRARQRQIRR-MENNRNA-----PPPQADPEPNA------------AYIAHDLDRPIRSYAAPNLYNFSPGIAYPVF
        MS+ + P F++DPEIERTFR   R+ +QR+  + +E N +A       PQA    NA              +AHD +RP+R YA+PNLYNF+PGI  P F
Subjt:  MSDSEQP-FELDPEIERTFRGNRRRARQRQIRR-MENNRNA-----PPPQADPEPNA------------AYIAHDLDRPIRSYAAPNLYNFSPGIAYPVF

Query:  GENARFEIKPVMLQMIQNARQFGGHPREDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENA
          N RFE+KPVMLQM+Q A QFGG   EDPH H++SF  IC++F M G+  + +R  LFP +LRDEA++WA + E GE+ TW +++EKFM+K+FPP  +A
Subjt:  GENARFEIKPVMLQMIQNARQFGGHPREDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENA

Query:  RRRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGILKCILMEVFYFGLNKATQQTADA
        +RR+++++F+Q+D E   +AW+RFKR+V+ CPHNGI  C+ ME+FY GLNK +Q  ADA
Subjt:  RRRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGILKCILMEVFYFGLNKATQQTADA

XP_017216983.1 PREDICTED: uncharacterized protein LOC108194534 [Daucus carota subsp. sativus]3.5e-5147.66Show/hide
Query:  FELDPEIERTFRGNRRRARQRQIRRMENNRNAPPPQAD--PEPNAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQMIQNARQFGG
        F  DP IERTF  NRRR  QR+I++ +          D    P  A+I  D DR IR YAAP     + GI  P   +  +FE+KPVM QM+Q   QF G
Subjt:  FELDPEIERTFRGNRRRARQRQIRRMENNRNAPPPQAD--PEPNAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQMIQNARQFGG

Query:  HPREDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQRDRENLHDAWSRF
         P EDPH H+R F  I  SF   G++ + LR  LFP  +RD A+ W N+L  G V  W+ L EKF+ K+FPP+ NA+ R E+ SFQQ+D E+L+DAW RF
Subjt:  HPREDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQRDRENLHDAWSRF

Query:  KRMVKACPHNGILKCILMEVFYFGLNKATQQTADA
        K +++ CPH+GIL CI ME FY GLN  T+   DA
Subjt:  KRMVKACPHNGILKCILMEVFYFGLNKATQQTADA

XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]9.7e-5448.74Show/hide
Query:  FELDPEIERTFRGNRRRARQRQIRRM-----ENNRNAPPPQADPEPNAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQMIQNARQ
        F  DPEIERTF  NRRR  QR+I++      +N  N   P     P  A+I  D DR IR YAAP     + GI  P   +  +FE+KPVM QM+Q   Q
Subjt:  FELDPEIERTFRGNRRRARQRQIRRM-----ENNRNAPPPQADPEPNAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQMIQNARQ

Query:  FGGHPREDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQRDRENLHDAW
        F G P EDPH H+R F  I  SF   G+  + LR  LFP ++RD A+ W N+L  G V TW+ L EKF+ K+FPP+ NA+ R E+ SFQQ+D E+L+DAW
Subjt:  FGGHPREDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQRDRENLHDAW

Query:  SRFKRMVKACPHNGILKCILMEVFYFGLNKATQQTADA
         RFK +++ CPH+GIL CI ME FY GLN  T+   DA
Subjt:  SRFKRMVKACPHNGILKCILMEVFYFGLNKATQQTADA

XP_030508936.1 uncharacterized protein LOC115723589 [Cannabis sativa]2.6e-5146.91Show/hide
Query:  MSDSEQPFEL---DPEIERTFRGNRRRARQRQIRRMENNRNAPPPQADPEPNAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQMI
        M++ E+  EL   DPEIERTFR  +RR  Q+  +R              E N   +A D  R IR YAAP     +PGI  P   +   FE+KPVM QM+
Subjt:  MSDSEQPFEL---DPEIERTFRGNRRRARQRQIRRMENNRNAPPPQADPEPNAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQMI

Query:  QNARQFGGHPREDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQRDREN
        Q   QFGG P EDPH HIRSF  +  SF + G+S E LR  LFP +LRD A+ W N L    V  W+ L EKF++K+FPP  NA+ R E+MSFQQ + E 
Subjt:  QNARQFGGHPREDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQRDREN

Query:  LHDAWSRFKRMVKACPHNGILKCILMEVFYFGLNKATQQTADA
          DAW RFK +++ CPH+GI  CI +E FY GLN A +   DA
Subjt:  LHDAWSRFKRMVKACPHNGILKCILMEVFYFGLNKATQQTADA

XP_038887458.1 uncharacterized protein LOC120077591 [Benincasa hispida]7.9e-6452.23Show/hide
Query:  SEQPFELDPEIERTF--RGNRRRARQRQIRRMENNRNAPPPQAD----PEPNAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQMI
        ++  FE +PEI+ TF  R ++ RA +R+I   +NN N  P        P  +  ++A D + PIR+YAAPNLY+FSPGI+ P+  ENARFEIKPVM+QMI
Subjt:  SEQPFELDPEIERTF--RGNRRRARQRQIRRMENNRNAPPPQAD----PEPNAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQMI

Query:  QNARQFGGHPREDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQRDREN
        QN RQF     E+PH H+  F  +C++F + GI+P  +R  LFP TLRD+AKRWA++LE  E+ + DQL+E FMKKFFPP  N RRRK +++F++ D E 
Subjt:  QNARQFGGHPREDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQRDREN

Query:  LHDAWSRFKRMVKACPHNGILKCILMEVFYFGLNKATQQTADAVFVD
        L  AW RF+R+VK CPH GIL C+LME+FY GLN++TQ  ADA  V+
Subjt:  LHDAWSRFKRMVKACPHNGILKCILMEVFYFGLNKATQQTADAVFVD

TrEMBL top hitse value%identityAlignment
A0A6J0ZYV0 uncharacterized protein LOC1104134133.2e-4236.36Show/hide
Query:  DPEIERTFRGNRRR----ARQRQIRRMENNRNAPPPQADPEPNAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQMIQNARQFGGH
        DP+IERTFR +RR     A   Q    +NN N          NA  +  + +R +R YA P +      I  P    N  FEIKP  +QMIQ++ QF G 
Subjt:  DPEIERTFRGNRRR----ARQRQIRRMENNRNAPPPQADPEPNAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQMIQNARQFGGH

Query:  PREDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQRDRENLHDAWSRFK
        P +DP+ H+ +F  IC +F   G++ + +R  LFP +LRD+AK W N+L +G + TW+ L +KF+ KFFPP + A+ R ++ SF Q D E+L++AW RFK
Subjt:  PREDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQRDRENLHDAWSRFK

Query:  RMVKACPHNGILKCILMEVFYFGLNKATQQTADAV---------FVDVHT--TRLRRRWIRWPATRKNGMKMISA
         +++ CPH+GI   + ++ FY GL  + +   DA           VD +     +     +WP+ R    K + A
Subjt:  RMVKACPHNGILKCILMEVFYFGLNKATQQTADAV---------FVDVHT--TRLRRRWIRWPATRKNGMKMISA

A0A6J1EEI2 uncharacterized protein LOC1114333943.4e-4441.7Show/hide
Query:  QPFELDPEIERTFRGNRRRARQRQIRRMENNRNAPPPQADPEPNAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQMIQNARQFGG
        Q  EL  ++ R F      A Q +I                  NA ++A D +R IR+YA P +   +P I  P   +   FE+KPVM QM+Q   QF G
Subjt:  QPFELDPEIERTFRGNRRRARQRQIRRMENNRNAPPPQADPEPNAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQMIQNARQFGG

Query:  HPREDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQRDRENLHDAWSRF
         P EDPH H++SF  +  SF    +  + +R +LFP +LRD AK W N L  G + +W+ L+EKF+ K+FPP  NAR R E++ FQQ + + L +AW RF
Subjt:  HPREDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQRDRENLHDAWSRF

Query:  KRMVKACPHNGILKCILMEVFYFGLNKATQQTADA
        K M++ CPH+G+  CI ME FY GLN AT+Q  DA
Subjt:  KRMVKACPHNGILKCILMEVFYFGLNKATQQTADA

A0A6J1EQ90 uncharacterized protein LOC1114364111.4e-4241.27Show/hide
Query:  FELDPEIERTFR---GNRRRARQRQIRRME----NNRNAPPP-----QADPEPNAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQ
        F LDPEIERTFR     +++  ++ I+++E     NR    P     Q     N  ++A D +R IR+YA P +   +P I  P   +   FE+KPVM Q
Subjt:  FELDPEIERTFR---GNRRRARQRQIRRME----NNRNAPPP-----QADPEPNAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQ

Query:  MIQNARQFGGHPREDPHEHIRSFYSI-------CASFHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELM
        M+Q   QF G P EDPH H++SF  +         SF   G+  + +R +LFP  LRD AK W N L  G + +W+ L E F+ K+FPP  NAR + E++
Subjt:  MIQNARQFGGHPREDPHEHIRSFYSI-------CASFHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELM

Query:  SFQQRDRENLHDAWSRFKRMVKACPHNGILKCILMEVFYFGLNKATQQTADA
        +FQQ + E L +A  RFK M++ CPH+G+  CI ME FY GLN  T+Q  DA
Subjt:  SFQQRDRENLHDAWSRFKRMVKACPHNGILKCILMEVFYFGLNKATQQTADA

A0A6J1H7E4 uncharacterized protein LOC1114611687.5e-4443.98Show/hide
Query:  RQIRRMENNRNAPPPQADPE---PNAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQMIQNARQFGGHPREDPHEHIRSFYSICAS
        +Q+ ++      P   A+ E    NA  +A D +R IR+YA P +   +P I  P   +   FE+KPVM QM+Q   QF G P EDPH H++SF  +  S
Subjt:  RQIRRMENNRNAPPPQADPE---PNAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQMIQNARQFGGHPREDPHEHIRSFYSICAS

Query:  FHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGILKCILME
        F   G+  + +R +LFP +LRD AK W N L    + +W+ L EKF+ K+FPP  NAR R E+++FQQ + E L +AW RFK M++ CPH+G+  CI ME
Subjt:  FHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGILKCILME

Query:  VFYFGLNKATQQTADA
         FY GLN AT+Q  DA
Subjt:  VFYFGLNKATQQTADA

U5CUI2 Retrotrans_gag domain-containing protein1.5e-4749.48Show/hide
Query:  NAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQMIQNARQFGGHPREDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEA
        N   +A D  R IR YAAP     +PGI  P   +  +FE+KPVM QM+Q   QF G P EDPH H+RSF  +  SF + G+S E LR  LFP +LRD A
Subjt:  NAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQMIQNARQFGGHPREDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEA

Query:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGILKCILMEVFYFGLNKATQQTADA
        + W N L    V  W+ L EKF++K+FPP  NA+ R E+MSFQQ + E+  DAW RFK +++ CPH+GI  CI ME FY GLN A++   DA
Subjt:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGILKCILMEVFYFGLNKATQQTADA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGATAGCGAACAGCCATTCGAACTTGACCCTGAGATTGAGCGAACATTTCGGGGTAATCGGCGAAGAGCAAGGCAAAGACAAATTCGTAGAATGGAAAATAACAG
AAATGCTCCTCCGCCGCAAGCTGACCCAGAACCCAATGCCGCCTATATCGCACACGACTTAGACAGGCCAATTAGATCTTATGCGGCACCCAACCTTTATAACTTCAGTC
CAGGAATCGCCTACCCTGTATTTGGCGAGAACGCCAGGTTTGAAATCAAACCTGTTATGCTTCAGATGATTCAGAACGCCAGACAATTCGGCGGGCATCCTAGGGAAGAC
CCACACGAGCATATAAGGAGTTTCTACTCCATCTGCGCTTCTTTCCACATGTTAGGCATCTCACCTGAAGAATTAAGATTCGCCCTCTTCCCGTTAACTCTGAGGGATGA
GGCGAAGAGGTGGGCAAATGCCTTGGAAGATGGCGAGGTGGGAACTTGGGATCAATTAATAGAAAAATTTATGAAGAAATTCTTCCCACCTCATGAAAATGCAAGAAGAA
GGAAGGAGCTTATGAGCTTCCAGCAGAGGGATAGAGAAAACCTACATGATGCGTGGAGTAGGTTTAAAAGGATGGTCAAAGCATGCCCCCACAATGGCATTCTTAAATGC
ATATTGATGGAGGTGTTCTATTTTGGACTGAACAAGGCAACACAGCAGACTGCTGATGCTGTGTTTGTAGACGTACATACAACCAGATTAAGACGACGCTGGATACGATG
GCCAGCAACAAGGAAGAATGGGATGAAGATGATTTCGGCAATCGCCGAGGAGGACGAGCAAAAGATGATGGTATGGATAGGAACGCCGTGGTGGCACTGCAGGGACAAAT
GA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGATAGCGAACAGCCATTCGAACTTGACCCTGAGATTGAGCGAACATTTCGGGGTAATCGGCGAAGAGCAAGGCAAAGACAAATTCGTAGAATGGAAAATAACAG
AAATGCTCCTCCGCCGCAAGCTGACCCAGAACCCAATGCCGCCTATATCGCACACGACTTAGACAGGCCAATTAGATCTTATGCGGCACCCAACCTTTATAACTTCAGTC
CAGGAATCGCCTACCCTGTATTTGGCGAGAACGCCAGGTTTGAAATCAAACCTGTTATGCTTCAGATGATTCAGAACGCCAGACAATTCGGCGGGCATCCTAGGGAAGAC
CCACACGAGCATATAAGGAGTTTCTACTCCATCTGCGCTTCTTTCCACATGTTAGGCATCTCACCTGAAGAATTAAGATTCGCCCTCTTCCCGTTAACTCTGAGGGATGA
GGCGAAGAGGTGGGCAAATGCCTTGGAAGATGGCGAGGTGGGAACTTGGGATCAATTAATAGAAAAATTTATGAAGAAATTCTTCCCACCTCATGAAAATGCAAGAAGAA
GGAAGGAGCTTATGAGCTTCCAGCAGAGGGATAGAGAAAACCTACATGATGCGTGGAGTAGGTTTAAAAGGATGGTCAAAGCATGCCCCCACAATGGCATTCTTAAATGC
ATATTGATGGAGGTGTTCTATTTTGGACTGAACAAGGCAACACAGCAGACTGCTGATGCTGTGTTTGTAGACGTACATACAACCAGATTAAGACGACGCTGGATACGATG
GCCAGCAACAAGGAAGAATGGGATGAAGATGATTTCGGCAATCGCCGAGGAGGACGAGCAAAAGATGATGGTATGGATAGGAACGCCGTGGTGGCACTGCAGGGACAAAT
GA
Protein sequenceShow/hide protein sequence
MSDSEQPFELDPEIERTFRGNRRRARQRQIRRMENNRNAPPPQADPEPNAAYIAHDLDRPIRSYAAPNLYNFSPGIAYPVFGENARFEIKPVMLQMIQNARQFGGHPRED
PHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGILKC
ILMEVFYFGLNKATQQTADAVFVDVHTTRLRRRWIRWPATRKNGMKMISAIAEEDEQKMMVWIGTPWWHCRDK