CuGenDBv2

PI0013989 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0013989
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr02:6488668..6489471
RNA-Seq Expression: PI0013989
Synteny: PI0013989
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
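The genome location string can be split into chromosome, start, and end to get the span of the gene. The sketch below is illustrative only: `parse_location` is a hypothetical helper, not part of any CuGenDB tooling, and it assumes the coordinates are 1-based and inclusive at both ends.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into its parts.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    on the record page), so the span is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    chrom, start, end, span = parse_location("chr02:6488668..6489471")
    print(chrom, start, end, span)
```

Under that assumption the span comes out as 804 bp, which is a multiple of three, as expected for a single-exon coding sequence that includes its stop codon.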


Homology
GenBank top hits
WP_217833177.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002] (e value: 1.5e-69; % identity: 46.64)
Query:  MSDSEQPPFELDPEIERTFRSNRRRARQRQARRMENNNRNA-----------------PPPHADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAF
        MS+ + P F++DPEIERTFR   R+ +QR++ +    N +A                    HA+ + N   +AHD +RP+R YA+PNLYNF PGI  P F
Subjt:  MSDSEQPPFELDPEIERTFRSNRRRARQRQARRMENNNRNA-----------------PPPHADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAF

Query:  GENARFEIKLVMLQMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEAKRWANALEDGEVGTWDQLIEKFIKKFFPPHENA
          N RFE+K VMLQM+Q A QFGG  GEDPH H++SF  I ++F M G+  + +R TLFP +LRDEA++WA + E GE+ TW +++EKF++K+FPP  +A
Subjt:  GENARFEIKLVMLQMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEAKRWANALEDGEVGTWDQLIEKFIKKFFPPHENA

Query:  RRRKKLMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLKNTYNQIKTKLNTMASN
        +RR+ +++F+QKD E   +AW+RFKR+V+ CPHNGIP C+ ME+FY  LNK +Q   DA  A  ++  TY Q K  L+ ++ N
Subjt:  RRRKKLMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLKNTYNQIKTKLNTMASN

XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus] (e value: 5.2e-54; % identity: 45.76)
Query:  MSDSEQPPFELDPEIERTFRSNRRRARQRQARR----MENNNRNAPPPHADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVML
        MS      F  DPEIERTF  NRRR  QR+ ++    M++N  N   P     P  A+I  D DR IR YAAP     N GI  P   +  +FE+K VM 
Subjt:  MSDSEQPPFELDPEIERTFRSNRRRARQRQARR----MENNNRNAPPPHADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVML

Query:  QMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEAKRWANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQKD
        QM+Q   QF G P EDPH H+R F  I  SF   G+  + LR  LFP ++RD A+ W N+L  G V TW+ L EKF+ K+FPP+ NA+ R ++ SFQQ+D
Subjt:  QMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEAKRWANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQKD

Query:  RENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLKNTYNQIKTKLNTMASNN
         E+L+DAW RFK +++ CPH+GI  CI ME FY  LN  T+  VDA     +L  +YNQ    L T+A+ N
Subjt:  RENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLKNTYNQIKTKLNTMASNN

XP_017239618.1 PREDICTED: uncharacterized protein LOC108212402 [Daucus carota subsp. sativus] (e value: 8.3e-52; % identity: 45.39)
Query:  MSDSEQPPFELDPEIERTFRSNRRRARQRQARR----MENNNRNAPPPHADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVML
        MS+     F  DPEIERTF  NRRR  QR+ ++    M +N  N   P     P  A+I  D DR IR YAAP     N GI  P   +  +FE+K VM 
Subjt:  MSDSEQPPFELDPEIERTFRSNRRRARQRQARR----MENNNRNAPPPHADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVML

Query:  QMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEAKRWANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQKD
        QM+Q   QF G P EDPH H+R F  I  SF   G+  + LR  LFP ++RD A+ W N+L  G V TW+ L EKF+ K+FPP+ NA+   ++ SFQQ+D
Subjt:  QMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEAKRWANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQKD

Query:  RENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLKNTYNQIKTKLNTMASNN
         E+L+DAW RFK +++ CPH+GI   I ME FY  LN  T+  VDA     +L  +YNQ    L T+A+NN
Subjt:  RENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLKNTYNQIKTKLNTMASNN

XP_030508936.1 uncharacterized protein LOC115723589 [Cannabis sativa] (e value: 2.2e-52; % identity: 45.14)
Query:  LDPEIERTFRSNRRRARQRQARRMENNNRNAPPPHADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVMLQMIQNAEQFGGHPG
        +DPEIERTF   R+R ++++A++  N            E N   +A D  R IR YAAP     NPGI  P   +   FE+K VM QM+Q   QFGG P 
Subjt:  LDPEIERTFRSNRRRARQRQARRMENNNRNAPPPHADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVMLQMIQNAEQFGGHPG

Query:  EDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEAKRWANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRM
        EDPH HIRSF  +  SF + G+S E LR  LFP +LRD A+ W N L    V  W+ L EKF++K+FPP  NA+ R ++MSFQQ + E   DAW RFK +
Subjt:  EDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEAKRWANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRM

Query:  VKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLKNTYNQIKTKLNTMASNN
        ++ CPH+GIP CI +E FY  LN A +  +DA     +L  +YN+    L  +ASNN
Subjt:  VKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLKNTYNQIKTKLNTMASNN

XP_038887458.1 uncharacterized protein LOC120077591 [Benincasa hispida] (e value: 4.4e-61; % identity: 47.6)
Query:  MSDSEQPPFELDPEIERTF--RSNRRRARQRQARRMENNNRNAPPPHAD---PEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVM
        MS    P FE +PEI+ TF  R+++ RA +R+    +NNN  AP  +     P  +  ++A D + PIR+YAAPNLY+F+PGI+ P   ENARFEIK VM
Subjt:  MSDSEQPPFELDPEIERTF--RSNRRRARQRQARRMENNNRNAPPPHAD---PEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVM

Query:  LQMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEAKRWANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQK
        +QMIQN  QF     E+PH H+  F  + ++F +PGI+P  +R  LFP TLRD+AKRWA++LE  E+ + DQL+E F+KKFFPP  N RRRK +++F++ 
Subjt:  LQMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEAKRWANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQK

Query:  DRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLKNTYNQIKTKLNTMASN
        D E L  AW RF+R+VK CPH GI  C+LME+FY  LN++TQ   DA   +  +  TY + K  L+ ++ N
Subjt:  DRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLKNTYNQIKTKLNTMASN

TrEMBL top hits
A0A5A7V1F3 Retrotrans_gag domain-containing protein (e value: 7.3e-46; % identity: 47.2)
Query:  YIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVMLQMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEAKRW
        Y+AH+L RPIRSYA P+LY FNPGIAYP FGENA +E K                                                       D+AKRW
Subjt:  YIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVMLQMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEAKRW

Query:  ANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLKNTY
        AN++E GEV TW+ LIEKF+KKFFP  + A+RR+ L+ F+Q+DR+NLHDAWS FKRMVKAC H+GI K +LME FYF L+K T+Q+ D++F   +L+++Y
Subjt:  ANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLKNTY

Query:  NQIKTKLNTMASNN
        NQIK  L++MA+N+
Subjt:  NQIKTKLNTMASNN

A0A6J1EEI2 uncharacterized protein LOC111433394 (e value: 1.0e-47; % identity: 45.62)
Query:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVMLQMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEA
        NA ++A D +R IR+YA P +   NP I  P   +   FE+K VM QM+Q   QF G P EDPH H++SF  +  SF    +  + +R +LFP +LRD A
Subjt:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVMLQMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEA

Query:  KRWANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLK
        K W N L  G + +W+ L+EKF+ K+FPP  NAR R +++ FQQ + + L +AW RFK M++ CPH+G+P CI ME FY  LN AT+Q VDA     +L 
Subjt:  KRWANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLK

Query:  NTYNQIKTKLNTMASNN
         TYN+    L  +ASNN
Subjt:  NTYNQIKTKLNTMASNN

A0A6J1G7Q6 uncharacterized protein LOC111451598 (e value: 4.8e-45; % identity: 43.32)
Query:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVMLQMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEA
        NA ++A D +R IR+YA P +   NP I  P   +   FE+K VM QM+Q   QF G   +DPH H++SF  +  SF   G+  + +R + F  +LRD A
Subjt:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVMLQMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEA

Query:  KRWANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLK
        K W N L  G + +W+ L EKF+ K+FPP  +AR R ++++FQ+ + E L +AW RFK  ++ CPH+G+P CI +E FY  LN AT+Q VDA    D+L 
Subjt:  KRWANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLK

Query:  NTYNQIKTKLNTMASNN
         TYN+    L  +ASNN
Subjt:  NTYNQIKTKLNTMASNN

A0A6J1H7E4 uncharacterized protein LOC111461168 (e value: 2.7e-48; % identity: 46.54)
Query:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVMLQMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEA
        NA  +A D +R IR+YA P +   NP I  P   +   FE+K VM QM+Q   QF G P EDPH H++SF  +  SF   G+  + +R +LFP +LRD A
Subjt:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVMLQMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEA

Query:  KRWANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLK
        K W N L    + +W+ L EKF+ K+FPP  NAR R ++++FQQ + E L +AW RFK M++ CPH+G+P CI ME FY  LN AT+Q VDA     ML 
Subjt:  KRWANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLK

Query:  NTYNQIKTKLNTMASNN
         TYN+    L  +ASNN
Subjt:  NTYNQIKTKLNTMASNN

U5CUI2 Retrotrans_gag domain-containing protein (e value: 3.8e-50; % identity: 47.47)
Query:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVMLQMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEA
        N   +A D  R IR YAAP     NPGI  P   +  +FE+K VM QM+Q   QF G P EDPH H+RSF  +  SF + G+S E LR  LFP +LRD A
Subjt:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVMLQMIQNAEQFGGHPGEDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEA

Query:  KRWANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLK
        + W N L    V  W+ L EKF++K+FPP  NA+ R ++MSFQQ + E+  DAW RFK +++ CPH+GIP CI ME FY  LN A++  +DA     +L 
Subjt:  KRWANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFELNKATQQTVDAVFADDMLK

Query:  NTYNQIKTKLNTMASNN
         +YN+    L T+ASNN
Subjt:  NTYNQIKTKLNTMASNN

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAGTGATAGCGAACAACCACCATTCGAACTTGACCCTGAAATTGAGCGAACCTTTCGGAGTAACCGACGAAGAGCAAGGCAGAGACAAGCTAGAAGGATGGAAAATAA
CAACAGAAATGCCCCTCCGCCGCATGCTGACCCAGAACCAAATGCTGCTTACATAGCACACGACTTGGATAGGCCGATTAGATCTTATGCGGCACCCAACCTCTACAACT
TTAACCCAGGAATCGCCTACCCTGCATTCGGCGAGAACGCCAGGTTTGAAATCAAACTTGTTATGCTTCAGATGATTCAGAACGCCGAACAATTCGGCGGACATCCTGGG
GAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTACGCTTCTTTCCACATGCCAGGCATTTCACCTGAAGAATTGAGATTCACCCTCTTCCCGTTAACTCTGAG
GGATGAGGCGAAGAGGTGGGCAAATGCTCTGGAAGATGGCGAGGTGGGAACATGGGATCAACTAATAGAGAAATTTATAAAGAAATTTTTTCCACCTCACGAAAACGCTA
GAAGAAGGAAGAAGCTTATGAGCTTCCAGCAGAAGGATAGAGAAAACCTACATGACGCGTGGAGTAGGTTCAAAAGGATGGTTAAAGCATGTCCCCACAATGGCATTCCT
AAATGCATATTGATGGAAGTTTTCTATTTTGAACTAAACAAGGCTACACAGCAGACTGTTGACGCTGTGTTTGCAGACGACATGCTAAAAAACACATACAACCAGATTAA
GACGAAGCTGAATACGATGGCCAGCAACAACTAG
mRNA sequence
ATGAGTGATAGCGAACAACCACCATTCGAACTTGACCCTGAAATTGAGCGAACCTTTCGGAGTAACCGACGAAGAGCAAGGCAGAGACAAGCTAGAAGGATGGAAAATAA
CAACAGAAATGCCCCTCCGCCGCATGCTGACCCAGAACCAAATGCTGCTTACATAGCACACGACTTGGATAGGCCGATTAGATCTTATGCGGCACCCAACCTCTACAACT
TTAACCCAGGAATCGCCTACCCTGCATTCGGCGAGAACGCCAGGTTTGAAATCAAACTTGTTATGCTTCAGATGATTCAGAACGCCGAACAATTCGGCGGACATCCTGGG
GAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTACGCTTCTTTCCACATGCCAGGCATTTCACCTGAAGAATTGAGATTCACCCTCTTCCCGTTAACTCTGAG
GGATGAGGCGAAGAGGTGGGCAAATGCTCTGGAAGATGGCGAGGTGGGAACATGGGATCAACTAATAGAGAAATTTATAAAGAAATTTTTTCCACCTCACGAAAACGCTA
GAAGAAGGAAGAAGCTTATGAGCTTCCAGCAGAAGGATAGAGAAAACCTACATGACGCGTGGAGTAGGTTCAAAAGGATGGTTAAAGCATGTCCCCACAATGGCATTCCT
AAATGCATATTGATGGAAGTTTTCTATTTTGAACTAAACAAGGCTACACAGCAGACTGTTGACGCTGTGTTTGCAGACGACATGCTAAAAAACACATACAACCAGATTAA
GACGAAGCTGAATACGATGGCCAGCAACAACTAG
Protein sequence
MSDSEQPPFELDPEIERTFRSNRRRARQRQARRMENNNRNAPPPHADPEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKLVMLQMIQNAEQFGGHPG
EDPHEHIRSFYSIYASFHMPGISPEELRFTLFPLTLRDEAKRWANALEDGEVGTWDQLIEKFIKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVKACPHNGIP
KCILMEVFYFELNKATQQTVDAVFADDMLKNTYNQIKTKLNTMASNN
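
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. `translate` is a hypothetical helper written for illustration, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(
        (a + b + c for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
)


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Walk the sequence codon by codon; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nucleotides of the CDS sequence listed above.
    cds_start = "ATGAGTGATAGCGAACAACCACCATTCGAA"
    print(translate(cds_start))  # MSDSEQPPFE
```

The translated fragment matches the first ten residues of the listed protein sequence. Passing the full CDS (with line breaks removed) to the same function would allow the whole record to be checked in the same way.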