CuGenDBv2

PI0014925 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0014925
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr09:12936203..12941253
RNA-Seq Expression: PI0014925
Synteny: PI0014925
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
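
The genome location field uses a chromosome:start..end layout. A minimal Python sketch for splitting it into parts is shown here; the function name parse_location is illustrative and not part of CuGenDB, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. The source page does not confirm this convention.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = m.group(1)
    start = int(m.group(2))
    end = int(m.group(3))
    return chrom, start, end, end - start + 1

# Location string copied from the record above.
chrom, start, end, span = parse_location("chr09:12936203..12941253")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the locus spans 5051 bp.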


Homology
GenBank top hits (hit | e value | % identity | alignment)
KAA0061983.1 uncharacterized protein E6C27_scaffold89G002850 [Cucumis melo var. makuwa] | e value: 9.4e-50 | identity: 58.72%
Query:  PPQAAQEPNAAYMAHDLDRSIRSYVVPNLYNFNPGIAYLVFGENA----RDEAKRCANAMEDGEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDR
        P   A      Y+AH+L R IRSY +P+LY FNPGIAY  FGENA    +D+AKR AN+ME GEV TW+ LIEKFMKKFFP  +  +RR++L+ F+Q+DR
Subjt:  PPQAAQEPNAAYMAHDLDRSIRSYVVPNLYNFNPGIAYLVFGENA----RDEAKRCANAMEDGEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDR

Query:  ENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTANAVFVGGMLKSSYNQIKVMLDTMASNNEE
        +N+HDA S FKRMVK C H+GI + +LME FYFGL+K T+Q+A+++F+GG+L+SSYNQIK MLD+MA+N+++
Subjt:  ENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTANAVFVGGMLKSSYNQIKVMLDTMASNNEE

TYK30531.1 Retrovirus-related Pol polyprotein from transposon 17.6 [Cucumis melo var. makuwa] | e value: 1.4e-37 | identity: 46.98%
Query:  PPQAAQEPNAAYMAHDLDRSIRSYVVPNLYNFNPGIAYLVFGEN----ARDEAKRCANAMEDGEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDR
        P   A      Y+AH+L R IRSY +P+LY FNPGIAYL FGEN    ++D+AKR AN+ME GEV TW+ LIEKFMKKFFP  +  +RR++L+ F+QKDR
Subjt:  PPQAAQEPNAAYMAHDLDRSIRSYVVPNLYNFNPGIAYLVFGEN----ARDEAKRCANAMEDGEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDR

Query:  ENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTANAVFVGGMLKSSYNQIKVMLDTMASNNEEWGETGQQNQGRHGGQGDHRGEASSSHAW
        +N+HDA S FKRMVK C H+GI + +LME FYFGL K  ++  +    GG    + +   + L+ +AS +               G G HR E    H  
Subjt:  ENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTANAVFVGGMLKSSYNQIKVMLDTMASNNEEWGETGQQNQGRHGGQGDHRGEASSSHAW

Query:  YHNNRPQHS---QHQ
        +HNN   HS   QHQ
Subjt:  YHNNRPQHS---QHQ

WP_217833177.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002] | e value: 1.3e-38 | identity: 34.49%
Query:  FKSDGEQAQFELDPEIERTFRRNWRRARQRHARR-MENNNNRNAPPPQAAQ---------------EPNAAYMAHDLDRSIRSYVVPNLYNFNPGIAYLV
        + S+G+   F++DPEIERTFRR  R+ +QR + + +E N +     PQA Q               + N   +AHD +R +R Y  PNLYNF PGI    
Subjt:  FKSDGEQAQFELDPEIERTFRRNWRRARQRHARR-MENNNNRNAPPPQAAQ---------------EPNAAYMAHDLDRSIRSYVVPNLYNFNPGIAYLV

Query:  FGENA-----------------------------------------------------------RDEAKRCANAMEDGEVGTWDQLIEKFMKKFFPPHEN
        F  N                                                            RDEA++ A + E GE+ TW +++EKFM+K+FPP  +
Subjt:  FGENA-----------------------------------------------------------RDEAKRCANAMEDGEVGTWDQLIEKFMKKFFPPHEN

Query:  VRRRKELMTFQQKDRENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTANAVFVGGMLKSSYNQIKVMLDTMASNNEE
         +RR++++TF+QKD E   +A +RFKR+V+ CPHNGIP C+ ME+FY GLNK +Q  A+A   GG++  +Y Q K +LD ++ N  +
Subjt:  VRRRKELMTFQQKDRENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTANAVFVGGMLKSSYNQIKVMLDTMASNNEE

XP_030503898.1 uncharacterized protein LOC115719117 [Cannabis sativa] | e value: 2.0e-28 | identity: 36.36%
Query:  VNKFKSDGEQAQFELDPEIERTFRRNWRRARQRHARRMENNNNRNAPPPQAAQEPNAAYMAHDLDRSIRSYVVPNLYNFN----PGIAYLVFGENARDEA
        +N+ + D E A   +DPEIERTFR+  +  + +    M +          A  E N   +A D  R+IR Y +P     N      +   +F  + RD A
Subjt:  VNKFKSDGEQAQFELDPEIERTFRRNWRRARQRHARRMENNNNRNAPPPQAAQEPNAAYMAHDLDRSIRSYVVPNLYNFN----PGIAYLVFGENARDEA

Query:  KRCANAMEDGEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDRENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTANAVFVGGMLK
        +   N +    V  W+ L EKF++K+FPP  N + R E+M+FQQ + E   DA  RFK +++ CPH+GIP CI +E FY GLN A++   +A   G +L 
Subjt:  KRCANAMEDGEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDRENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTANAVFVGGMLK

Query:  SSYNQIKVMLDTMASNNEEW
         SYN+   +L+ +ASNN +W
Subjt:  SSYNQIKVMLDTMASNNEEW

XP_038887458.1 uncharacterized protein LOC120077591 [Benincasa hispida] | e value: 6.7e-32 | identity: 33.68%
Query:  QFELDPEIERTF--RRNWRRARQRHARRMENNNNRNAPPPQAAQEP--NAAYMAHDLDRSIRSYVVPNLYNFNPGIAYLVFGENA---------------
        +FE +PEI+ TF  R +  RA +R     +NNNN      Q    P  +  ++A D +  IR+Y  PNLY+F+PGI+  +  ENA               
Subjt:  QFELDPEIERTF--RRNWRRARQRHARRMENNNNRNAPPPQAAQEP--NAAYMAHDLDRSIRSYVVPNLYNFNPGIAYLVFGENA---------------

Query:  --------------------------------------------RDEAKRCANAMEDGEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDRENVHD
                                                    RD+AKR A+++E  E+ + DQL+E FMKKFFPP  N RRRK ++ F++ D E +  
Subjt:  --------------------------------------------RDEAKRCANAMEDGEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDRENVHD

Query:  AGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTANAVFVGGMLKSSYNQIKVMLDTMASNNEEWGETGQQNQGRHGGQGDH
        A  RF+R+VK CPH GI +C+LME+FY GLN++TQ  A+A  V   +  +Y + KV+LD ++ N ++W + G + +G    + D+
Subjt:  AGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTANAVFVGGMLKSSYNQIKVMLDTMASNNEEWGETGQQNQGRHGGQGDH

TrEMBL top hits (hit | e value | % identity | alignment)
A0A5A7V1F3 Retrotrans_gag domain-containing protein | e value: 4.5e-50 | identity: 58.72%
Query:  PPQAAQEPNAAYMAHDLDRSIRSYVVPNLYNFNPGIAYLVFGENA----RDEAKRCANAMEDGEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDR
        P   A      Y+AH+L R IRSY +P+LY FNPGIAY  FGENA    +D+AKR AN+ME GEV TW+ LIEKFMKKFFP  +  +RR++L+ F+Q+DR
Subjt:  PPQAAQEPNAAYMAHDLDRSIRSYVVPNLYNFNPGIAYLVFGENA----RDEAKRCANAMEDGEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDR

Query:  ENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTANAVFVGGMLKSSYNQIKVMLDTMASNNEE
        +N+HDA S FKRMVK C H+GI + +LME FYFGL+K T+Q+A+++F+GG+L+SSYNQIK MLD+MA+N+++
Subjt:  ENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTANAVFVGGMLKSSYNQIKVMLDTMASNNEE

A0A5D3DR17 Retrotransposon gag protein | e value: 6.6e-25 | identity: 48.63%
Query:  GEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDRENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTANAVFVGGMLKSSYNQIKVM
        GEV TW+   ++ +KKFF   +NVRRR++ M F+Q++ EN+HDA SRFKR++K C   GIPE +LMEVFYF L+K TQ T  AVF GGMLKS YNQIK M
Subjt:  GEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDRENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTANAVFVGGMLKSSYNQIKVM

Query:  LDTMASNNEEWGETGQQNQGRHGGQGDHRGEASSSHAWYHNNRPQH
        L++MA N++EW ++      R  G+   + +      W++ N   H
Subjt:  LDTMASNNEEWGETGQQNQGRHGGQGDHRGEASSSHAWYHNNRPQH

A0A5D3E4P5 Retrovirus-related Pol polyprotein from transposon 17.6 | e value: 6.8e-38 | identity: 46.98%
Query:  PPQAAQEPNAAYMAHDLDRSIRSYVVPNLYNFNPGIAYLVFGEN----ARDEAKRCANAMEDGEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDR
        P   A      Y+AH+L R IRSY +P+LY FNPGIAYL FGEN    ++D+AKR AN+ME GEV TW+ LIEKFMKKFFP  +  +RR++L+ F+QKDR
Subjt:  PPQAAQEPNAAYMAHDLDRSIRSYVVPNLYNFNPGIAYLVFGEN----ARDEAKRCANAMEDGEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDR

Query:  ENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTANAVFVGGMLKSSYNQIKVMLDTMASNNEEWGETGQQNQGRHGGQGDHRGEASSSHAW
        +N+HDA S FKRMVK C H+GI + +LME FYFGL K  ++  +    GG    + +   + L+ +AS +               G G HR E    H  
Subjt:  ENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTANAVFVGGMLKSSYNQIKVMLDTMASNNEEWGETGQQNQGRHGGQGDHRGEASSSHAW

Query:  YHNNRPQHS---QHQ
        +HNN   HS   QHQ
Subjt:  YHNNRPQHS---QHQ

A0A6J1EEI2 uncharacterized protein LOC111433394 | e value: 1.1e-24 | identity: 42.14%
Query:  VFGENARDEAKRCANAMEDGEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDRENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTA
        +F  + RD AK   N +  G + +W+ L+EKF+ K+FPP  N R R E++ FQQ + + + +A  RFK M++ CPH+G+P CI ME FY GLN AT+Q  
Subjt:  VFGENARDEAKRCANAMEDGEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDRENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTA

Query:  NAVFVGGMLKSSYNQIKVMLDTMASNNEEWGETGQQNQGR
        +A   G +L  +YN+   +L+ +ASNN +W +  + N GR
Subjt:  NAVFVGGMLKSSYNQIKVMLDTMASNNEEWGETGQQNQGR

U5CUI2 Retrotrans_gag domain-containing protein | e value: 2.3e-25 | identity: 43.61%
Query:  VFGENARDEAKRCANAMEDGEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDRENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTA
        +F  + RD A+   N +    V  W+ L EKF++K+FPP  N + R E+M+FQQ + E+  DA  RFK +++ CPH+GIP CI ME FY GLN A++   
Subjt:  VFGENARDEAKRCANAMEDGEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDRENVHDAGSRFKRMVKPCPHNGIPECILMEVFYFGLNKATQQTA

Query:  NAVFVGGMLKSSYNQIKVMLDTMASNNEEWGET
        +A   G +L  SYN+   +L+T+ASNN +W  T
Subjt:  NAVFVGGMLKSSYNQIKVMLDTMASNNEEWGET

SwissProt top hits (hit | e value | % identity | alignment)
No hits found
Arabidopsis top hits (hit | e value | % identity | alignment)
No hits found

Sequences
CDS sequence
ATGGGAAGAGGATCATATAGTACTGAAACAACAAAAGGATCAGATCAGGCCTTATTAGAAGATAATCATCGCTTGTTATCCATGATATCCACTCTCAAACTTGAGTTGCA
AGTTGTTCGTGCGGACTTTGAAGCTCTATCCAAGTCGATCAAAATGCTTAGTTCTGGCACACAAAGTCTGGACAGTATTATGAAAGCTGGCAAGATTGACGAAATGAGAA
GTAACCACGCGAAGAAAGCATCAGGCGAGGAACCACGCGGGCAACTAGGCGAACTTGAACACATGGCTAGGCGATCCAAAGCTCTAGCCGACAATCTAGCCCGTGGAGAT
GTCAGAAAGGACACCTATAACAGTGTTGCGGTGGCGAAGAGACGTAGAGACATAATCTTATCGCGGAACTTATCCTTAGCACATGAGACAGAAATGTTTAAGTGTGTGGC
GGAAGCACCGAGAGAAGTTCAATTTCAGCTGAATCGTGCGTTCAAACCTGAAGAGCGTGTGAACAAGTTTAAGAGTGACGGTGAACAAGCGCAATTCGAGCTTGACCCTG
AAATTGAGCGAACTTTTCGACGTAATTGGCGAAGAGCAAGGCAAAGACACGCAAGAAGAATGGAAAATAATAACAATAGAAATGCTCCTCCGCCGCAAGCTGCGCAAGAA
CCAAACGCCGCCTACATGGCGCATGACCTGGACAGGTCAATTAGGTCATATGTTGTGCCCAACCTCTATAACTTCAATCCAGGGATCGCCTACCTTGTGTTCGGCGAAAA
TGCAAGGGATGAGGCGAAAAGGTGTGCCAACGCCATGGAAGATGGCGAGGTAGGAACATGGGATCAATTGATAGAGAAATTTATGAAGAAATTTTTCCCACCTCACGAGA
ACGTGAGAAGAAGGAAGGAGCTCATGACCTTCCAGCAGAAAGATAGAGAGAACGTACATGATGCAGGGAGTAGGTTCAAGAGGATGGTCAAACCATGTCCCCACAATGGC
ATTCCCGAATGCATCTTGATGGAGGTCTTCTACTTTGGCTTGAACAAGGCAACACAGCAGACTGCTAATGCTGTATTTGTAGGTGGTATGTTGAAAAGCTCCTACAACCA
GATTAAGGTAATGCTGGACACAATGGCCAGCAATAATGAAGAATGGGGAGAAACGGGTCAACAAAATCAAGGACGACATGGTGGTCAAGGTGACCATCGCGGGGAAGCAT
CTAGCTCCCACGCGTGGTACCACAACAACAGACCCCAACACTCCCAGCATCAATAA
mRNA sequence
ATGGGAAGAGGATCATATAGTACTGAAACAACAAAAGGATCAGATCAGGCCTTATTAGAAGATAATCATCGCTTGTTATCCATGATATCCACTCTCAAACTTGAGTTGCA
AGTTGTTCGTGCGGACTTTGAAGCTCTATCCAAGTCGATCAAAATGCTTAGTTCTGGCACACAAAGTCTGGACAGTATTATGAAAGCTGGCAAGATTGACGAAATGAGAA
GTAACCACGCGAAGAAAGCATCAGGCGAGGAACCACGCGGGCAACTAGGCGAACTTGAACACATGGCTAGGCGATCCAAAGCTCTAGCCGACAATCTAGCCCGTGGAGAT
GTCAGAAAGGACACCTATAACAGTGTTGCGGTGGCGAAGAGACGTAGAGACATAATCTTATCGCGGAACTTATCCTTAGCACATGAGACAGAAATGTTTAAGTGTGTGGC
GGAAGCACCGAGAGAAGTTCAATTTCAGCTGAATCGTGCGTTCAAACCTGAAGAGCGTGTGAACAAGTTTAAGAGTGACGGTGAACAAGCGCAATTCGAGCTTGACCCTG
AAATTGAGCGAACTTTTCGACGTAATTGGCGAAGAGCAAGGCAAAGACACGCAAGAAGAATGGAAAATAATAACAATAGAAATGCTCCTCCGCCGCAAGCTGCGCAAGAA
CCAAACGCCGCCTACATGGCGCATGACCTGGACAGGTCAATTAGGTCATATGTTGTGCCCAACCTCTATAACTTCAATCCAGGGATCGCCTACCTTGTGTTCGGCGAAAA
TGCAAGGGATGAGGCGAAAAGGTGTGCCAACGCCATGGAAGATGGCGAGGTAGGAACATGGGATCAATTGATAGAGAAATTTATGAAGAAATTTTTCCCACCTCACGAGA
ACGTGAGAAGAAGGAAGGAGCTCATGACCTTCCAGCAGAAAGATAGAGAGAACGTACATGATGCAGGGAGTAGGTTCAAGAGGATGGTCAAACCATGTCCCCACAATGGC
ATTCCCGAATGCATCTTGATGGAGGTCTTCTACTTTGGCTTGAACAAGGCAACACAGCAGACTGCTAATGCTGTATTTGTAGGTGGTATGTTGAAAAGCTCCTACAACCA
GATTAAGGTAATGCTGGACACAATGGCCAGCAATAATGAAGAATGGGGAGAAACGGGTCAACAAAATCAAGGACGACATGGTGGTCAAGGTGACCATCGCGGGGAAGCAT
CTAGCTCCCACGCGTGGTACCACAACAACAGACCCCAACACTCCCAGCATCAATAA
Protein sequence
MGRGSYSTETTKGSDQALLEDNHRLLSMISTLKLELQVVRADFEALSKSIKMLSSGTQSLDSIMKAGKIDEMRSNHAKKASGEEPRGQLGELEHMARRSKALADNLARGD
VRKDTYNSVAVAKRRRDIILSRNLSLAHETEMFKCVAEAPREVQFQLNRAFKPEERVNKFKSDGEQAQFELDPEIERTFRRNWRRARQRHARRMENNNNRNAPPPQAAQE
PNAAYMAHDLDRSIRSYVVPNLYNFNPGIAYLVFGENARDEAKRCANAMEDGEVGTWDQLIEKFMKKFFPPHENVRRRKELMTFQQKDRENVHDAGSRFKRMVKPCPHNG
IPECILMEVFYFGLNKATQQTANAVFVGGMLKSSYNQIKVMLDTMASNNEEWGETGQQNQGRHGGQGDHRGEASSSHAWYHNNRPQHSQHQ
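
The CDS and protein records above can be checked against each other by translating the nucleotide sequence. The Python sketch below does this with the standard genetic code (NCBI translation table 1), which is an assumption, since this page does not name the table used. The helper name translate is illustrative and not part of CuGenDB. To keep the example short, it uses only the first ten codons and the last four codons of the CDS.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
# Assumption: the page does not state which translation table applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds, to_stop=True):
    """Translate a nucleotide string codon by codon.

    Unknown codons become 'X'; trailing bases that do not fill a
    codon are ignored. With to_stop=True, translation ends at the
    first stop codon and the stop itself is not emitted.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 12 nt of the CDS sequence listed above.
print(translate("ATGGGAAGAGGATCATATAGTACTGAAACA"))  # MGRGSYSTET
print(translate("CAGCATCAATAA"))                    # QHQ
```

Both fragments agree with the start (MGRGSYSTET) and the end (QHQ) of the protein sequence listed above.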