CuGenDBv2

PI0007155 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0007155
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr07:12880157..12881059
RNA-Seq Expression: PI0007155
Synteny: PI0007155
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
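
The genome location field uses a `chromosome:start..end` layout. A minimal sketch of reading it and computing the span length follows; the function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr07:12880157..12881059")
print(chrom, start, end, span_length(start, end))
```

Under that inclusive-coordinate assumption, the span for this gene works out to 903 bp.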


Homology
GenBank top hits (e value, % identity, alignment)
WP_217833177.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002] | e value: 9.2e-65 | % identity: 44.88
Query:  MSNGEQPQFELHPEIERTFRCNRRRARQRQVRRMENNNRNAPPSQ-----------------AAPEPNAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVF
        MS G+ P F++ PEIERTFR   R+ +QR+  +    N +A  +Q                 A  + N   +AHD +RP+R YA+PNLYNF PGI  P F
Subjt:  MSNGEQPQFELHPEIERTFRCNRRRARQRQVRRMENNNRNAPPSQ-----------------AAPEPNAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVF

Query:  GGNARFEIKPVMLQMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENA
         GN RFE+KPVMLQM+  AGQFGG  GEDPH H++SF  I ++F + G+  + +R  LFP +LRDEA++WA + E GE+ TW +++EKFM+K+FPP  +A
Subjt:  GGNARFEIKPVMLQMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENA

Query:  RRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGILECILME------------TADTVFVGGMLKSSYNQIKATLDTMANN
        +RR+++++F+QKD E   +AW+RFKR+V+ CPHNGI  C+ ME             AD    GG++  +Y Q K  LD ++ N
Subjt:  RRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGILECILME------------TADTVFVGGMLKSSYNQIKATLDTMANN

XP_017216983.1 PREDICTED: uncharacterized protein LOC108194534 [Daucus carota subsp. sativus] | e value: 6.4e-50 | % identity: 43.8
Query:  MSNGEQPQFELHPEIERTFRCNRRRARQRQVRRME-------NNNRNAPPSQAAPEPNAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKP
        MS      F   P IERTF  NRRR  QR++++ +       NN            P  A++  D DR IR YAAP     N GI  P      +FE+KP
Subjt:  MSNGEQPQFELHPEIERTFRCNRRRARQRQVRRME-------NNNRNAPPSQAAPEPNAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKP

Query:  VMLQMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQ
        VM QM+   GQF G P EDPH H+R F  I  SF   G++ + LR  LFP  +RD A+ W N+L  G V  W+ L EKF+ K+FPP+ NA+ R E+ SFQ
Subjt:  VMLQMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQ

Query:  QKDRENLHDAWSRFKRMVKACPHNGILECILMET------ADTVFV------GGMLKSSYNQIKATLDTMANNN
        Q+D E+L+DAW RFK +++ CPH+GIL CI MET      A T  V      G +L  SYNQ    L+T+A NN
Subjt:  QKDRENLHDAWSRFKRMVKACPHNGILECILMET------ADTVFV------GGMLKSSYNQIKATLDTMANNN

XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus] | e value: 1.2e-51 | % identity: 43
Query:  MSNGEQPQFELHPEIERTFRCNRRRARQRQVRR----MENNNRNAPPSQAAPEPNAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKPVML
        MS      F   PEIERTF  NRRR  QR++++    M++N  N         P  A++  D DR IR YAAP     N GI  P      +FE+KPVM 
Subjt:  MSNGEQPQFELHPEIERTFRCNRRRARQRQVRR----MENNNRNAPPSQAAPEPNAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKPVML

Query:  QMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKD
        QM+   GQF G P EDPH H+R F  I  SF   G+  + LR  LFP ++RD A+ W N+L  G V TW+ L EKF+ K+FPP+ NA+ R E+ SFQQ+D
Subjt:  QMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKD

Query:  RENLHDAWSRFKRMVKACPHNGILECILMET------ADTVFV------GGMLKSSYNQIKATLDTMANNN--------ENGMKMISTIPEEDDQEAMKA
         E+L+DAW RFK +++ CPH+GIL CI MET      A T  V      G +L  SYNQ    L+T+A  N        + G K ++ I + D   +MKA
Subjt:  RENLHDAWSRFKRMVKACPHNGILECILMET------ADTVFV------GGMLKSSYNQIKATLDTMANNN--------ENGMKMISTIPEEDDQEAMKA

XP_017239618.1 PREDICTED: uncharacterized protein LOC108212402 [Daucus carota subsp. sativus] | e value: 4.2e-49 | % identity: 42.42
Query:  MSNGEQPQFELHPEIERTFRCNRRRARQRQVRRME-NNNRNAPPSQAAPEPNAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKPVMLQMI
        MS      F   PEIERTF  NRRR  QR++++ +     N         P  A++  D DR IR YAAP     N GI  P      +FE+KPVM QM+
Subjt:  MSNGEQPQFELHPEIERTFRCNRRRARQRQVRRME-NNNRNAPPSQAAPEPNAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKPVMLQMI

Query:  LNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDREN
           GQF G P EDPH H+R F  I  SF   G+  + LR  LFP ++RD A+ W N+L  G V TW+ L EKF+ K+FPP+ NA+   E+ SFQQ+D E+
Subjt:  LNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDREN

Query:  LHDAWSRFKRMVKACPHNGILECILMET------ADTVFV------GGMLKSSYNQIKATLDTMANNN--------ENGMKMISTIPEEDDQEAMKA
        L+DAW RFK +++ CPH+GIL  I MET      A T  V      G +L  SYNQ    L+T+A NN        + G K ++ I + D   +MKA
Subjt:  LHDAWSRFKRMVKACPHNGILECILMET------ADTVFV------GGMLKSSYNQIKATLDTMANNN--------ENGMKMISTIPEEDDQEAMKA

XP_038887458.1 uncharacterized protein LOC120077591 [Benincasa hispida] | e value: 2.9e-58 | % identity: 45.62
Query:  MSNGEQPQFELHPEIERTF--RCNRRRARQRQVRRMENNNRNAPPSQAA---PEPNAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKPVM
        MS    P+FE +PEI+ TF  R ++ RA +R++   +NNN  AP        P  +  ++A D + PIR+YAAPNLY+F+PGI+ P+   NARFEIKPVM
Subjt:  MSNGEQPQFELHPEIERTF--RCNRRRARQRQVRRMENNNRNAPPSQAA---PEPNAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKPVM

Query:  LQMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQK
        +QMI N  QF     E+PH H+  F  + ++F +PGI+P  +R  LFP TLRD+AKRWA++LE  E+ + DQL+E FMKKFFPP  N RRRK +++F++ 
Subjt:  LQMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQK

Query:  DRENLHDAWSRFKRMVKACPHNGILECILME------------TADTVFVGGMLKSSYNQIKATLDTMANNNEN
        D E L  AW RF+R+VK CPH GIL+C+LME             AD   V   +  +Y + K  LD ++ N ++
Subjt:  DRENLHDAWSRFKRMVKACPHNGILECILME------------TADTVFVGGMLKSSYNQIKATLDTMANNNEN

TrEMBL top hits (e value, % identity, alignment)
A0A392NID4 Retrotrans_gag domain-containing protein (Fragment) | e value: 8.2e-43 | % identity: 39.84
Query:  PQFELHPEIERTFRCNRRRARQRQ----------VRRMENNNRNAPPSQAAPEPNAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKPVML
        P++   PEIERTFR  RR  R+                E       P     EP+   +A+D  R IR YAA +    N GI  P     A+FE KP+M 
Subjt:  PQFELHPEIERTFRCNRRRARQRQ----------VRRMENNNRNAPPSQAAPEPNAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKPVML

Query:  QMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKD
        QM+   GQF     EDPH H++ F  + ++F +PGI+ +  R  LFP +LRD  K W N+LE   +  W+ L EKF+ K+FPP +NA+ R ++ SF+Q D
Subjt:  QMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKD

Query:  RENLHDAWSRFKRMVKACPHNGILECILMETADTVFVGGMLKSSYNQIKAT
         E L DAW R+K M++ CPHNGI  CI +ET    F  G++ +S N + A+
Subjt:  RENLHDAWSRFKRMVKACPHNGILECILMETADTVFVGGMLKSSYNQIKAT

A0A5A7V1F3 Retrotrans_gag domain-containing protein | e value: 8.7e-45 | % identity: 43.93
Query:  PSQAAPEPNAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKPVMLQMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALF
        P   A      Y+AH+L RPIRSYA P+LY FNPGIAYP FG NA +E K                                                  
Subjt:  PSQAAPEPNAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKPVMLQMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALF

Query:  PLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGILECILME------------TADT
             D+AKRWAN++E GEV TW+ LIEKFMKKFFP  + A+RR++L+ F+Q+DR+NLHDAWS FKRMVKAC H+GI + +LME            +AD+
Subjt:  PLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGILECILME------------TADT

Query:  VFVGGMLKSSYNQIKATLDTMANNNENGMKMISTIPEED
        +F+GG+L+SSYNQIKA LD+MANN+++    +  + E D
Subjt:  VFVGGMLKSSYNQIKATLDTMANNNENGMKMISTIPEED

A0A6J1EEI2 uncharacterized protein LOC111433394 | e value: 2.4e-42 | % identity: 41.94
Query:  NAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKPVMLQMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEA
        NA ++A D +R IR+YA P +   NP I  P       FE+KPVM QM+   GQF G P EDPH H++SF  +  SF    +  + +R +LFP +LRD A
Subjt:  NAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKPVMLQMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEA

Query:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGILECILMET------------ADTVFVGGMLK
        K W N L  G + +W+ L+EKF+ K+FPP  NAR R E++ FQQ + + L +AW RFK M++ CPH+G+  CI MET             D    G +L 
Subjt:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGILECILMET------------ADTVFVGGMLK

Query:  SSYNQIKATLDTMANNN
         +YN+    L+ +A+NN
Subjt:  SSYNQIKATLDTMANNN

A0A6J1H7E4 uncharacterized protein LOC111461168 | e value: 6.3e-43 | % identity: 42.86
Query:  NAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKPVMLQMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEA
        NA  +A D +R IR+YA P +   NP I  P       FE+KPVM QM+   GQF G P EDPH H++SF  +  SF   G+  + +R +LFP +LRD A
Subjt:  NAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKPVMLQMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEA

Query:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGILECILMET------------ADTVFVGGMLK
        K W N L    + +W+ L EKF+ K+FPP  NAR R E+++FQQ + E L +AW RFK M++ CPH+G+  CI MET             D    G ML 
Subjt:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGILECILMET------------ADTVFVGGMLK

Query:  SSYNQIKATLDTMANNN
         +YN+    L+ +A+NN
Subjt:  SSYNQIKATLDTMANNN

U5CUI2 Retrotrans_gag domain-containing protein | e value: 5.5e-47 | % identity: 46.08
Query:  NAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKPVMLQMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEA
        N   +A D  R IR YAAP     NPGI  P      +FE+KPVM QM+   GQF G P EDPH H+RSF  +  SF + G+S E LR  LFP +LRD A
Subjt:  NAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKPVMLQMILNAGQFGGHPGEDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEA

Query:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGILECILMET------------ADTVFVGGMLK
        + W N L    V  W+ L EKF++K+FPP  NA+ R E+MSFQQ + E+  DAW RFK +++ CPH+GI  CI MET             D    G +L 
Subjt:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGILECILMET------------ADTVFVGGMLK

Query:  SSYNQIKATLDTMANNN
         SYN+    L+T+A+NN
Subjt:  SSYNQIKATLDTMANNN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAGTAACGGTGAACAACCGCAATTTGAACTTCACCCTGAAATTGAGCGAACCTTTCGGTGTAATCGGCGAAGAGCAAGGCAAAGACAAGTAAGGAGAATGGAAAACAA
TAACAGAAATGCCCCTCCGTCGCAAGCTGCCCCAGAACCAAACGCCGCCTACATGGCACATGACTTGGATAGACCGATTAGATCGTATGCGGCACCCAACCTCTACAACT
TCAACCCAGGGATCGCTTACCCTGTGTTCGGCGGAAATGCTAGGTTTGAAATAAAGCCTGTAATGCTTCAAATGATTCTGAACGCCGGACAATTTGGCGGTCATCCTGGA
GAAGATCCACATGAACATATTAGGAGTTTTTATTCTATCTATGCGTCCTTCCATGTGCCAGGCATCTCACCTGAGGAACTGAGATTCGCCCTCTTCCCGTTAACTCTGAG
GGACGAGGCGAAGAGGTGGGCCAATGCCTTGGAAGATGGCGAGGTGGGAACATGGGATCAATTGATAGAGAAATTCATGAAGAAATTCTTCCCACCTCACGAAAATGCCA
GAAGAAGGAAGGAGCTCATGAGCTTCCAGCAAAAGGATAGAGAAAACCTACATGATGCATGGAGTAGGTTTAAGAGGATGGTGAAAGCATGCCCCCACAATGGCATTCTT
GAGTGCATATTGATGGAGACTGCTGATACTGTGTTTGTAGGTGGTATGCTAAAGAGCTCCTACAACCAGATTAAGGCGACGTTGGACACAATGGCCAACAACAATGAAAA
TGGGATGAAGATGATTTCGACAATCCCCGAGGAGGACGATCAAGAGGCGATGAAGGCATGGATAAGAACGCCGTGGTGGCGTTGCAGGGACAAATGA
mRNA sequence
ATGAGTAACGGTGAACAACCGCAATTTGAACTTCACCCTGAAATTGAGCGAACCTTTCGGTGTAATCGGCGAAGAGCAAGGCAAAGACAAGTAAGGAGAATGGAAAACAA
TAACAGAAATGCCCCTCCGTCGCAAGCTGCCCCAGAACCAAACGCCGCCTACATGGCACATGACTTGGATAGACCGATTAGATCGTATGCGGCACCCAACCTCTACAACT
TCAACCCAGGGATCGCTTACCCTGTGTTCGGCGGAAATGCTAGGTTTGAAATAAAGCCTGTAATGCTTCAAATGATTCTGAACGCCGGACAATTTGGCGGTCATCCTGGA
GAAGATCCACATGAACATATTAGGAGTTTTTATTCTATCTATGCGTCCTTCCATGTGCCAGGCATCTCACCTGAGGAACTGAGATTCGCCCTCTTCCCGTTAACTCTGAG
GGACGAGGCGAAGAGGTGGGCCAATGCCTTGGAAGATGGCGAGGTGGGAACATGGGATCAATTGATAGAGAAATTCATGAAGAAATTCTTCCCACCTCACGAAAATGCCA
GAAGAAGGAAGGAGCTCATGAGCTTCCAGCAAAAGGATAGAGAAAACCTACATGATGCATGGAGTAGGTTTAAGAGGATGGTGAAAGCATGCCCCCACAATGGCATTCTT
GAGTGCATATTGATGGAGACTGCTGATACTGTGTTTGTAGGTGGTATGCTAAAGAGCTCCTACAACCAGATTAAGGCGACGTTGGACACAATGGCCAACAACAATGAAAA
TGGGATGAAGATGATTTCGACAATCCCCGAGGAGGACGATCAAGAGGCGATGAAGGCATGGATAAGAACGCCGTGGTGGCGTTGCAGGGACAAATGA
Protein sequence
MSNGEQPQFELHPEIERTFRCNRRRARQRQVRRMENNNRNAPPSQAAPEPNAAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGGNARFEIKPVMLQMILNAGQFGGHPG
EDPHEHIRSFYSIYASFHVPGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIL
ECILMETADTVFVGGMLKSSYNQIKATLDTMANNNENGMKMISTIPEEDDQEAMKAWIRTPWWRCRDK