; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0001475 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0001475
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr04:24933130..24934036
RNA-Seq ExpressionPI0001475
SyntenyPI0001475
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
WP_217833177.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]6.2e-6350.21Show/hide
Query:  MSDGEQPHFELDPEIERTFRRNWRRGRQRNARR-MENNNNRNAHPPQVAQ---------------EPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVF
        MS+G+ P F++DPEIERTFRR  R+ +QR + + +E N +   + PQ  Q               + N   +AHD +RP+R YA+PNLYNF PGI  P F
Subjt:  MSDGEQPHFELDPEIERTFRRNWRRGRQRNARR-MENNNNRNAHPPQVAQ---------------EPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVF

Query:  GENARFEIKPVMLQMLQNAGQFVGHPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEEKRWANALEDGEVGTWDQLIEKFMKKFFPPHENA
          N RFE+KPVMLQMLQ AGQF G  GEDPH H++SF  IC++F M G+    +R TLFP +LRDE ++WA + E GE+ TW +++EKFM+K+FPP  +A
Subjt:  GENARFEIKPVMLQMLQNAGQFVGHPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEEKRWANALEDGEVGTWDQLIEKFMKKFFPPHENA

Query:  RRRKELMSFQQKDRENLHDAWSRYKRMVKACPHNGIP
        +RR+++++F+QKD E   +AW+R+KR+V+ CPHNGIP
Subjt:  RRRKELMSFQQKDRENLHDAWSRYKRMVKACPHNGIP

XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]2.6e-4546.7Show/hide
Query:  FELDPEIERTFRRNWRRGRQRNARRMENNNNRNAHPPQVAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMLQNAGQFVG
        F  DPEIERTF R  RR  QR  ++ +   + N +   +   P  A++  D DR IR YAAP     N GI  P   +  +FE+KPVM QMLQ  GQF G
Subjt:  FELDPEIERTFRRNWRRGRQRNARRMENNNNRNAHPPQVAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMLQNAGQFVG

Query:  HPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEEKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRY
         P EDPH H+R F  I  SF   G+    LR  LFP ++RD  + W N+L  G V TW+ L EKF+ K+FPP+ NA+ R E+ SFQQ+D E+L+DAW R+
Subjt:  HPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEEKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRY

Query:  KRMVKACPHNGI
        K +++ CPH+GI
Subjt:  KRMVKACPHNGI

XP_017239618.1 PREDICTED: uncharacterized protein LOC108212402 [Daucus carota subsp. sativus]5.0e-4445.29Show/hide
Query:  MSDGEQPHFELDPEIERTFRRNWRRGRQRNARRMENNNNRNAHPPQVAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQML
        MS+     F  DPEIERTF R  RR  QR  ++ +     N +   +   P  A++  D DR IR YAAP     N GI  P   +  +FE+KPVM QML
Subjt:  MSDGEQPHFELDPEIERTFRRNWRRGRQRNARRMENNNNRNAHPPQVAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQML

Query:  QNAGQFVGHPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEEKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDREN
        Q  GQF G P EDPH H+R F  I  SF   G+    LR  LFP ++RD  + W N+L  G V TW+ L EKF+ K+FPP+ NA+   E+ SFQQ+D E+
Subjt:  QNAGQFVGHPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEEKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDREN

Query:  LHDAWSRYKRMVKACPHNGIPES
        L+DAW R+K +++ CPH+GI  S
Subjt:  LHDAWSRYKRMVKACPHNGIPES

XP_030508936.1 uncharacterized protein LOC115723589 [Cannabis sativa]3.8e-4447.87Show/hide
Query:  LDPEIERTFRRNWRRGRQRNARRMENNNNRNAHPPQVAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMLQNAGQFVGHP
        +DPEIERTFR+  RR  Q+  +R   N         V  E N   +A D  R IR YAAP     NPGI  P   +   FE+KPVM QMLQ  GQF G P
Subjt:  LDPEIERTFRRNWRRGRQRNARRMENNNNRNAHPPQVAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMLQNAGQFVGHP

Query:  GEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEEKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRYKR
         EDPH HIRSF  +  SF + G+S + LR  LFP +LRD  + W N L    V  W+ L EKF++K+FPP  NA+ R E+MSFQQ + E   DAW R+K 
Subjt:  GEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEEKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRYKR

Query:  MVKACPHNGIP
        +++ CPH+GIP
Subjt:  MVKACPHNGIP

XP_038887458.1 uncharacterized protein LOC120077591 [Benincasa hispida]4.5e-5349.55Show/hide
Query:  MSDGEQPHFELDPEIERTFRRNWRRGR--QRNARRMENNNNRNAHPPQVAQEP--NVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVM
        MS    P FE +PEI+ TFR    + R  +R     +NNNN      Q    P  +  ++A D + PIR+YAAPNLY+F+PGI+ P+  ENARFEIKPVM
Subjt:  MSDGEQPHFELDPEIERTFRRNWRRGR--QRNARRMENNNNRNAHPPQVAQEP--NVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVM

Query:  LQMLQNAGQFVGHPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEEKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQK
        +QM+QN  QF     E+PH H+  F  +C++F +PGI+P  +R  LFP TLRD+ KRWA++LE  E+ + DQL+E FMKKFFPP  N RRRK +++F++ 
Subjt:  LQMLQNAGQFVGHPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEEKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQK

Query:  DRENLHDAWSRYKRMVKACPHNGI
        D E L  AW R++R+VK CPH GI
Subjt:  DRENLHDAWSRYKRMVKACPHNGI

TrEMBL top hitse value%identityAlignment
A0A392NID4 Retrotrans_gag domain-containing protein (Fragment)1.9e-4143.3Show/hide
Query:  PHFELDPEIERTF---RRNWRRGRQRNARRMENNNNR------NAHPPQVAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVML
        P +  DPEIERTF   RRN R          E   +          P  V  EP++  +A+D  R IR YAA +    N GI  P     A+FE KP+M 
Subjt:  PHFELDPEIERTF---RRNWRRGRQRNARRMENNNNR------NAHPPQVAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVML

Query:  QMLQNAGQFVGHPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEEKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKD
        QMLQ  GQF     EDPH H++ F  + ++F +PGI+    R  LFP +LRD  K W N+LE   +  W+ L EKF+ K+FPP +NA+ R ++ SF+Q D
Subjt:  QMLQNAGQFVGHPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEEKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKD

Query:  RENLHDAWSRYKRMVKACPHNGIP
         E L DAW RYK M++ CPHNGIP
Subjt:  RENLHDAWSRYKRMVKACPHNGIP

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129452.6e-3839.07Show/hide
Query:  DPEIERTFRRNWRRGRQ----RNARRMENNNNRNAHPPQVAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMLQNAGQFV
        DP+IERTFRR+ R   Q          +NNNN N          N   +  + +R +R Y  P +   +  I  P    N  FEIKP  +QM+Q++ QF 
Subjt:  DPEIERTFRRNWRRGRQ----RNARRMENNNNRNAHPPQVAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMLQNAGQFV

Query:  GHPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEEKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSR
        G P +DP+ H+ +F  IC +F   G++   +R  LFP +LRD+ K W N+L +G + TW+ L +KF+ KFFPP + A+ R ++ SF Q D E+L++AW R
Subjt:  GHPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEEKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSR

Query:  YKRMVKACPHNGIPE
        +K +++ CPH+GIP+
Subjt:  YKRMVKACPHNGIPE

A0A6J0ZYV0 uncharacterized protein LOC1104134138.8e-3939.53Show/hide
Query:  DPEIERTFRRNWRRGRQ----RNARRMENNNNRNAHPPQVAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMLQNAGQFV
        DP+IERTFRR+ R   Q          +NNNN N          N   +  + +R +R YA P +   +  I  P    N  FEIKP  +QM+Q++ QF 
Subjt:  DPEIERTFRRNWRRGRQ----RNARRMENNNNRNAHPPQVAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMLQNAGQFV

Query:  GHPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEEKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSR
        G P +DP+ H+ +F  IC +F   G++   +R  LFP +LRD+ K W N+L +G + TW+ L +KF+ KFFPP + A+ R ++ SF Q D E+L++AW R
Subjt:  GHPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEEKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSR

Query:  YKRMVKACPHNGIPE
        +K +++ CPH+GIP+
Subjt:  YKRMVKACPHNGIPE

A0A6J1EEI2 uncharacterized protein LOC1114333945.7e-3845.29Show/hide
Query:  NVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMLQNAGQFVGHPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEE
        N  ++A D +R IR+YA P +   NP I  P   +   FE+KPVM QMLQ  GQF G P EDPH H++SF  +  SF    +    +R +LFP +LRD  
Subjt:  NVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMLQNAGQFVGHPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEE

Query:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRYKRMVKACPHNGIP
        K W N L  G + +W+ L+EKF+ K+FPP  NAR R E++ FQQ + + L +AW R+K M++ CPH+G+P
Subjt:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRYKRMVKACPHNGIP

U5CUI2 Retrotrans_gag domain-containing protein1.5e-4149.41Show/hide
Query:  NVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMLQNAGQFVGHPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEE
        N   +A D  R IR YAAP     NPGI  P   +  +FE+KPVM QMLQ  GQF G P EDPH H+RSF  +  SF + G+S + LR  LFP +LRD  
Subjt:  NVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMLQNAGQFVGHPGEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEE

Query:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRYKRMVKACPHNGIP
        + W N L    V  W+ L EKF++K+FPP  NA+ R E+MSFQQ + E+  DAW R+K +++ CPH+GIP
Subjt:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRYKRMVKACPHNGIP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGACGGTGAACAGCCACATTTTGAGCTTGACCCTGAAATTGAGAGAACTTTTCGGCGTAATTGGCGAAGAGGAAGGCAAAGAAACGCAAGAAGAATGGAAAATAA
TAACAATAGAAACGCTCATCCACCGCAAGTTGCCCAAGAACCAAACGTCGCCTACATGGCGCATGACCTTGATAGGCCAATTAGGTCATATGCTGCACCCAACCTCTATA
ACTTCAACCCAGGGATCGCCTACCCTGTGTTCGGTGAAAATGCAAGGTTTGAAATCAAGCCTGTGATGTTACAAATGCTTCAGAACGCCGGACAATTTGTCGGTCATCCT
GGGGAAGATCCACACGAGCACATTAGAAGTTTTTACTCTATCTGTGCTTCCTTCCATATGCCAGGCATCTCACCTAAGGAATTGAGATTCACACTTTTCCCATTAACACT
AAGGGATGAGGAAAAAAGGTGGGCCAATGCCTTGGAAGATGGCGAGGTGGGAACCTGGGATCAATTGATAGAGAAATTTATGAAGAAATTTTTTCCACCTCACGAGAACG
CCAGAAGAAGAAAGGAGCTCATGAGCTTCCAACAAAAAGATAGAGAGAACCTACATGACGCGTGGAGTAGGTACAAGAGGATGGTCAAAGCATGCCCCCACAATGGCATT
CCCGAATCTCCTACAACCAGATTAAGGCAACGCTGGATACGATGGCCAGCAATAATGAAGAATGGGATGAAGATGATTTTGGCAATCGCCGAGGAGGACGCGAAAGAAGC
AAAGAGGGTATGGATAAGAACGACGTGGTGGCGTTGCAAGGACAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGACGGTGAACAGCCACATTTTGAGCTTGACCCTGAAATTGAGAGAACTTTTCGGCGTAATTGGCGAAGAGGAAGGCAAAGAAACGCAAGAAGAATGGAAAATAA
TAACAATAGAAACGCTCATCCACCGCAAGTTGCCCAAGAACCAAACGTCGCCTACATGGCGCATGACCTTGATAGGCCAATTAGGTCATATGCTGCACCCAACCTCTATA
ACTTCAACCCAGGGATCGCCTACCCTGTGTTCGGTGAAAATGCAAGGTTTGAAATCAAGCCTGTGATGTTACAAATGCTTCAGAACGCCGGACAATTTGTCGGTCATCCT
GGGGAAGATCCACACGAGCACATTAGAAGTTTTTACTCTATCTGTGCTTCCTTCCATATGCCAGGCATCTCACCTAAGGAATTGAGATTCACACTTTTCCCATTAACACT
AAGGGATGAGGAAAAAAGGTGGGCCAATGCCTTGGAAGATGGCGAGGTGGGAACCTGGGATCAATTGATAGAGAAATTTATGAAGAAATTTTTTCCACCTCACGAGAACG
CCAGAAGAAGAAAGGAGCTCATGAGCTTCCAACAAAAAGATAGAGAGAACCTACATGACGCGTGGAGTAGGTACAAGAGGATGGTCAAAGCATGCCCCCACAATGGCATT
CCCGAATCTCCTACAACCAGATTAAGGCAACGCTGGATACGATGGCCAGCAATAATGAAGAATGGGATGAAGATGATTTTGGCAATCGCCGAGGAGGACGCGAAAGAAGC
AAAGAGGGTATGGATAAGAACGACGTGGTGGCGTTGCAAGGACAAATGA
Protein sequenceShow/hide protein sequence
MSDGEQPHFELDPEIERTFRRNWRRGRQRNARRMENNNNRNAHPPQVAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMLQNAGQFVGHP
GEDPHEHIRSFYSICASFHMPGISPKELRFTLFPLTLRDEEKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRYKRMVKACPHNGI
PESPTTRLRQRWIRWPAIMKNGMKMILAIAEEDAKEAKRVWIRTTWWRCKDK