CuGenDBv2

PI0025071 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0025071
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr05:9985418..9987716
RNA-Seq Expression: PI0025071
Synteny: PI0025071
Gene Ontology terms: NA
InterPro domains: NA
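
The genome location above follows a `chromosome:start..end` pattern. A minimal Python sketch for splitting it into its parts is shown here; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr05:9985418..9987716")
print(chrom, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.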


Homology
GenBank top hits (e value, % identity, alignment)
CAA3008942.1 UHRF1-binding 1-like, partial [Olea europaea subsp. europaea]; e value 1.4e-05; % identity 33.98
Query:  FEIDKFDGKTNVSLWKKKIQALLVQQKVVKALID-----------------------------PNKYKKSKSTIKYGRNSSTLHIVMNSLRSKELEIKTK
        F++  F+GK ++++W++K++ +LVQQKV KA+ D                             P+ YK+ K+ IKYG++ ST  IV++ LRSKE++I+ +
Subjt:  FEIDKFDGKTNVSLWKKKIQALLVQQKVVKALID-----------------------------PNKYKKSKSTIKYGRNSSTLHIVMNSLRSKELEIKTK

Query:  KKD
        K D
Subjt:  KKD

TXG73199.1 hypothetical protein EZV62_001778 [Acer yangbiense]; e value 4.2e-05; % identity 27.98
Query:  DPEMSARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDPNK--------------------------------------------------------
        +P MS +FEIDKFDG  +  +W++K++ALL QQK++KA+  P+K                                                        
Subjt:  DPEMSARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDPNK--------------------------------------------------------

Query:  ----------------YKKSKSTIKYGRNSSTLHIVMNSLRSKELEIKTKKKD-FEALFVRGKQLKAK
                        +K  K+ IKYGR S +L   + +L+SKELE+K ++KD  E LFVR  ++  K
Subjt:  ----------------YKKSKSTIKYGRNSSTLHIVMNSLRSKELEIKTKKKD-FEALFVRGKQLKAK

XP_022157854.1 uncharacterized protein LOC111024471 isoform X1 [Momordica charantia]; e value 8.4e-06; % identity 48.61
Query:  MSARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDPNKYKKSKST------IKYGRNSSTLHIVMNSLR
        M+A+FE++ FDGK + SLWKKK++ALLVQQKV KAL DP+K   +K+        +   +S  LH+  N LR
Subjt:  MSARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDPNKYKKSKST------IKYGRNSSTLHIVMNSLR

XP_022157856.1 uncharacterized protein LOC111024471 isoform X3 [Momordica charantia]; e value 8.4e-06; % identity 48.61
Query:  MSARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDPNKYKKSKST------IKYGRNSSTLHIVMNSLR
        M+A+FE++ FDGK + SLWKKK++ALLVQQKV KAL DP+K   +K+        +   +S  LH+  N LR
Subjt:  MSARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDPNKYKKSKST------IKYGRNSSTLHIVMNSLR

XP_038904482.1 uncharacterized protein LOC120090851 [Benincasa hispida]; e value 2.9e-06; % identity 35.4
Query:  ARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDP--------------------NKYKKSKSTIKYGRNSSTLHIVMNSLRSKELEIKTKKKDFEAL
        AR++I+KFDGK    LWK KI+A+L QQ+  KA+ DP                    N YK  K+ +KYGR   T  I++++L+ KE+E+   KK    +
Subjt:  ARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDP--------------------NKYKKSKSTIKYGRNSSTLHIVMNSLRSKELEIKTKKKDFEAL

Query:  FVRGKQLKAKIQK
          + KQ + + Q+
Subjt:  FVRGKQLKAKIQK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UB25 Putative gag-pol polyprotein; e value 7.7e-05; % identity 35.65
Query:  KMDPEMSARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDPNKYKKSKSTIKYGRNSSTLHIVMNSLRSKELEIKTKKKDFEALFVRGKQLKAKIQK
        KMD   S    +D+F  K  V L     +     Q V+     P  Y++ K+ IKYGR+S T+ IV+++L+++ LEIK ++KD E L  RG+  K    K
Subjt:  KMDPEMSARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDPNKYKKSKSTIKYGRNSSTLHIVMNSLRSKELEIKTKKKDFEALFVRGKQLKAKIQK

Query:  AVRGEQIEFQRKSEG
        + +G++  F+ KS+G
Subjt:  AVRGEQIEFQRKSEG

A0A5C7IVV0 gag_pre-integrs domain-containing protein; e value 2.0e-05; % identity 27.98
Query:  DPEMSARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDPNK--------------------------------------------------------
        +P MS +FEIDKFDG  +  +W++K++ALL QQK++KA+  P+K                                                        
Subjt:  DPEMSARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDPNK--------------------------------------------------------

Query:  ----------------YKKSKSTIKYGRNSSTLHIVMNSLRSKELEIKTKKKD-FEALFVRGKQLKAK
                        +K  K+ IKYGR S +L   + +L+SKELE+K ++KD  E LFVR  ++  K
Subjt:  ----------------YKKSKSTIKYGRNSSTLHIVMNSLRSKELEIKTKKKD-FEALFVRGKQLKAK

A0A5D3DNU1 Putative gag-pol polyprotein; e value 7.7e-05; % identity 35.65
Query:  KMDPEMSARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDPNKYKKSKSTIKYGRNSSTLHIVMNSLRSKELEIKTKKKDFEALFVRGKQLKAKIQK
        KMD   S    +D+F  K  V L     +     Q V+     P  Y++ K+ IKYGR+S T+ IV+++L+++ LEIK ++KD E L  RG+  K    K
Subjt:  KMDPEMSARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDPNKYKKSKSTIKYGRNSSTLHIVMNSLRSKELEIKTKKKDFEALFVRGKQLKAKIQK

Query:  AVRGEQIEFQRKSEG
        + +G++  F+ KS+G
Subjt:  AVRGEQIEFQRKSEG

A0A6J1DXR5 uncharacterized protein LOC111024471 isoform X1; e value 4.1e-06; % identity 48.61
Query:  MSARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDPNKYKKSKST------IKYGRNSSTLHIVMNSLR
        M+A+FE++ FDGK + SLWKKK++ALLVQQKV KAL DP+K   +K+        +   +S  LH+  N LR
Subjt:  MSARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDPNKYKKSKST------IKYGRNSSTLHIVMNSLR

A0A6J1DZD8 uncharacterized protein LOC111024471 isoform X3; e value 4.1e-06; % identity 48.61
Query:  MSARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDPNKYKKSKST------IKYGRNSSTLHIVMNSLR
        M+A+FE++ FDGK + SLWKKK++ALLVQQKV KAL DP+K   +K+        +   +S  LH+  N LR
Subjt:  MSARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDPNKYKKSKST------IKYGRNSSTLHIVMNSLR

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGTATTTCCGTTGCCGGGACTTGAACCCGGGTCTCTCGGGTACATTCTGTAATTTGTTCACTCTACAGCTTAATGAAACTCCTCTCAGATTGAAAATGGATCCGGAGAT
GTCTGCTAGATTTGAGATTGATAAGTTCGATGGCAAGACAAATGTTAGTCTTTGGAAAAAGAAAATACAGGCGTTGCTCGTTCAACAGAAAGTTGTGAAGGCTTTAATCG
ATCCAAATAAATACAAAAAAAGTAAGTCAACCATCAAATATGGAAGAAATTCCTCAACATTACACATTGTTATGAACTCTCTTCGGTCAAAGGAGCTTGAAATTAAAACT
AAGAAAAAAGATTTTGAAGCTTTGTTCGTGAGAGGGAAGCAGTTAAAGGCCAAAATTCAAAAAGCAGTGAGAGGTGAACAAATCGAATTCCAAAGGAAGTCAGAAGGATA
A
mRNA sequence
ATGTATTTCCGTTGCCGGGACTTGAACCCGGGTCTCTCGGGTACATTCTGTAATTTGTTCACTCTACAGCTTAATGAAACTCCTCTCAGATTGAAAATGGATCCGGAGAT
GTCTGCTAGATTTGAGATTGATAAGTTCGATGGCAAGACAAATGTTAGTCTTTGGAAAAAGAAAATACAGGCGTTGCTCGTTCAACAGAAAGTTGTGAAGGCTTTAATCG
ATCCAAATAAATACAAAAAAAGTAAGTCAACCATCAAATATGGAAGAAATTCCTCAACATTACACATTGTTATGAACTCTCTTCGGTCAAAGGAGCTTGAAATTAAAACT
AAGAAAAAAGATTTTGAAGCTTTGTTCGTGAGAGGGAAGCAGTTAAAGGCCAAAATTCAAAAAGCAGTGAGAGGTGAACAAATCGAATTCCAAAGGAAGTCAGAAGGATA
AGGATCACCTGTAATTATTGTAAGAAAGGACATATTAGATAAAAAAATCTAGTTTGTTAAAGTCTATTATGCTTTTGTTATTTAACTATTGTCTCTATTTAATCAACAGA
CTATCTCCTCTCTCCATCTATATATAATCCCTTTATTGA
Protein sequence
MYFRCRDLNPGLSGTFCNLFTLQLNETPLRLKMDPEMSARFEIDKFDGKTNVSLWKKKIQALLVQQKVVKALIDPNKYKKSKSTIKYGRNSSTLHIVMNSLRSKELEIKT
KKKDFEALFVRGKQLKAKIQKAVRGEQIEFQRKSEG
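
The CDS and protein sequences above can be cross-checked by translating one into the other. The following Python sketch assumes the standard nuclear genetic code (NCBI translation table 1) applies to this gene, which the record does not state explicitly; the names `CODON_TABLE` and `translate` are hypothetical, and ambiguous bases such as `N` are not handled.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence text
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 33 bases of the CDS listed above.
print(translate("ATGTATTTCCGTTGCCGGGACTTGAACCCGGGT"))
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result against the listed protein sequence is one way to confirm the two are consistent.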