CuGenDBv2

PI0004811 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0004811
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr08:569469..575878
RNA-Seq Expression: PI0004811
Synteny: PI0004811
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
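
The genome location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: `parse_location` and `Location` are hypothetical names, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from dataclasses import dataclass

# Pattern for strings such as "chr08:569469..575878".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


@dataclass(frozen=True)
class Location:
    """A genomic interval (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse 'seqid:start..end' into a Location, raising ValueError if malformed."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(match["seqid"], start, end)


loc = parse_location("chr08:569469..575878")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 6,410 positions on chr08.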


Homology
GenBank top hits (columns: e value, % identity, alignment)
WP_217833177.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]; e value 1.6e-52; identity 49.76%
Query:  DNNRNAP-PPQAIQEQNA--------------LAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYF
        D   N P  PQA   QNA              LAHD + P+  Y +PNLYNF PGI+ P F  N RF +KPVMLQM+Q   QFGG  GEDPH H++SF  
Subjt:  DNNRNAP-PPQAIQEQNA--------------LAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYF

Query:  ICASFHMSGISPEELRFALFPLTLRDETKRWANALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPEC
        IC++F M+G+  + +R  LFP +LRDE ++WA + E GE  TW +++EKFM+K FPP  +++RR+++++F+QKD E   +AW+RFKR+V+ CPHNGIP C
Subjt:  ICASFHMSGISPEELRFALFPLTLRDETKRWANALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPEC

Query:  ILMEI
        + MEI
Subjt:  ILMEI

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]; e value 3.7e-41; identity 49.71%
Query:  ALAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHMSGISPEELRFALFPLTLRDETKRW
        ALA D    I  Y AP     NPGIV P   +   F +KPVM QM+Q V QFGG P EDPH HIRSF  +  SF + G+S E LR  LFP +LRD  + W
Subjt:  ALAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHMSGISPEELRFALFPLTLRDETKRW

Query:  ANALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPECILME
         N L       W+ L EKF++K FPP  N++ R E+MSFQQ + E   DAW RFK +++ CPH+GIP CI +E
Subjt:  ANALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPECILME

XP_030508936.1 uncharacterized protein LOC115723589 [Cannabis sativa]; e value 8.3e-41; identity 49.71%
Query:  ALAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHMSGISPEELRFALFPLTLRDETKRW
        ALA D    I  Y AP     NPGIV P   +   F +KPVM QM+Q V QFGG P EDPH HIRSF  +  SF + G+S E LR  LFP +LRD  + W
Subjt:  ALAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHMSGISPEELRFALFPLTLRDETKRW

Query:  ANALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPECILME
         N L       W+ L EKF++K FPP  N++ R E+MSFQQ + E   DAW RFK +++ CPH+GIP CI +E
Subjt:  ANALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPECILME

XP_038880527.1 uncharacterized protein LOC120072192 [Benincasa hispida]; e value 3.2e-40; identity 39.26%
Query:  LAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHMSGISPEELRFALFPLTLRDETKRWA
        +A++   P+  Y +P LY+F+PGI+YP+  +  RF +K VMLQM+Q  RQFGG  GEDPH H++ F   C  F +  I+PE++R +LFP +LRD+ K+W 
Subjt:  LAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHMSGISPEELRFALFPLTLRDETKRWA

Query:  NALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHN-----------------------------GIPECILM
        ++LE  E  TW++L+EKFM+K FPP  N+RRR+E+M+F+Q+D E L  A  RF  +VK CP++                             G+ +    
Subjt:  NALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHN-----------------------------GIPECILM

Query:  EIKATLDSMASNNEEWDEDDFGNRRGERGRSD--EGMDKNAV
        E K  L  +A +N EW +D +  R   R RS     +D NA+
Subjt:  EIKATLDSMASNNEEWDEDDFGNRRGERGRSD--EGMDKNAV

XP_038887458.1 uncharacterized protein LOC120077591 [Benincasa hispida]; e value 2.0e-50; identity 43.46%
Query:  MANDNNRNAPP-----PQAIQEQNALAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHM
        MA++NN  AP       + +Q+   LA D ++PI +Y APNLY+F+PGI  P+  ENARF IKPVM+QMIQN+RQF     E+PH H+  F  +C++F +
Subjt:  MANDNNRNAPP-----PQAIQEQNALAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHM

Query:  SGISPEELRFALFPLTLRDETKRWANALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPECILMEI--
         GI+P  +R  LFP TLRD+ KRWA++LE  E  + DQL+E FMKK FPP  N+RRRK +++F++ D E L  AW RF+R+VK CPH GI +C+LME+  
Subjt:  SGISPEELRFALFPLTLRDETKRWANALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPECILMEI--

Query:  ---------------------------KATLDSMASNNEEWDEDDFGNRRGERGRSDEGM
                                   K  LD ++ N ++W +D +  R  ER R+D  +
Subjt:  ---------------------------KATLDSMASNNEEWDEDDFGNRRGERGRSDEGM

TrEMBL top hits (columns: e value, % identity, alignment)
A0A392NID4 Retrotrans_gag domain-containing protein (Fragment); e value 1.8e-33; identity 43.6%
Query:  LAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHMSGISPEELRFALFPLTLRDETKRWA
        +A+D    I  Y A +    N GIV P     A+F  KP+M QM+Q V QF     EDPH H++ F  + ++F + GI+ +  R  LFP +LRD  K W 
Subjt:  LAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHMSGISPEELRFALFPLTLRDETKRWA

Query:  NALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPECILME
        N+LE      W+ L EKF+ K FPP +N++ R ++ SF+Q D E L DAW R+K M++ CPHNGIP CI +E
Subjt:  NALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPECILME

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945; e value 1.8e-33; identity 34.41%
Query:  MANDNNRNAPPPQAIQEQNA--LAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHMSGI
        MA DNN N          NA  L  + +  +  YV P +   +  I  P    N  F IKP  +QMIQ+  QF G P +DP+ H+ +F  IC +F  +G+
Subjt:  MANDNNRNAPPPQAIQEQNA--LAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHMSGI

Query:  SPEELRFALFPLTLRDETKRWANALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPECILME------
        + + +R  LFP +LRD+ K W N+L +G   TW+ L +KF+ K FPP + ++ R ++ SF Q D E+L++AW RFK +++ CPH+GIP+ + ++      
Subjt:  SPEELRFALFPLTLRDETKRWANALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPECILME------

Query:  ---IKATLDS--------------------MASNNEEWDEDDFGNRR
           IK  +D+                    MASNN +W  +  G+R+
Subjt:  ---IKATLDS--------------------MASNNEEWDEDDFGNRR

A0A6J1EEI2 uncharacterized protein LOC111433394; e value 1.5e-35; identity 43.6%
Query:  LAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHMSGISPEELRFALFPLTLRDETKRWA
        LA D +  I +Y  P +   NP I+ P   +   F +KPVM QM+Q + QF G P EDPH H++SF  +  SF    +  + +R +LFP +LRD  K W 
Subjt:  LAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHMSGISPEELRFALFPLTLRDETKRWA

Query:  NALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPECILME
        N L  G   +W+ L+EKF+ K FPP  N+R R E++ FQQ + + L +AW RFK M++ CPH+G+P CI ME
Subjt:  NALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPECILME

A0A6J1H7E4 uncharacterized protein LOC111461168; e value 5.1e-36; identity 44.19%
Query:  LAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHMSGISPEELRFALFPLTLRDETKRWA
        LA D +  I +Y  P +   NP I+ P   +   F +KPVM QM+Q + QF G P EDPH H++SF  +  SF   G+  + +R +LFP +LRD  K W 
Subjt:  LAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHMSGISPEELRFALFPLTLRDETKRWA

Query:  NALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPECILME
        N L      +W+ L EKF+ K FPP  N+R R E+++FQQ + E L +AW RFK M++ CPH+G+P CI ME
Subjt:  NALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPECILME

U5CUI2 Retrotrans_gag domain-containing protein; e value 5.8e-40; identity 47.78%
Query:  QAIQEQNALAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHMSGISPEELRFALFPLTL
        Q I     LA D    I  Y AP     NPGIV P   +  +F +KPVM QM+Q V QF G P EDPH H+RSF  +  SF + G+S E LR  LFP +L
Subjt:  QAIQEQNALAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHMSGISPEELRFALFPLTL

Query:  RDETKRWANALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPECILME
        RD  + W N L       W+ L EKF++K FPP  N++ R E+MSFQQ + E+  DAW RFK +++ CPH+GIP CI ME
Subjt:  RDETKRWANALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPECILME

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found
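
The middle row of each alignment block above marks identical residues with the residue letter and conservative substitutions with `+`. As a rough illustration of how per-block statistics could be tallied from that row, here is a minimal sketch. `midline_stats` is a hypothetical helper, and its result is not guaranteed to reproduce the % identity column, which the database may compute over a different alignment span. Because the Subjt rows on this page repeat the Query rows, only the Query row and the middle row are used.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block.

    query   : the aligned query row (may contain '-' gap characters)
    midline : the row between Query and Subjt; letters mark identical
              residues, '+' marks conservative substitutions
    """
    length = len(query)
    # Trailing spaces of the midline are often stripped; pad to the query length.
    midline = midline.ljust(length)[:length]
    identical = sum(ch.isalpha() for ch in midline)
    positive = identical + midline.count("+")
    return {
        "length": length,
        "identical": identical,
        "positive": positive,
        "query_gaps": query.count("-"),
        "percent_identity": round(100.0 * identical / length, 2) if length else 0.0,
    }


# Last block of the first GenBank hit: Query "ILMEI", midline "+ MEI".
print(midline_stats("ILMEI", "+ MEI"))
```

To approximate a whole-hit figure, the counts from every block of a hit would be summed before dividing.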

Sequences
CDS sequence
ATGGCAAATGACAATAATAGAAACGCTCCTCCGCCGCAAGCTATCCAAGAGCAAAATGCCTTAGCACATGACCTAGACATGCCAATTTGGTCATATGTGGCGCCTAACCT
CTACAACTTCAACCCAGGGATCGTCTACCCTGTGTTCGGTGAAAATGCAAGGTTTGGAATCAAACCCGTAATGCTACAAATGATACAGAATGTCAGGCAATTTGGCGGTC
ACCCTGGAGAGGATCCACATGAGCATATCAGAAGTTTCTACTTTATTTGTGCTTCCTTCCACATGTCAGGCATCTCACCTGAGGAACTAAGATTCGCACTCTTCCCGTTA
ACCCTAAGGGACGAGACGAAAAGGTGGGCCAATGCCTTGGAGGATGGTGAGGCGGGAACCTGGGACCAATTAATAGAGAAATTTATGAAGAAATGTTTCCCACCTCATGA
AAATTCCAGAAGAAGGAAGGAACTCATGAGTTTCCAACAAAAAGATAGAGAGAACCTACACGATGCGTGGAGTAGGTTCAAGAGGATGGTCAAAGCTTGCCCCCACAATG
GCATTCCGGAGTGTATATTGATGGAGATCAAGGCGACACTGGATTCAATGGCCAGCAACAATGAAGAATGGGATGAAGATGACTTCGGCAATCGCCGAGGAGAACGAGGA
AGAAGCGATGAAGGAATGGATAAGAACGCCGTGGTGGCGCAATTTGTATTGCGGAAAAGTGGCGAACAAGGAACGCCACAATACACTCACCTGAGACACGACAAGGGAAA
AGTATTGCTTAAACCTCCACCAGAAGCGGCTAAATTTTTTGAAGAAGTGGAGGTTGACAACTTGGATAAGGATGTCGTGCTCAATTTCCTCAACGAACGGGTGAGAAAGA
GAAAGGAGGCACATTTGAAGAGGACGAAAGAAGTTCGTCGCAGGAAGGAAGAACGAAAAAAAGAAGAAGATGCTCGCGGACTTGAGCGAGCAAGTGGTGGAGCTCCCGCG
AAGTTTAAAGCTTTGGAGCCTGAAAGAGACCTCGAGGTCATCGCGGAAGAACTCGAGGAAGAGCTGGAGGCGATGAGCCCAATTGATCAAGGACCACCGCCTAGAAAACA
AAGGGAGGTCGCCGAACCATCCAAAATGAAGAAGAAAAGTGGGAACTCTGGACCTGAAGAGCGCCCATCCGGCGGCGAGGCCAAAACAAAAACGCCCTCAATCAACTCAC
TAATCAAGGTTGAAAAAGGGGCACACCAAGGTACAAATAGGTGTGGTGGAGAAGTTTTCGCCGCTAAACTCAATGCTGAGGAGTTTAGCGTCGAAATAAGTGGAAAAACG
GTGAGTTTCGACGCGGAGGCCATCAACAGTCTATACGAAGTGCCCAAGGATGTTGAAATGCTTGGGCATGAATATGTGGTAAGTCCAACGAAGAAGATGGCCCGAGAAGC
ATTGGAAGTCATCGCCTGGCCTGGGGTCGTATGGGAGATCACGCCGACGGGGAAATATCAGCTTTATCCACACCAACTAACCACAGAAGCAATTGTGTGGCTATTTTTCA
TCAAGAAGGAGATATTCCCCACGCGCCATGACAGCACCATCAATTTAGAGTCAGCGATGCTACTTTACTGCATACTGACAAAGAAGCGCGTCAACCTCGGCAATTTGATA
GCTACATCCATTCTGGGTTGGATGCGACTCCCAAGGGCGCCATGCCCTTCCTATCTACAGTTGAAGCCCTCTGCGTGA
mRNA sequence
ATGGCAAATGACAATAATAGAAACGCTCCTCCGCCGCAAGCTATCCAAGAGCAAAATGCCTTAGCACATGACCTAGACATGCCAATTTGGTCATATGTGGCGCCTAACCT
CTACAACTTCAACCCAGGGATCGTCTACCCTGTGTTCGGTGAAAATGCAAGGTTTGGAATCAAACCCGTAATGCTACAAATGATACAGAATGTCAGGCAATTTGGCGGTC
ACCCTGGAGAGGATCCACATGAGCATATCAGAAGTTTCTACTTTATTTGTGCTTCCTTCCACATGTCAGGCATCTCACCTGAGGAACTAAGATTCGCACTCTTCCCGTTA
ACCCTAAGGGACGAGACGAAAAGGTGGGCCAATGCCTTGGAGGATGGTGAGGCGGGAACCTGGGACCAATTAATAGAGAAATTTATGAAGAAATGTTTCCCACCTCATGA
AAATTCCAGAAGAAGGAAGGAACTCATGAGTTTCCAACAAAAAGATAGAGAGAACCTACACGATGCGTGGAGTAGGTTCAAGAGGATGGTCAAAGCTTGCCCCCACAATG
GCATTCCGGAGTGTATATTGATGGAGATCAAGGCGACACTGGATTCAATGGCCAGCAACAATGAAGAATGGGATGAAGATGACTTCGGCAATCGCCGAGGAGAACGAGGA
AGAAGCGATGAAGGAATGGATAAGAACGCCGTGGTGGCGCAATTTGTATTGCGGAAAAGTGGCGAACAAGGAACGCCACAATACACTCACCTGAGACACGACAAGGGAAA
AGTATTGCTTAAACCTCCACCAGAAGCGGCTAAATTTTTTGAAGAAGTGGAGGTTGACAACTTGGATAAGGATGTCGTGCTCAATTTCCTCAACGAACGGGTGAGAAAGA
GAAAGGAGGCACATTTGAAGAGGACGAAAGAAGTTCGTCGCAGGAAGGAAGAACGAAAAAAAGAAGAAGATGCTCGCGGACTTGAGCGAGCAAGTGGTGGAGCTCCCGCG
AAGTTTAAAGCTTTGGAGCCTGAAAGAGACCTCGAGGTCATCGCGGAAGAACTCGAGGAAGAGCTGGAGGCGATGAGCCCAATTGATCAAGGACCACCGCCTAGAAAACA
AAGGGAGGTCGCCGAACCATCCAAAATGAAGAAGAAAAGTGGGAACTCTGGACCTGAAGAGCGCCCATCCGGCGGCGAGGCCAAAACAAAAACGCCCTCAATCAACTCAC
TAATCAAGGTTGAAAAAGGGGCACACCAAGGTACAAATAGGTGTGGTGGAGAAGTTTTCGCCGCTAAACTCAATGCTGAGGAGTTTAGCGTCGAAATAAGTGGAAAAACG
GTGAGTTTCGACGCGGAGGCCATCAACAGTCTATACGAAGTGCCCAAGGATGTTGAAATGCTTGGGCATGAATATGTGGTAAGTCCAACGAAGAAGATGGCCCGAGAAGC
ATTGGAAGTCATCGCCTGGCCTGGGGTCGTATGGGAGATCACGCCGACGGGGAAATATCAGCTTTATCCACACCAACTAACCACAGAAGCAATTGTGTGGCTATTTTTCA
TCAAGAAGGAGATATTCCCCACGCGCCATGACAGCACCATCAATTTAGAGTCAGCGATGCTACTTTACTGCATACTGACAAAGAAGCGCGTCAACCTCGGCAATTTGATA
GCTACATCCATTCTGGGTTGGATGCGACTCCCAAGGGCGCCATGCCCTTCCTATCTACAGTTGAAGCCCTCTGCGTGA
Protein sequence
MANDNNRNAPPPQAIQEQNALAHDLDMPIWSYVAPNLYNFNPGIVYPVFGENARFGIKPVMLQMIQNVRQFGGHPGEDPHEHIRSFYFICASFHMSGISPEELRFALFPL
TLRDETKRWANALEDGEAGTWDQLIEKFMKKCFPPHENSRRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPECILMEIKATLDSMASNNEEWDEDDFGNRRGERG
RSDEGMDKNAVVAQFVLRKSGEQGTPQYTHLRHDKGKVLLKPPPEAAKFFEEVEVDNLDKDVVLNFLNERVRKRKEAHLKRTKEVRRRKEERKKEEDARGLERASGGAPA
KFKALEPERDLEVIAEELEEELEAMSPIDQGPPPRKQREVAEPSKMKKKSGNSGPEERPSGGEAKTKTPSINSLIKVEKGAHQGTNRCGGEVFAAKLNAEEFSVEISGKT
VSFDAEAINSLYEVPKDVEMLGHEYVVSPTKKMAREALEVIAWPGVVWEITPTGKYQLYPHQLTTEAIVWLFFIKKEIFPTRHDSTINLESAMLLYCILTKKRVNLGNLI
ATSILGWMRLPRAPCPSYLQLKPSA
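
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1); `translate` and `CODON_TABLE` are illustrative names rather than anything provided by the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product("TCAG", repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence lines can
    be passed in directly. Codons containing unknown bases become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nt of the CDS; the result matches the start of the protein sequence.
print(translate("ATGGCAAATGACAATAATAGAAACGCT"))  # MANDNNRNA
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence (line breaks removed) gives a quick consistency check between the two records.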