; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc05g0128761 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc05g0128761
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationCMiso1.1chr05:6400725..6402259
RNA-Seq ExpressionCmc05g0128761
SyntenyCmc05g0128761
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031937.1 retroelement pol polyprotein-like [Cucumis melo var. makuwa]2.0e-71100Show/hide
Query:  MYFTKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAINTPTIALA
        MYFTKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAINTPTIALA
Subjt:  MYFTKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAINTPTIALA

Query:  ASSNNPKNNSAHKQRKDRPICTHCNIPGHTVDRCYK
        ASSNNPKNNSAHKQRKDRPICTHCNIPGHTVDRCYK
Subjt:  ASSNNPKNNSAHKQRKDRPICTHCNIPGHTVDRCYK

KAA8523936.1 hypothetical protein F0562_010359 [Nyssa sinensis]2.4e-7256.13Show/hide
Query:  MIIGLSVKNKLGFIDGTLPRP---NDDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARSIWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMY
        M+I L VKNKLGF+DG++P P   + DL  SWIRNNNIVISWILNSVSK IS SI+FA SAR IWLDL++RFQ++N PRIF LKR L  L   Q SVS+Y
Subjt:  MIIGLSVKNKLGFIDGTLPRP---NDDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARSIWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMY

Query:  FTKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAIN-TPTIALAA
        FTK KT+ +EL++ RP C+CG C CG  + + +  Q EY+M FLMGL+DS++Q R QLLLM+P+P I+R FSL++QEEQQR  +  S + N T T+A A 
Subjt:  FTKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAIN-TPTIALAA

Query:  -------------SSNNPKNNSAHKQRKDRPICTHCNIPGHTVDRCYKVHGYP
                     +S N  ++++  Q++D+P CTHC I GHTVDRCYK+HGYP
Subjt:  -------------SSNNPKNNSAHKQRKDRPICTHCNIPGHTVDRCYKVHGYP

KAA8536734.1 hypothetical protein F0562_029212 [Nyssa sinensis]4.1e-7255.73Show/hide
Query:  MIIGLSVKNKLGFIDGTLPRP---NDDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARSIWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMY
        M+I LSVKNKLGF+DG +P P   +++LL SWIRNNNIVISWILNS+SK IS SI+FA  AR IWLDL++RFQ++N PRIF LKR L  L   Q SVS+Y
Subjt:  MIIGLSVKNKLGFIDGTLPRP---NDDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARSIWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMY

Query:  FTKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAIN-TPTIAL--
        FTK KT+ +EL++YRP C+CG C CG  + + ++ QTEY+M FLMGL+DS++Q   QLLLM+ +P I+R FSL++QEEQQR  +  S + N T T+A   
Subjt:  FTKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAIN-TPTIAL--

Query:  -----------AASSNNPKNNSAHKQRKDRPICTHCNIPGHTVDRCYKVHGYP
                   + +S N  ++++  Q++DRP CTHC I GHTVDRCYK+HGYP
Subjt:  -----------AASSNNPKNNSAHKQRKDRPICTHCNIPGHTVDRCYKVHGYP

XP_022154973.1 uncharacterized protein LOC111022117 [Momordica charantia]1.2e-7157.38Show/hide
Query:  MIIGLSVKNKLGFIDGTLPRPNDDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARSIWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMYFTK
        M I LS+KNKLGFI+G+LP+P  DLLP WIRN ++VI+W LNSVSKPIS S++F +S   IWLDLK+RFQ +N P+IF L+R LA L+ +Q SV+MY+TK
Subjt:  MIIGLSVKNKLGFIDGTLPRPNDDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARSIWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMYFTK

Query:  FKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAINTPTIALAASSNN
         K L DE  SYRP CTCGSC CG  + V  F+Q E+LM FLMGLN+S+A  R Q+LLM+P PSI +AFSL+ QEEQQR I  FS       +A+  S ++
Subjt:  FKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAINTPTIALAASSNN

Query:  PKNNSAHKQRKDR-PICTHCNIPGHTVDRCYKVHGYP
          +NS  +QR    P CT+C I GHTVD+CY++HG+P
Subjt:  PKNNSAHKQRKDR-PICTHCNIPGHTVDRCYKVHGYP

XP_022856063.1 uncharacterized protein LOC111377235, partial [Olea europaea var. sylvestris]7.0e-7255.42Show/hide
Query:  MIIGLSVKNKLGFIDGTLPRP--NDDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARSIWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMYF
        M+I L VKNK+GFIDG++ +P  +DD + +WIRNNNIVISWILNSVSK IS S+++++SA  IW+DLKERFQ++N PRIF L+R L  L+  Q SV +YF
Subjt:  MIIGLSVKNKLGFIDGTLPRP--NDDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARSIWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMYF

Query:  TKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAINTPTIALAASS
        TK KT+ +EL++YRP C+CG C CGEN++++   Q EY+M FLMGLND++AQ R QLLLM+P+PSI++ FSL+ QEE QR IS  +   NT       + 
Subjt:  TKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAINTPTIALAASS

Query:  NNPKNNS--AHK-QRKDRPICTHCNIPGHTVDRCYKVHGYPLDIKQNIR
         +PK  S   HK Q+K++PICT+C + GH+VD+CYK+HGYP   K   R
Subjt:  NNPKNNS--AHK-QRKDRPICTHCNIPGHTVDRCYKVHGYPLDIKQNIR

TrEMBL top hitse value%identityAlignment
A0A5A7SRC2 Retroelement pol polyprotein-like9.9e-72100Show/hide
Query:  MYFTKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAINTPTIALA
        MYFTKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAINTPTIALA
Subjt:  MYFTKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAINTPTIALA

Query:  ASSNNPKNNSAHKQRKDRPICTHCNIPGHTVDRCYK
        ASSNNPKNNSAHKQRKDRPICTHCNIPGHTVDRCYK
Subjt:  ASSNNPKNNSAHKQRKDRPICTHCNIPGHTVDRCYK

A0A5D3CZP1 Copia protein9.9e-72100Show/hide
Query:  MYFTKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAINTPTIALA
        MYFTKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAINTPTIALA
Subjt:  MYFTKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAINTPTIALA

Query:  ASSNNPKNNSAHKQRKDRPICTHCNIPGHTVDRCYK
        ASSNNPKNNSAHKQRKDRPICTHCNIPGHTVDRCYK
Subjt:  ASSNNPKNNSAHKQRKDRPICTHCNIPGHTVDRCYK

A0A5J5A1K4 Retrotrans_gag domain-containing protein1.2e-7256.13Show/hide
Query:  MIIGLSVKNKLGFIDGTLPRP---NDDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARSIWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMY
        M+I L VKNKLGF+DG++P P   + DL  SWIRNNNIVISWILNSVSK IS SI+FA SAR IWLDL++RFQ++N PRIF LKR L  L   Q SVS+Y
Subjt:  MIIGLSVKNKLGFIDGTLPRP---NDDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARSIWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMY

Query:  FTKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAIN-TPTIALAA
        FTK KT+ +EL++ RP C+CG C CG  + + +  Q EY+M FLMGL+DS++Q R QLLLM+P+P I+R FSL++QEEQQR  +  S + N T T+A A 
Subjt:  FTKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAIN-TPTIALAA

Query:  -------------SSNNPKNNSAHKQRKDRPICTHCNIPGHTVDRCYKVHGYP
                     +S N  ++++  Q++D+P CTHC I GHTVDRCYK+HGYP
Subjt:  -------------SSNNPKNNSAHKQRKDRPICTHCNIPGHTVDRCYKVHGYP

A0A5J5B2C5 Uncharacterized protein2.0e-7255.73Show/hide
Query:  MIIGLSVKNKLGFIDGTLPRP---NDDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARSIWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMY
        M+I LSVKNKLGF+DG +P P   +++LL SWIRNNNIVISWILNS+SK IS SI+FA  AR IWLDL++RFQ++N PRIF LKR L  L   Q SVS+Y
Subjt:  MIIGLSVKNKLGFIDGTLPRP---NDDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARSIWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMY

Query:  FTKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAIN-TPTIAL--
        FTK KT+ +EL++YRP C+CG C CG  + + ++ QTEY+M FLMGL+DS++Q   QLLLM+ +P I+R FSL++QEEQQR  +  S + N T T+A   
Subjt:  FTKFKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAIN-TPTIAL--

Query:  -----------AASSNNPKNNSAHKQRKDRPICTHCNIPGHTVDRCYKVHGYP
                   + +S N  ++++  Q++DRP CTHC I GHTVDRCYK+HGYP
Subjt:  -----------AASSNNPKNNSAHKQRKDRPICTHCNIPGHTVDRCYKVHGYP

A0A6J1DLQ9 uncharacterized protein LOC1110221175.8e-7257.38Show/hide
Query:  MIIGLSVKNKLGFIDGTLPRPNDDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARSIWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMYFTK
        M I LS+KNKLGFI+G+LP+P  DLLP WIRN ++VI+W LNSVSKPIS S++F +S   IWLDLK+RFQ +N P+IF L+R LA L+ +Q SV+MY+TK
Subjt:  MIIGLSVKNKLGFIDGTLPRPNDDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARSIWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMYFTK

Query:  FKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAINTPTIALAASSNN
         K L DE  SYRP CTCGSC CG  + V  F+Q E+LM FLMGLN+S+A  R Q+LLM+P PSI +AFSL+ QEEQQR I  FS       +A+  S ++
Subjt:  FKTLIDELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAINTPTIALAASSNN

Query:  PKNNSAHKQRKDR-PICTHCNIPGHTVDRCYKVHGYP
          +NS  +QR    P CT+C I GHTVD+CY++HG+P
Subjt:  PKNNSAHKQRKDR-PICTHCNIPGHTVDRCYKVHGYP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).3.4e-2433.9Show/hide
Query:  LSVKNKLGFIDGTLPRPN--DDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARSIWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMYFTKFK
        L V  K GFIDGTLP+P+    L   W + N +V+ W++NS++  +  S+++A++A  +W DL+  F      +I+ L+R LA L    +SV  YF K  
Subjt:  LSVKNKLGFIDGTLPRPN--DDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARSIWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMYFTKFK

Query:  TLIDELNSYR--PACTCGSCRCGENQEVANFLQTEYLMDFLMG--LNDSYAQTRTQLLLMEPIPSISRAFSLLLQEE
         +  EL+ Y   P C CG C C   +      + E   +FLMG  LN  +    T+++  +P PS+  AF+++   E
Subjt:  TLIDELNSYR--PACTCGSCRCGENQEVANFLQTEYLMDFLMG--LNDSYAQTRTQLLLMEPIPSISRAFSLLLQEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCATCGGATTATCGGTCAAAAATAAACTCGGTTTTATTGATGGTACCCTTCCCCGACCAAATGACGATCTCCTCCCTTCGTGGATCAGAAACAACAACATT
GTCATCTCTTGGATCTTGAATTCAGTATCCAAACCAATCTCAACCAGTATTCTCTTTGCAGATTCAGCAAGAAGTATTTGGCTTGACCTCAAAGAACGTTTCCAA
CGGAAGAATGCCCCTCGCATTTTTCACCTAAAGAGATCACTTGCAATACTTTCTCATAATCAAGAATCGGTAAGCATGTATTTCACCAAATTTAAAACCTTGATT
GATGAATTAAACTCATACAGACCTGCCTGCACCTGTGGAAGCTGTCGTTGCGGAGAAAATCAAGAAGTTGCAAATTTTCTTCAAACTGAATATCTCATGGATTTT
CTCATGGGACTGAATGATTCATATGCCCAAACCCGCACCCAACTACTCCTCATGGAACCCATTCCATCTATCTCTCGAGCTTTTTCCCTTCTACTTCAAGAAGAA
CAACAACGAGCTATCAGTTCTTTCTCTCCTGCTATCAATACTCCAACCATTGCCCTTGCTGCATCCTCAAATAACCCTAAAAACAATTCTGCACACAAGCAACGC
AAAGACAGACCTATTTGTACCCACTGTAATATCCCAGGACACACTGTTGATAGATGCTACAAGGTGCATGGATATCCCCTGGATATAAAACAAAACATCAGAGAA
CCAACAGTAGTAACTCCAGAAAAATATCGAACAATAGCACAAATTTAG
mRNA sequenceShow/hide mRNA sequence
TTTATATCTTTTACTTTTCCAATATGGTATCAGTTGAAATTAACCCCTTCTTTCTCTTTCCTTTTTTTCCCACCCCCAATTAATTCTCCCATTAACTCTCTTTTT
CTTCACCGTTTTATCCCATCCCCCCTCATTAATTTTTTTCCTTTTCTTCACAGACCCATCCCATCCCCTTATTAATTTTTTTTCTTTTCTTCACAGACCCATCCC
ATCCCCTCATTAATTTTTTTTCCTTTTCTTCACAGACCCATCCCATCTCCCCATTAATTTTTCTCTTCTTCTTCCCCGATCCATTCCATCTCCCAACCATTTTCC
TCTACTACACTCTCATGGCCGACCACTCCCAAACTCCCAATCCAAATCCTTCTACCGCGTCTCAGACTCAAGAAAACTCTCCTCCAACTTCTGATGGCCATCAAA
ACCCAAATCAAGGATACATCAATCCTTATTACCTACACCATAATGACAACACTAGTTTAATACTAGTTACTGAGCCATTAACAAAAGAAAATTACGTTTCGTGGA
GCGCGCGATGATCATCGGATTATCGGTCAAAAATAAACTCGGTTTTATTGATGGTACCCTTCCCCGACCAAATGACGATCTCCTCCCTTCGTGGATCAGAAACAA
CAACATTGTCATCTCTTGGATCTTGAATTCAGTATCCAAACCAATCTCAACCAGTATTCTCTTTGCAGATTCAGCAAGAAGTATTTGGCTTGACCTCAAAGAACG
TTTCCAACGGAAGAATGCCCCTCGCATTTTTCACCTAAAGAGATCACTTGCAATACTTTCTCATAATCAAGAATCGGTAAGCATGTATTTCACCAAATTTAAAAC
CTTGATTGATGAATTAAACTCATACAGACCTGCCTGCACCTGTGGAAGCTGTCGTTGCGGAGAAAATCAAGAAGTTGCAAATTTTCTTCAAACTGAATATCTCAT
GGATTTTCTCATGGGACTGAATGATTCATATGCCCAAACCCGCACCCAACTACTCCTCATGGAACCCATTCCATCTATCTCTCGAGCTTTTTCCCTTCTACTTCA
AGAAGAACAACAACGAGCTATCAGTTCTTTCTCTCCTGCTATCAATACTCCAACCATTGCCCTTGCTGCATCCTCAAATAACCCTAAAAACAATTCTGCACACAA
GCAACGCAAAGACAGACCTATTTGTACCCACTGTAATATCCCAGGACACACTGTTGATAGATGCTACAAGGTGCATGGATATCCCCTGGATATAAAACAAAACAT
CAGAGAACCAACAGTAGTAACTCCAGAAAAATATCGAACAATAGCACAAATTTAGTCACATCTCAAACCAGTAACGCTTCTATAAGCTTGAACAGCGTCACAAAT
CCAACAGAAGCCTTGTTACAATGCCCAAATCTTCTCAACCAACTTCAATCTCAACTTACTGCTTCATCAAATAATTCCACCAATCACATAGCACGTACTCTCTCT
AATTCATACTGGATTATTGATTCTGGAGCATCCACTCATATATGCTGTTCTAAAGAATTTCTCAC
Protein sequenceShow/hide protein sequence
MIIGLSVKNKLGFIDGTLPRPNDDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARSIWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMYFTKFKTLI
DELNSYRPACTCGSCRCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAISSFSPAINTPTIALAASSNNPKNNSAHKQR
KDRPICTHCNIPGHTVDRCYKVHGYPLDIKQNIREPTVVTPEKYRTIAQI