CuGenDBv2

Cmc04g0107121 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0107121
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Integrase
Genome location: CMiso1.1chr04:25505432..25505833
RNA-Seq Expression: Cmc04g0107121
Synteny: Cmc04g0107121
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
    IPR043502 - DNA/RNA polymerase superfamily
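
The genome location above can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

# Location string copied from the "Genome location" field of this record.
LOCATION = "CMiso1.1chr04:25505432..25505833"


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location(LOCATION)

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_length = end - start + 1
print(seqid, span_length)
```

Under that assumption the span is 402 nt, which equals 134 codons: the 133 residues of the listed protein plus a stop codon.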


Homology
GenBank top hits (e value, % identity, alignment)
KAA0060708.1 integrase [Cucumis melo var. makuwa] (e value: 1.7e-69, % identity: 98.5)
Query:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
        MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKV+KLKKALYGLKQA RAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
Subjt:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF

Query:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
        LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
Subjt:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE

KAA0065483.1 integrase [Cucumis melo var. makuwa] (e value: 5.7e-70, % identity: 99.25)
Query:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
        MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQA RAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
Subjt:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF

Query:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
        LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
Subjt:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE

TYK08724.1 integrase [Cucumis melo var. makuwa] (e value: 5.7e-70, % identity: 99.25)
Query:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
        MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQA RAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
Subjt:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF

Query:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
        LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
Subjt:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE

TYK21844.1 integrase [Cucumis melo var. makuwa] (e value: 1.7e-69, % identity: 98.5)
Query:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
        MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKV+KLKKALYGLKQA RAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
Subjt:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF

Query:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
        LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
Subjt:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE

TYK30104.1 integrase [Cucumis melo var. makuwa] (e value: 5.7e-70, % identity: 99.25)
Query:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
        MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQA RAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
Subjt:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF

Query:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
        LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
Subjt:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UZM3 Integrase (e value: 8.1e-70, % identity: 98.5)
Query:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
        MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKV+KLKKALYGLKQA RAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
Subjt:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF

Query:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
        LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
Subjt:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE

A0A5A7VBC7 Integrase (e value: 2.8e-70, % identity: 99.25)
Query:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
        MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQA RAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
Subjt:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF

Query:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
        LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
Subjt:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE

A0A5D3CBW3 Integrase (e value: 2.8e-70, % identity: 99.25)
Query:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
        MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQA RAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
Subjt:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF

Query:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
        LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
Subjt:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE

A0A5D3DE36 Integrase (e value: 8.1e-70, % identity: 98.5)
Query:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
        MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKV+KLKKALYGLKQA RAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
Subjt:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF

Query:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
        LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
Subjt:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE

A0A5D3E2J1 Integrase (e value: 2.8e-70, % identity: 99.25)
Query:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
        MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQA RAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
Subjt:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF

Query:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
        LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
Subjt:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 2.0e-17, % identity: 35.82)
Query:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYV-KEDKYGKFLIVSLYVDDLLFTGNDK
        MDVK+AFLNG LKEEI++  P G       + V KL KA+YGLKQA+R W+   +    +  F     +  +Y+  +    + + V LYVDD++    D 
Subjt:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYV-KEDKYGKFLIVSLYVDDLLFTGNDK

Query:  FLCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE
           ++FK  + ++F M+D+  I +F+GI +   E
Subjt:  FLCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 8.4e-24, % identity: 41.86)
Query:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
        +DVK+AFL+G L+EEI++ QP G+   G++  V KL K+LYGLKQA R WY + DSF     + +   +  +Y K      F+I+ LYVDD+L  G DK 
Subjt:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF

Query:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEV
        L    K  + K F+M D+G     LG+++
Subjt:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEV

P25600 Putative transposon Ty5-1 protein YCL074W (e value: 2.4e-18, % identity: 37.12)
Query:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF
        MDV +AFLN  + E I+V QP G+V     + V++L   +YGLKQA   W   I++   K GF R   EH LY +    G  + +++YVDDLL       
Subjt:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF

Query:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQN
        + D  K  + K + M D+G +  FLG+ ++Q+
Subjt:  LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 7.9e-22, % identity: 40.15)
Query:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSL-YVDDLLFTGNDK
        +DV +AFL G L ++++++QP G++ +     V KL+KALYGLKQA RAWY  + ++ L  GF     + +L+V +   GK ++  L YVDD+L TGND 
Subjt:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSL-YVDDLLFTGNDK

Query:  FLCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQ
         L  +  +++ + F + D   +HYFLGIE  +
Subjt:  FLCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 5.1e-21, % identity: 39.39)
Query:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSL-YVDDLLFTGNDK
        +DV +AFL G L +E++++QP G+V +   + V +L+KA+YGLKQA RAWY  + ++ L  GF     + +L+V +   G+ +I  L YVDD+L TGND 
Subjt:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSL-YVDDLLFTGNDK

Query:  FLCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQ
         L     +++ + F + +   +HYFLGIE  +
Subjt:  FLCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQ

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 3.4e-20, % identity: 37.59)
Query:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEE----EKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTG
        +D+ +AFLNG L EEI++  P GY  R  +      V  LKK++YGLKQASR W+ +     +  GF +   +H  ++K      FL V +YVDD++   
Subjt:  MDVKSAFLNGHLKEEIFVAQPLGYVQRGEE----EKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTG

Query:  NDKFLCDDFKNSMKKEFEMSDMGLIHYFLGIEV
        N+    D+ K+ +K  F++ D+G + YFLG+E+
Subjt:  NDKFLCDDFKNSMKKEFEMSDMGLIHYFLGIEV

ATMG00810.1 DNA/RNA polymerases superfamily protein (e value: 6.9e-05, % identity: 44.19)
Query:  LYVDDLLFTGNDKFLCDDFKNSMKKEFEMSDMGLIHYFLGIEV
        LYVDD+L TG+   L +     +   F M D+G +HYFLGI++
Subjt:  LYVDDLLFTGNDKFLCDDFKNSMKKEFEMSDMGLIHYFLGIEV
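
The % identity values in the hit tables can be checked against the alignment middle lines. The sketch below does this for the first GenBank hit (KAA0060708.1). It assumes the middle line follows the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and that identity is the number of identical positions divided by the alignment length. The function name `percent_identity` is hypothetical.

```python
# Middle lines of the KAA0060708.1 alignment, copied from the GenBank hits.
MIDLINE_SEGMENTS = [
    "MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKV+KLKKALYGLKQA RAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKF",
    "LCDDFKNSMKKEFEMSDMGLIHYFLGIEVNQNE",
]


def percent_identity(midline_segments):
    """Return (identical positions, alignment length, percent identity)."""
    midline = "".join(midline_segments)
    # Letters mark identical residues; '+' and ' ' are non-identical columns.
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(midline), 100.0 * identical / len(midline)


identical, aligned, pct = percent_identity(MIDLINE_SEGMENTS)
print(f"{identical}/{aligned} = {pct:.2f}%")
```

For this hit the count is 131 identical positions out of 133, which rounds to the 98.5 shown in the table. The approach is only reliable when trailing spaces in the middle line have been preserved, which is not guaranteed for the lower-identity SwissProt and Arabidopsis alignments as extracted here.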


Sequences
CDS sequence
ATGGATGTAAAATCCGCTTTTTTGAATGGACACTTGAAGGAAGAGATATTTGTTGCACAACCTTTGGGCTATGTGCAAAGGGGAGAAGAAGAAAAAGTGTACAAGTTGAA
AAAAGCTTTGTATGGATTGAAGCAAGCTTCGCGAGCTTGGTACAGTCGTATCGACAGTTTTTTTCTAAAGACAGGATTTCGAAGGTGTCCATATGAGCATGCACTCTATG
TCAAAGAAGACAAGTATGGTAAATTTCTCATCGTTTCTCTTTACGTTGATGATTTACTTTTTACTGGAAATGATAAATTTTTGTGTGATGATTTTAAGAATTCCATGAAA
AAGGAATTTGAGATGAGTGATATGGGTCTCATCCATTACTTTCTCGGAATTGAAGTTAATCAAAATGAATGA
mRNA sequence
ATGGATGTAAAATCCGCTTTTTTGAATGGACACTTGAAGGAAGAGATATTTGTTGCACAACCTTTGGGCTATGTGCAAAGGGGAGAAGAAGAAAAAGTGTACAAGTTGAA
AAAAGCTTTGTATGGATTGAAGCAAGCTTCGCGAGCTTGGTACAGTCGTATCGACAGTTTTTTTCTAAAGACAGGATTTCGAAGGTGTCCATATGAGCATGCACTCTATG
TCAAAGAAGACAAGTATGGTAAATTTCTCATCGTTTCTCTTTACGTTGATGATTTACTTTTTACTGGAAATGATAAATTTTTGTGTGATGATTTTAAGAATTCCATGAAA
AAGGAATTTGAGATGAGTGATATGGGTCTCATCCATTACTTTCTCGGAATTGAAGTTAATCAAAATGAATGA
Protein sequence
MDVKSAFLNGHLKEEIFVAQPLGYVQRGEEEKVYKLKKALYGLKQASRAWYSRIDSFFLKTGFRRCPYEHALYVKEDKYGKFLIVSLYVDDLLFTGNDKFLCDDFKNSMK
KEFEMSDMGLIHYFLGIEVNQNE
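
The relationship between the CDS and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions: it uses the standard genetic code, the `translate` helper is hypothetical, and only the first 30 nt and the last 72 nt of the CDS are included to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 72 nt of the CDS listed above (both in frame).
CDS_START = "ATGGATGTAAAATCCGCTTTTTTGAATGGA"
CDS_END = "AAGGAATTTGAGATGAGTGATATGGGTCTCATCCATTACTTTCTCGGAATTGAAGTTAATCAAAATGAATGA"

print(translate(CDS_START))
print(translate(CDS_END))
```

The two fragments translate to the beginning (`MDVKSAFLNG`) and the end (`KEFEMSDMGLIHYFLGIEVNQNE`) of the protein sequence above. The final `TGA` codon is the stop.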