; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0222061 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0222061
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionIntegrase
Genome locationCMiso1.1chr08:10327788..10328219
RNA-Seq ExpressionCmc08g0222061
SyntenyCmc08g0222061
Gene Ontology termsGO:0006397 - mRNA processing (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038926.1 integrase [Cucumis melo var. makuwa]1.2e-5484.44Show/hide
Query:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV
        MEL TNKQ LGVKW+Y TKLKSDGNVEKYK RLVVK YKQ+YGVD EEIFA VTRIETI+LI SL AQNG KVYQMDVKSAFL GHLKEEIFVAQPLGYV
Subjt:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV

Query:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK
        +RGEEEKVYKLKK LYGLKQA R WY+ IDSFF K
Subjt:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK

KAA0039947.1 integrase [Cucumis melo var. makuwa]1.2e-5484.44Show/hide
Query:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV
        MEL TNKQ LGVKW+Y TKLKSDGNVEKYK RLVVK YKQ+YGVD EEIFA VTRIETI+LI SL AQNG KVYQMDVKSAFL GHLKEEIFVAQPLGYV
Subjt:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV

Query:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK
        +RGEEEKVYKLKK LYGLKQA R WY+ IDSFF K
Subjt:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK

KAA0060377.1 integrase [Cucumis melo var. makuwa]1.2e-5484.44Show/hide
Query:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV
        MEL TNKQ LGVKW+Y TKLKSDGNVEKYK RLVVK YKQ+YGVD EEIFA VTRIETI+LI SL AQNG KVYQMDVKSAFL GHLKEEIFVAQPLGYV
Subjt:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV

Query:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK
        +RGEEEKVYKLKK LYGLKQA R WY+ IDSFF K
Subjt:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK

TYJ95504.1 integrase [Cucumis melo var. makuwa]1.2e-5484.44Show/hide
Query:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV
        MEL TNKQ LGVKW+Y TKLKSDGNVEKYK RLVVK YKQ+YGVD EEIFA VTRIETI+LI SL AQNG KVYQMDVKSAFL GHLKEEIFVAQPLGYV
Subjt:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV

Query:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK
        +RGEEEKVYKLKK LYGLKQA R WY+ IDSFF K
Subjt:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK

TYK30104.1 integrase [Cucumis melo var. makuwa]1.2e-5484.44Show/hide
Query:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV
        MEL TNKQ LGVKW+Y TKLKSDGNVEKYK RLVVK YKQ+YGVD EEIFA VTRIETI+LI SL AQNG KVYQMDVKSAFL GHLKEEIFVAQPLGYV
Subjt:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV

Query:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK
        +RGEEEKVYKLKK LYGLKQA R WY+ IDSFF K
Subjt:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK

TrEMBL top hitse value%identityAlignment
A0A5A7UZJ8 Integrase6.0e-5584.44Show/hide
Query:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV
        MEL TNKQ LGVKW+Y TKLKSDGNVEKYK RLVVK YKQ+YGVD EEIFA VTRIETI+LI SL AQNG KVYQMDVKSAFL GHLKEEIFVAQPLGYV
Subjt:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV

Query:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK
        +RGEEEKVYKLKK LYGLKQA R WY+ IDSFF K
Subjt:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK

A0A5D3BQ81 Integrase6.0e-5584.44Show/hide
Query:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV
        MEL TNKQ LGVKW+Y TKLKSDGNVEKYK RLVVK YKQ+YGVD EEIFA VTRIETI+LI SL AQNG KVYQMDVKSAFL GHLKEEIFVAQPLGYV
Subjt:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV

Query:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK
        +RGEEEKVYKLKK LYGLKQA R WY+ IDSFF K
Subjt:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK

A0A5D3CLV1 Integrase6.0e-5584.44Show/hide
Query:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV
        MEL TNKQ LGVKW+Y TKLKSDGNVEKYK RLVVK YKQ+YGVD EEIFA VTRIETI+LI SL AQNG KVYQMDVKSAFL GHLKEEIFVAQPLGYV
Subjt:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV

Query:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK
        +RGEEEKVYKLKK LYGLKQA R WY+ IDSFF K
Subjt:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK

A0A5D3CXM6 Integrase6.0e-5584.44Show/hide
Query:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV
        MEL TNKQ LGVKW+Y TKLKSDGNVEKYK RLVVK YKQ+YGVD EEIFA VTRIETI+LI SL AQNG KVYQMDVKSAFL GHLKEEIFVAQPLGYV
Subjt:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV

Query:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK
        +RGEEEKVYKLKK LYGLKQA R WY+ IDSFF K
Subjt:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK

A0A5D3E3T2 Integrase6.0e-5584.44Show/hide
Query:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV
        MEL TNKQ LGVKW+Y TKLKSDGNVEKYK RLVVK YKQ+YGVD EEIFA VTRIETI+LI SL AQNG KVYQMDVKSAFL GHLKEEIFVAQPLGYV
Subjt:  MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYV

Query:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK
        +RGEEEKVYKLKK LYGLKQA R WY+ IDSFF K
Subjt:  KRGEEEKVYKLKKVLYGLKQALRDWYNPIDSFFSK

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.6e-2342.75Show/hide
Query:  NKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYVKRGEE
        NK  +  +W++  K    GN  +YK RLV + + QKY +D EE FA V RI + + I SL  Q  LKV+QMDVK+AFL G LKEEI++  P G       
Subjt:  NKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYVKRGEE

Query:  EKVYKLKKVLYGLKQALRDWYNPIDSFFSKYRISNVSI
        + V KL K +YGLKQA R W+   +    +    N S+
Subjt:  EKVYKLKKVLYGLKQALRDWYNPIDSFFSKYRISNVSI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.8e-2749.25Show/hide
Query:  MELSTNKQDLGVKWMYITKLKSDGNVE--KYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLG
        +EL   K+ L  KW++  KLK DG+ +  +YK RLVVK ++QK G+D +EIF+ V ++ +I+ I SL A   L+V Q+DVK+AFL G L+EEI++ QP G
Subjt:  MELSTNKQDLGVKWMYITKLKSDGNVE--KYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLG

Query:  YVKRGEEEKVYKLKKVLYGLKQALRDWYNPIDSF
        +   G++  V KL K LYGLKQA R WY   DSF
Subjt:  YVKRGEEEKVYKLKKVLYGLKQALRDWYNPIDSF

Q12490 Transposon Ty1-BL Gag-Pol polyprotein1.2e-0730.58Show/hide
Query:  MYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYVKRGEEEKVYKLKKV
        M+I   K DG    +K R V +   Q        + ++      +    SL   N   + Q+D+ SA+L   +KEE+++  P      G  +K+ +LKK 
Subjt:  MYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYVKRGEEEKVYKLKKV

Query:  LYGLKQALRDWYNPIDSFFSK
        LYGLKQ+  +WY  I S+  K
Subjt:  LYGLKQALRDWYNPIDSFFSK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.5e-2138.46Show/hide
Query:  LGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYVKRGEEEKVY
        +G +W++  K  SDG++ +YK RLV K Y Q+ G+D  E F+ V +  +I+++  +       + Q+DV +AFL+G L ++++++QP G++ +     V 
Subjt:  LGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYVKRGEEEKVY

Query:  KLKKVLYGLKQALRDWY
        KL+K LYGLKQA R WY
Subjt:  KLKKVLYGLKQALRDWY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.2e-2138.46Show/hide
Query:  LGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYVKRGEEEKVY
        +G +W++  K  SDG++ +YK RLV K Y Q+ G+D  E F+ V +  +I+++  +       + Q+DV +AFL+G L +E++++QP G+V +   + V 
Subjt:  LGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYVKRGEEEKVY

Query:  KLKKVLYGLKQALRDWY
        +L+K +YGLKQA R WY
Subjt:  KLKKVLYGLKQALRDWY

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 89.3e-2442.19Show/hide
Query:  LSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYVKR
        L  NK+ +G KW+Y  K  SDG +E+YK RLV K Y Q+ G+D  E F+ V ++ +++LI +++A     ++Q+D+ +AFL G L EEI++  P GY  R
Subjt:  LSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYVKR

Query:  GEE----EKVYKLKKVLYGLKQALRDWY
          +      V  LKK +YGLKQA R W+
Subjt:  GEE----EKVYKLKKVLYGLKQALRDWY

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.6e-0739.68Show/hide
Query:  NKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQ
        N+  LG KW++ TKL SDG +++ K RLV K + Q+ G+   E ++ V R  TI+ I ++  Q
Subjt:  NKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCTTTCGACAAACAAACAAGATCTTGGAGTAAAATGGATGTACATAACAAAGTTGAAGTCAGATGGTAATGTTGAAAAATACAAGACAAGACTTGTTGTAAAAGA
CTACAAGCAGAAATATGGTGTGGATTGTGAAGAAATATTTGCCTCTGTGACAAGAATTGAGACCATTCAGTTGATTTTTTCATTAACTGCTCAAAATGGATTGAAAGTTT
ATCAAATGGATGTAAAATCCGCCTTTTTAAAAGGACACTTGAAGGAAGAGATATTCGTTGCACAACCTTTGGGCTATGTGAAAAGGGGAGAAGAAGAAAAAGTGTACAAG
TTGAAAAAGGTATTGTATGGATTGAAGCAAGCTCTCCGAGATTGGTACAATCCTATCGATAGTTTTTTTTCTAAATATAGGATTTCGAATGTGTCAATATAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCTTTCGACAAACAAACAAGATCTTGGAGTAAAATGGATGTACATAACAAAGTTGAAGTCAGATGGTAATGTTGAAAAATACAAGACAAGACTTGTTGTAAAAGA
CTACAAGCAGAAATATGGTGTGGATTGTGAAGAAATATTTGCCTCTGTGACAAGAATTGAGACCATTCAGTTGATTTTTTCATTAACTGCTCAAAATGGATTGAAAGTTT
ATCAAATGGATGTAAAATCCGCCTTTTTAAAAGGACACTTGAAGGAAGAGATATTCGTTGCACAACCTTTGGGCTATGTGAAAAGGGGAGAAGAAGAAAAAGTGTACAAG
TTGAAAAAGGTATTGTATGGATTGAAGCAAGCTCTCCGAGATTGGTACAATCCTATCGATAGTTTTTTTTCTAAATATAGGATTTCGAATGTGTCAATATAA
Protein sequenceShow/hide protein sequence
MELSTNKQDLGVKWMYITKLKSDGNVEKYKTRLVVKDYKQKYGVDCEEIFASVTRIETIQLIFSLTAQNGLKVYQMDVKSAFLKGHLKEEIFVAQPLGYVKRGEEEKVYK
LKKVLYGLKQALRDWYNPIDSFFSKYRISNVSI