; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0064461 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0064461
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr03:5302835..5303522
RNA-Seq ExpressionCmc03g0064461
SyntenyCmc03g0064461
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046865.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]2.3e-5868.75Show/hide
Query:  NLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK--------------WLELLLGA-------------KQATV
        N+N+KLQQNDG +MADAQRF+SLVGGLIYLTH RPDI YSI V SRFMQC SRDHF AAK              W  + +               KQA V
Subjt:  NLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK--------------WLELLLGA-------------KQATV

Query:  ALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAK-EVSLSYYSTHSSGLIF
        ALSSSEAEY AAT A CQAIWLRRMLTELQHEQEGA+VIF DNKA  SMTKN TFHSRTKHIDI FHFI DLVAK EVSLSY STH  GLIF
Subjt:  ALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAK-EVSLSYYSTHSSGLIF

KAA0056051.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]3.7e-5659.65Show/hide
Query:  MNLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK-------------------------------WLELL---
        +N+NEKLQQNDGA+MA+AQRF+SLVGGLIYLTH RPDI YSI V SRFMQ  SRDHF AAK                               W   L   
Subjt:  MNLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK-------------------------------WLELL---

Query:  -----------LGA------KQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAK
                   LG       KQAT ALSSSEAEY AAT A CQAIWLRRMLTELQHEQEGA+VIFCDNKA ISMTKN TFHSRTKHIDIRFHFI DLVAK
Subjt:  -----------LGA------KQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAK

Query:  -EVSLSYYSTHSSGL-IFSQKLCQRRSC
         EVSL+Y STH     I ++ L + + C
Subjt:  -EVSLSYYSTHSSGL-IFSQKLCQRRSC

KAA0058958.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.5e-57100Show/hide
Query:  LELLLGAKQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAKEVSLSYYSTHSSG
        LELLLGAKQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAKEVSLSYYSTHSSG
Subjt:  LELLLGAKQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAKEVSLSYYSTHSSG

Query:  LIFSQKLCQRRSCVTLEL
        LIFSQKLCQRRSCVTLEL
Subjt:  LIFSQKLCQRRSCVTLEL

KAA0063731.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]5.4e-5568.48Show/hide
Query:  MNLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK--------------W-----------LELLLGAKQATVA
        MN+NEKLQQND AKM DAQRF+SLVGGLIYLTH  PDI YSI V SRFMQC SRDHF AAK              W             ++   KQA VA
Subjt:  MNLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK--------------W-----------LELLLGAKQATVA

Query:  LSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAK-EVSLSYYST
        LSS EAEY A+T A CQ IWLRRMLTELQHEQEGA+VIFC+NKA ISMTKN TFHSRTKHIDI FHFI DLVAK EVSLSY ST
Subjt:  LSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAK-EVSLSYYST

TYK03281.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]2.3e-5868.75Show/hide
Query:  NLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK--------------WLELLLGA-------------KQATV
        N+N+KLQQNDG +MADAQRF+SLVGGLIYLTH RPDI YSI V SRFMQC SRDHF AAK              W  + +               KQA V
Subjt:  NLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK--------------WLELLLGA-------------KQATV

Query:  ALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAK-EVSLSYYSTHSSGLIF
        ALSSSEAEY AAT A CQAIWLRRMLTELQHEQEGA+VIF DNKA  SMTKN TFHSRTKHIDI FHFI DLVAK EVSLSY STH  GLIF
Subjt:  ALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAK-EVSLSYYSTHSSGLIF

TrEMBL top hitse value%identityAlignment
A0A5A7TZP7 Putative gag-pol polyprotein, identical1.1e-5868.75Show/hide
Query:  NLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK--------------WLELLLGA-------------KQATV
        N+N+KLQQNDG +MADAQRF+SLVGGLIYLTH RPDI YSI V SRFMQC SRDHF AAK              W  + +               KQA V
Subjt:  NLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK--------------WLELLLGA-------------KQATV

Query:  ALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAK-EVSLSYYSTHSSGLIF
        ALSSSEAEY AAT A CQAIWLRRMLTELQHEQEGA+VIF DNKA  SMTKN TFHSRTKHIDI FHFI DLVAK EVSLSY STH  GLIF
Subjt:  ALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAK-EVSLSYYSTHSSGLIF

A0A5A7URA0 Putative gag-pol polyprotein, identical1.8e-5659.65Show/hide
Query:  MNLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK-------------------------------WLELL---
        +N+NEKLQQNDGA+MA+AQRF+SLVGGLIYLTH RPDI YSI V SRFMQ  SRDHF AAK                               W   L   
Subjt:  MNLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK-------------------------------WLELL---

Query:  -----------LGA------KQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAK
                   LG       KQAT ALSSSEAEY AAT A CQAIWLRRMLTELQHEQEGA+VIFCDNKA ISMTKN TFHSRTKHIDIRFHFI DLVAK
Subjt:  -----------LGA------KQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAK

Query:  -EVSLSYYSTHSSGL-IFSQKLCQRRSC
         EVSL+Y STH     I ++ L + + C
Subjt:  -EVSLSYYSTHSSGL-IFSQKLCQRRSC

A0A5A7VDB5 Retrovirus-related Pol polyprotein from transposon TNT 1-942.6e-5568.48Show/hide
Query:  MNLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK--------------W-----------LELLLGAKQATVA
        MN+NEKLQQND AKM DAQRF+SLVGGLIYLTH  PDI YSI V SRFMQC SRDHF AAK              W             ++   KQA VA
Subjt:  MNLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK--------------W-----------LELLLGAKQATVA

Query:  LSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAK-EVSLSYYST
        LSS EAEY A+T A CQ IWLRRMLTELQHEQEGA+VIFC+NKA ISMTKN TFHSRTKHIDI FHFI DLVAK EVSLSY ST
Subjt:  LSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAK-EVSLSYYST

A0A5D3BU79 Putative gag-pol polyprotein, identical1.1e-5868.75Show/hide
Query:  NLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK--------------WLELLLGA-------------KQATV
        N+N+KLQQNDG +MADAQRF+SLVGGLIYLTH RPDI YSI V SRFMQC SRDHF AAK              W  + +               KQA V
Subjt:  NLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK--------------WLELLLGA-------------KQATV

Query:  ALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAK-EVSLSYYSTHSSGLIF
        ALSSSEAEY AAT A CQAIWLRRMLTELQHEQEGA+VIF DNKA  SMTKN TFHSRTKHIDI FHFI DLVAK EVSLSY STH  GLIF
Subjt:  ALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAK-EVSLSYYSTHSSGLIF

A0A5D3CI69 Retrovirus-related Pol polyprotein from transposon TNT 1-947.4e-58100Show/hide
Query:  LELLLGAKQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAKEVSLSYYSTHSSG
        LELLLGAKQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAKEVSLSYYSTHSSG
Subjt:  LELLLGAKQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAKEVSLSYYSTHSSG

Query:  LIFSQKLCQRRSCVTLEL
        LIFSQKLCQRRSCVTLEL
Subjt:  LIFSQKLCQRRSCVTLEL

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.2e-1040.22Show/hide
Query:  KQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAKEV-SLSYYSTHS
        +Q +VA SS+EAEY A   A  +A+WL+ +LT +  + E    I+ DN+  IS+  N + H R KHIDI++HF  + V   V  L Y  T +
Subjt:  KQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAKEV-SLSYYSTHS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-1732.97Show/hide
Query:  FKSLVGGLIY-LTHARPDILYSIDVFSRFMQCSSRDHFEAAKW-LELLLGAK------------------------------------------------
        + S VG L+Y +   RPDI +++ V SRF++   ++H+EA KW L  L G                                                  
Subjt:  FKSLVGGLIY-LTHARPDILYSIDVFSRFMQCSSRDHFEAAKW-LELLLGAK------------------------------------------------

Query:  -QATVALSSSEAEYGAATLATCQAIWLRRMLTEL-QHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAKE
         Q  VALS++EAEY AAT    + IWL+R L EL  H++E   V++CD+++ I ++KN  +H+RTKHID+R+H+I ++V  E
Subjt:  -QATVALSSSEAEYGAATLATCQAIWLRRMLTEL-QHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.8e-1428.64Show/hide
Query:  KLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK-WLELLLGA-----------------------------------
        KL    G K+ D   ++ +VG L YL   RPDI Y+++  S+FM   + +H +A K  L  L G                                    
Subjt:  KLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK-WLELLLGA-----------------------------------

Query:  ---------------KQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLV-AKEVSL
                       KQ  V  SS+EAEY +    + +  W+  +LTEL        VI+CDN     +  N  FHSR KHI I +HFI + V +  + +
Subjt:  ---------------KQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLV-AKEVSL

Query:  SYYSTH
         + STH
Subjt:  SYYSTH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.3e-1427.49Show/hide
Query:  MNLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK-WLELLLGA------------------------------
        M  + KL  + G K+ D   ++ +VG L YL   RPD+ Y+++  S++M   + DH+ A K  L  L G                               
Subjt:  MNLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAK-WLELLLGA------------------------------

Query:  --------------------KQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLV-A
                            KQ  V  SS+EAEY +    + +  W+  +LTEL  +     VI+CDN     +  N  FHSR KHI + +HFI + V +
Subjt:  --------------------KQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLV-A

Query:  KEVSLSYYSTH
          + + + STH
Subjt:  KEVSLSYYSTH

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.8e-1426.9Show/hide
Query:  NDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAA-KWLELLLGA---------------------------------------
        + G    DA+ ++ L+G L+YL   R DI ++++  S+F +     H +A  K L  + G                                        
Subjt:  NDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAA-KWLELLLGA---------------------------------------

Query:  -----------KQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAKEVSLSY
                   KQ  V+ SS+EAEY A + AT + +WL +   ELQ      +++FCDN A I +  N  FH RTKHI+   H + +    + +LSY
Subjt:  -----------KQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGASVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAKEVSLSY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTTGAATGAGAAGCTGCAACAAAATGATGGTGCAAAGATGGCCGATGCGCAGCGGTTTAAAAGCCTTGTTGGAGGCTTGATTTATCTAACTCATGCTCGTCCCGA
TATTTTGTATTCTATTGATGTGTTTTCCAGGTTTATGCAATGTTCTTCAAGGGATCATTTCGAAGCAGCAAAGTGGTTGGAGTTATTACTTGGAGCTAAACAAGCAACTG
TTGCTTTATCATCTTCGGAAGCAGAATATGGTGCAGCAACTTTAGCAACATGTCAGGCAATTTGGTTGCGAAGAATGCTAACAGAACTCCAACATGAGCAAGAGGGAGCA
AGTGTGATATTCTGCGACAACAAAGCAATGATCTCAATGACGAAAAATCTGACATTTCATAGCCGGACAAAGCACATTGATATTCGCTTTCATTTTATTGGTGATTTGGT
TGCAAAAGAAGTTTCTCTGTCATATTACAGCACACATAGCAGTGGACTGATATTCTCACAAAAGCTTTGTCAAAGGAGAAGTTGTGTTACTTTAGAGCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGAATTTGAATGAGAAGCTGCAACAAAATGATGGTGCAAAGATGGCCGATGCGCAGCGGTTTAAAAGCCTTGTTGGAGGCTTGATTTATCTAACTCATGCTCGTCCCGA
TATTTTGTATTCTATTGATGTGTTTTCCAGGTTTATGCAATGTTCTTCAAGGGATCATTTCGAAGCAGCAAAGTGGTTGGAGTTATTACTTGGAGCTAAACAAGCAACTG
TTGCTTTATCATCTTCGGAAGCAGAATATGGTGCAGCAACTTTAGCAACATGTCAGGCAATTTGGTTGCGAAGAATGCTAACAGAACTCCAACATGAGCAAGAGGGAGCA
AGTGTGATATTCTGCGACAACAAAGCAATGATCTCAATGACGAAAAATCTGACATTTCATAGCCGGACAAAGCACATTGATATTCGCTTTCATTTTATTGGTGATTTGGT
TGCAAAAGAAGTTTCTCTGTCATATTACAGCACACATAGCAGTGGACTGATATTCTCACAAAAGCTTTGTCAAAGGAGAAGTTGTGTTACTTTAGAGCTATGA
Protein sequenceShow/hide protein sequence
MNLNEKLQQNDGAKMADAQRFKSLVGGLIYLTHARPDILYSIDVFSRFMQCSSRDHFEAAKWLELLLGAKQATVALSSSEAEYGAATLATCQAIWLRRMLTELQHEQEGA
SVIFCDNKAMISMTKNLTFHSRTKHIDIRFHFIGDLVAKEVSLSYYSTHSSGLIFSQKLCQRRSCVTLEL