; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0170821 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0170821
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase Ty1/copia-type domain-containing protein
Genome locationCMiso1.1chr06:26323026..26323562
RNA-Seq ExpressionCmc06g0170821
SyntenyCmc06g0170821
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AQY61295.1 Pol [Coffea arabica]1.4e-7478.77Show/hide
Query:  MSLFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSK-NLPTILF
        + LFYSNKS  +LVGYADAGYLSDPHKARSQT YLFT GGTAISWRS KQ++ AT SNHAEI+AIHEASRECVWLRSMTH+IR+ CGLS  K N PTIL+
Subjt:  MSLFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSK-NLPTILF

Query:  EDNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLRELK
        EDN ACIAQ+KGGYIKGDRTKHISPK F+THDL++NS+I VQQI S +NLADLFTKALPT+TFEKLV +IGM+RL++LK
Subjt:  EDNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLRELK

ERM98543.1 hypothetical protein AMTR_s04311p00004950, partial [Amborella trichopoda]1.4e-7680Show/hide
Query:  LFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFEDN
        LFYSN     L+GYADAGYLSDPHKARSQT YLFTCG TAISWRS KQT++AT SNHAEILAIHEASRECVWLRSMT HIR TCGL+ +K +PTIL+EDN
Subjt:  LFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFEDN

Query:  TACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLREL
         ACIAQ+KGGYIKGDRTKHISPK F+TH+L+EN DI VQQI S DNLADLFTK+LPTSTFEK+VH IGMRRL++L
Subjt:  TACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLREL

KAA0067250.1 Pol [Cucumis melo var. makuwa]4.8e-8892.7Show/hide
Query:  MSLFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFE
        MSLFYSNKSNFDLVGYADAGYLSDP KARSQ  YLFT GGTAISWRSVKQTM AT SNHAEILAIHE SRECVWLRSMTHHIRETCGLSFSKNLPTILFE
Subjt:  MSLFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFE

Query:  DNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLRELK
        DNTACIAQIKGGYIKGDRTKH+SPKLFYTH+LEEN DISVQQISSKDNL DLFTKALPT TFEKLVHNIGM RLRELK
Subjt:  DNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLRELK

KAF7137709.1 hypothetical protein RHSIM_Rhsim07G0041900 [Rhododendron simsii]1.0e-7478.53Show/hide
Query:  MSLFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFE
        + LFYSN S   L+GYADAGYLSDPHKARSQT Y+FTCGG AISWRSVKQT+ AT SNH+EI+AIHE SRECVWLRSM  HIRE CGLS  K+ PTIL+E
Subjt:  MSLFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFE

Query:  DNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLREL
        DN ACI QIKGGYIKGDRTKHISPK FYTHDL++N DI VQQI S DNLADLFTKALPT+TF+KLV NIGMRRL+ L
Subjt:  DNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLREL

TYK21506.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]5.7e-8993.26Show/hide
Query:  MSLFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFE
        M LFYSNKSNFDLVGYADAG+LSDPHKARSQT YLFTC GTAISWRSVKQTM AT SNHAEILAIHEASRECVWLRSMTH+IRETCGLSFSKNLPTILFE
Subjt:  MSLFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFE

Query:  DNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLRELK
        DNTACI QIKGGYIKGDRTKHISPKLFYTHDLEEN DISVQQISSKDNL DLFTKAL TSTFEKLVHNIGMR+LRELK
Subjt:  DNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLRELK

TrEMBL top hitse value%identityAlignment
A0A1U9WYD5 Pol1.9e-7478.21Show/hide
Query:  MSLFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSK-NLPTILF
        + LFYSNKS  +LVGYAD GYLSDPHKARSQT YLFT GGTAISWRS KQ++ AT SNHAEI+AIHEASRECVWLRSMTH+IR+ CGLS  K N PTIL+
Subjt:  MSLFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSK-NLPTILF

Query:  EDNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLRELK
        EDN ACIAQ+KGGYIKGDRTKHISPK F+THDL++NS+I VQQI S +NLADLFTKALPT+TFEKLV +IGM+RL++LK
Subjt:  EDNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLRELK

A0A1U9WYE0 Pol6.5e-7578.77Show/hide
Query:  MSLFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSK-NLPTILF
        + LFYSNKS  +LVGYADAGYLSDPHKARSQT YLFT GGTAISWRS KQ++ AT SNHAEI+AIHEASRECVWLRSMTH+IR+ CGLS  K N PTIL+
Subjt:  MSLFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSK-NLPTILF

Query:  EDNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLRELK
        EDN ACIAQ+KGGYIKGDRTKHISPK F+THDL++NS+I VQQI S +NLADLFTKALPT+TFEKLV +IGM+RL++LK
Subjt:  EDNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLRELK

A0A5D3CKU4 Pol2.3e-8892.7Show/hide
Query:  MSLFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFE
        MSLFYSNKSNFDLVGYADAGYLSDP KARSQ  YLFT GGTAISWRSVKQTM AT SNHAEILAIHE SRECVWLRSMTHHIRETCGLSFSKNLPTILFE
Subjt:  MSLFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFE

Query:  DNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLRELK
        DNTACIAQIKGGYIKGDRTKH+SPKLFYTH+LEEN DISVQQISSKDNL DLFTKALPT TFEKLVHNIGM RLRELK
Subjt:  DNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLRELK

A0A5D3DD63 Retrovirus-related Pol polyprotein from transposon TNT 1-942.7e-8993.26Show/hide
Query:  MSLFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFE
        M LFYSNKSNFDLVGYADAG+LSDPHKARSQT YLFTC GTAISWRSVKQTM AT SNHAEILAIHEASRECVWLRSMTH+IRETCGLSFSKNLPTILFE
Subjt:  MSLFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFE

Query:  DNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLRELK
        DNTACI QIKGGYIKGDRTKHISPKLFYTHDLEEN DISVQQISSKDNL DLFTKAL TSTFEKLVHNIGMR+LRELK
Subjt:  DNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLRELK

U5CXL6 Uncharacterized protein (Fragment)7.0e-7780Show/hide
Query:  LFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFEDN
        LFYSN     L+GYADAGYLSDPHKARSQT YLFTCG TAISWRS KQT++AT SNHAEILAIHEASRECVWLRSMT HIR TCGL+ +K +PTIL+EDN
Subjt:  LFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFEDN

Query:  TACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLREL
         ACIAQ+KGGYIKGDRTKHISPK F+TH+L+EN DI VQQI S DNLADLFTK+LPTSTFEK+VH IGMRRL++L
Subjt:  TACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLREL

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.3e-1529.31Show/hide
Query:  MSLFYSNKSNFD--LVGYADAGYLSDPHKARSQTCYLFTC-GGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTI
        M L +     F+  ++GY D+ +       +S T YLF       I W + +Q  +A  S  AE +A+ EA RE +WL+ +         ++     P  
Subjt:  MSLFYSNKSNFD--LVGYADAGYLSDPHKARSQTCYLFTC-GGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTI

Query:  LFEDNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGM
        ++EDN  CI+ I        R KHI  K  +  +  +N+ I ++ I +++ LAD+FTK LP + F +L   +G+
Subjt:  LFEDNTACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGM

P0CV72 Secreted RxLR effector protein 1611.7e-0838.27Show/hide
Query:  LFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIR
        L ++      LVGY+DA +  D    RS + YLF   G  +SWRS KQ  +A  S   E +A+ EA++E VWL   T   +
Subjt:  LFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.6e-1531.45Show/hide
Query:  LVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFEDNTACIAQIKGG
        L GY DA    D    +S T YLFT  G AISW+S  Q  +A  +  AE +A  E  +E +WL+     +    GL   +    +++ D+ + I   K  
Subjt:  LVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFEDNTACIAQIKGG

Query:  YIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGM
             RTKHI  +  +  ++ ++  + V +IS+ +N AD+ TK +P + FE     +GM
Subjt:  YIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGM

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.7e-0925Show/hide
Query:  LFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFEDN
        +F    +   L  Y+DA +  D     S   Y+   G   ISW S KQ  +   S  AE  ++   S E  W+ S+   +    G+  ++  P +++ DN
Subjt:  LFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFEDN

Query:  TACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRL
              +    +   R KHI+    +  +  ++  + V  +S+ D LAD  TK L  + F+     IG+ R+
Subjt:  TACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.7e-0825.44Show/hide
Query:  LFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFEDN
        +F    +   L  Y+DA +  D     S   Y+   G   ISW S KQ  +   S  AE  ++   S E  W+ S+   +    G+  S   P +++ DN
Subjt:  LFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFEDN

Query:  TACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGM
              +    +   R KHI+    +  +  ++  + V  +S+ D LAD  TK L    F+     IG+
Subjt:  TACIAQIKGGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGM

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.9e-1130.94Show/hide
Query:  LFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFEDN
        LFYS+++   L  ++DA + S     RS   Y    G + ISW+S KQ +++  S  AE  A+  A+ E +WL      ++    L  SK  PT+LF DN
Subjt:  LFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFEDN

Query:  TACIAQIKGGYIKGDRTKHISPKLFYTHDLEENS------DISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLREL
        TA I  I    +  +RTKHI       H + E S        S Q    +D   +  +  L   T   +V   G+  L  L
Subjt:  TACIAQIKGGYIKGDRTKHISPKLFYTHDLEENS------DISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLREL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTTGTTTTATTCAAATAAATCAAACTTTGATCTAGTTGGTTATGCGGATGCTGGATATTTATCGGATCCACACAAAGCAAGATCTCAAACATGTTATCTGTTTAC
ATGTGGAGGAACTGCTATATCTTGGCGATCTGTAAAGCAAACCATGATGGCCACTTTATCGAATCATGCAGAAATTCTTGCAATTCATGAAGCTAGTAGAGAATGTGTAT
GGTTGAGGTCAATGACTCATCATATTCGAGAAACATGTGGTTTGTCTTTCAGTAAAAATTTACCAACAATATTATTTGAAGATAATACCGCATGTATAGCACAAATCAAA
GGAGGGTATATAAAAGGAGATAGAACAAAGCATATCTCACCAAAACTCTTCTATACGCATGACCTTGAAGAAAATAGTGACATCAGTGTTCAACAAATTTCTTCAAAAGA
CAACTTGGCGGACTTATTCACAAAAGCATTACCGACATCAACATTTGAAAAGCTAGTGCACAACATTGGAATGCGACGACTCAGAGAACTTAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTTGTTTTATTCAAATAAATCAAACTTTGATCTAGTTGGTTATGCGGATGCTGGATATTTATCGGATCCACACAAAGCAAGATCTCAAACATGTTATCTGTTTAC
ATGTGGAGGAACTGCTATATCTTGGCGATCTGTAAAGCAAACCATGATGGCCACTTTATCGAATCATGCAGAAATTCTTGCAATTCATGAAGCTAGTAGAGAATGTGTAT
GGTTGAGGTCAATGACTCATCATATTCGAGAAACATGTGGTTTGTCTTTCAGTAAAAATTTACCAACAATATTATTTGAAGATAATACCGCATGTATAGCACAAATCAAA
GGAGGGTATATAAAAGGAGATAGAACAAAGCATATCTCACCAAAACTCTTCTATACGCATGACCTTGAAGAAAATAGTGACATCAGTGTTCAACAAATTTCTTCAAAAGA
CAACTTGGCGGACTTATTCACAAAAGCATTACCGACATCAACATTTGAAAAGCTAGTGCACAACATTGGAATGCGACGACTCAGAGAACTTAAGTGA
Protein sequenceShow/hide protein sequence
MSLFYSNKSNFDLVGYADAGYLSDPHKARSQTCYLFTCGGTAISWRSVKQTMMATLSNHAEILAIHEASRECVWLRSMTHHIRETCGLSFSKNLPTILFEDNTACIAQIK
GGYIKGDRTKHISPKLFYTHDLEENSDISVQQISSKDNLADLFTKALPTSTFEKLVHNIGMRRLRELK