CuGenDBv2

Pay0020587 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0020587
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Gag-pol polyprotein
Genome location: chr05:17194115..17194607
RNA-Seq Expression: Pay0020587
Synteny: Pay0020587
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
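The genome location field can be split into a chromosome name and a coordinate pair. The sketch below is one way to do that; it assumes the coordinates are 1-based and inclusive, which this page does not state, and the function names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr05:17194115..17194607")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the locus spans 493 bp, which is longer than the 441-nt CDS listed for this gene.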


Homology
GenBank top hits (e value, % identity, alignment)
KAA0032212.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]; e value 7.7e-60; identity 81.76%
Query:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA
        MKSDFEMSMV ELSCFL LQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNG VVDHKLYRSMVGSLLYLTASR DIAYA+GICA
Subjt:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA

Query:  RFQSDPRMT-----------------LEFCILLTQPLPWLDIVMLIGLVLQMIGKALLE
        RFQS PR +                    CILLTQPLPWLDIVMLI LVLQMIGKALLE
Subjt:  RFQSDPRMT-----------------LEFCILLTQPLPWLDIVMLIGLVLQMIGKALLE

KAA0051798.1 gag-pol polyprotein [Cucumis melo var. makuwa]; e value 1.2e-44; identity 84.11%
Query:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA
        MKS+FEMS+VGELSCFLGLQIKQRSEG+FISQEKYAKN+VKKFGLDQ QHKRTPAATH K+TKD+ G  VDHKLYRSM+GSLLYL ASRPDI YA+GICA
Subjt:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA

Query:  RFQSDPR
        R+QS+PR
Subjt:  RFQSDPR

TYK21443.1 gag-pol polyprotein [Cucumis melo var. makuwa]; e value 1.2e-44; identity 84.11%
Query:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA
        MKS+FEMS+VGELSCFLGLQIKQRSEG+FISQEKYAKN+VKKFGLDQ QHKRTPAATH K+TKD+ G  VDHKLYRSM+GSLLYL ASRPDI YA+GICA
Subjt:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA

Query:  RFQSDPR
        R+QS+PR
Subjt:  RFQSDPR

TYK22184.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]; e value 6.3e-62; identity 83.12%
Query:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA
        MKSDFEMSMVGELSCFL LQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTP ATHVKVTKDVNG VVDHKLYRSMVGSLLYLT SR DIAYA+GICA
Subjt:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA

Query:  RFQSDPRM---------------TLEF--CILLTQPLPWLDIVMLIGLVLQMIGKALLEV
        RFQSDPR                T +F  CILLTQPLPWLDIVMLIGLVLQMIGKALLE+
Subjt:  RFQSDPRM---------------TLEF--CILLTQPLPWLDIVMLIGLVLQMIGKALLEV

XP_016899484.1 PREDICTED: uncharacterized mitochondrial protein AtMg00810-like [Cucumis melo]; e value 2.6e-63; identity 84.05%
Query:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA
        MKSDFEMSMVGELSCFL LQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTP ATHVKVTKDVNG VVDHKLYRSMVGSLLYLT SR DIAYA+GICA
Subjt:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA

Query:  RFQSDPRM---------------TLEF--CILLTQPLPWLDIVMLIGLVLQMIGKALLEVVSL
        RFQSDPR                T +F  CILLTQPLPWLDIVMLIGLVLQMIGKALLEVVSL
Subjt:  RFQSDPRM---------------TLEF--CILLTQPLPWLDIVMLIGLVLQMIGKALLEVVSL

TrEMBL top hits (e value, % identity, alignment)
A0A1S4DU18 uncharacterized mitochondrial protein AtMg00810-like; e value 1.2e-63; identity 84.05%
Query:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA
        MKSDFEMSMVGELSCFL LQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTP ATHVKVTKDVNG VVDHKLYRSMVGSLLYLT SR DIAYA+GICA
Subjt:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA

Query:  RFQSDPRM---------------TLEF--CILLTQPLPWLDIVMLIGLVLQMIGKALLEVVSL
        RFQSDPR                T +F  CILLTQPLPWLDIVMLIGLVLQMIGKALLEVVSL
Subjt:  RFQSDPRM---------------TLEF--CILLTQPLPWLDIVMLIGLVLQMIGKALLEVVSL

A0A5A7SMF2 Putative gag-pol polyprotein; e value 3.7e-60; identity 81.76%
Query:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA
        MKSDFEMSMV ELSCFL LQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNG VVDHKLYRSMVGSLLYLTASR DIAYA+GICA
Subjt:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA

Query:  RFQSDPRMT-----------------LEFCILLTQPLPWLDIVMLIGLVLQMIGKALLE
        RFQS PR +                    CILLTQPLPWLDIVMLI LVLQMIGKALLE
Subjt:  RFQSDPRMT-----------------LEFCILLTQPLPWLDIVMLIGLVLQMIGKALLE

A0A5A7U931 Gag-pol polyprotein; e value 5.8e-45; identity 84.11%
Query:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA
        MKS+FEMS+VGELSCFLGLQIKQRSEG+FISQEKYAKN+VKKFGLDQ QHKRTPAATH K+TKD+ G  VDHKLYRSM+GSLLYL ASRPDI YA+GICA
Subjt:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA

Query:  RFQSDPR
        R+QS+PR
Subjt:  RFQSDPR

A0A5D3DCZ8 Gag-pol polyprotein; e value 5.8e-45; identity 84.11%
Query:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA
        MKS+FEMS+VGELSCFLGLQIKQRSEG+FISQEKYAKN+VKKFGLDQ QHKRTPAATH K+TKD+ G  VDHKLYRSM+GSLLYL ASRPDI YA+GICA
Subjt:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA

Query:  RFQSDPR
        R+QS+PR
Subjt:  RFQSDPR

A0A5D3DF22 Putative gag-pol polyprotein; e value 3.1e-62; identity 83.12%
Query:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA
        MKSDFEMSMVGELSCFL LQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTP ATHVKVTKDVNG VVDHKLYRSMVGSLLYLT SR DIAYA+GICA
Subjt:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA

Query:  RFQSDPRM---------------TLEF--CILLTQPLPWLDIVMLIGLVLQMIGKALLEV
        RFQSDPR                T +F  CILLTQPLPWLDIVMLIGLVLQMIGKALLE+
Subjt:  RFQSDPRM---------------TLEF--CILLTQPLPWLDIVMLIGLVLQMIGKALLEV

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein; e value 7.1e-08; identity 28.71%
Query:  FEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLY-LTASRPDIAYAIGICARFQ
        F M+ + E+  F+G++I+ + + I++SQ  Y K ++ KF ++ C    TP  + +   + +N     +   RS++G L+Y +  +RPD+  A+ I +R+ 
Subjt:  FEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLY-LTASRPDIAYAIGICARFQ

Query:  S
        S
Subjt:  S

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 1.8e-11; identity 33.91%
Query:  MKSDFEMSMVGELSCFLGLQI--KQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHK------LYRSMVGSLLY-LTASRPD
        +   F+M  +G     LG++I  ++ S  +++SQEKY + V+++F +   +   TP A H+K++K +    V+ K       Y S VGSL+Y +  +RPD
Subjt:  MKSDFEMSMVGELSCFLGLQI--KQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHK------LYRSMVGSLLY-LTASRPD

Query:  IAYAIGICARFQSDP
        IA+A+G+ +RF  +P
Subjt:  IAYAIGICARFQSDP

P92519 Uncharacterized mitochondrial protein AtMg00810; e value 1.1e-11; identity 34.53%
Query:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVN-GIVVDHKLYRSMVGSLLYLTASRPDIAYAIGIC
        + S F M  +G +  FLG+QIK    G+F+SQ KYA+ ++   G+  C+   TP    +K+   V+     D   +RS+VG+L YLT +RPDI+YA+ I 
Subjt:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVN-GIVVDHKLYRSMVGSLLYLTASRPDIAYAIGIC

Query:  ARFQSDPRMTLEFCILLTQPLPWLDIVMLIGLVLQMIGK
         +   +P  TL    LL + L ++   +  GL +    K
Subjt:  ARFQSDPRMTLEFCILLTQPLPWLDIVMLIGLVLQMIGK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value 1.6e-07; identity 32.63%
Query:  ELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICARFQSDP
        EL  FLG++ K+   G+ +SQ +Y  +++ +  +   +   TP A   K++      + D   YR +VGSL YL  +RPDI+YA+   ++F   P
Subjt:  ELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICARFQSDP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value 4.2e-08; identity 31.58%
Query:  ELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICARFQSDP
        +L  FLG++ K+  +G+ +SQ +Y  +++ +  +   +   TP AT  K+T      + D   YR +VGSL YL  +RPD++YA+   +++   P
Subjt:  ELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICARFQSDP

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8; e value 1.5e-13; identity 33.91%
Query:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA
        +KS F++  +G L  FLGL+I + + GI I Q KYA +++ + GL  C+    P    V  +    G  VD K YR ++G L+YL  +R DI++A+   +
Subjt:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICA

Query:  RFQSDPRMTLEFCIL
        +F   PR+  +  ++
Subjt:  RFQSDPRMTLEFCIL

ATMG00810.1 DNA/RNA polymerases superfamily protein; e value 7.5e-13; identity 34.53%
Query:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVN-GIVVDHKLYRSMVGSLLYLTASRPDIAYAIGIC
        + S F M  +G +  FLG+QIK    G+F+SQ KYA+ ++   G+  C+   TP    +K+   V+     D   +RS+VG+L YLT +RPDI+YA+ I 
Subjt:  MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVN-GIVVDHKLYRSMVGSLLYLTASRPDIAYAIGIC

Query:  ARFQSDPRMTLEFCILLTQPLPWLDIVMLIGLVLQMIGK
         +   +P  TL    LL + L ++   +  GL +    K
Subjt:  ARFQSDPRMTLEFCILLTQPLPWLDIVMLIGLVLQMIGK
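
The % identity values reported for the hits above relate to the middle line of each alignment, where a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below shows how such a figure could be recomputed from a midline string. It assumes the midline is supplied at full alignment width (including any trailing blanks) and that identity is identical columns divided by alignment length; the function name `percent_identity` and the example midline are illustrative, not taken from this record.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    ASSUMPTION: `midline` covers every alignment column, so its length
    equals the alignment length (pad with spaces if a scrape trimmed it).
    """
    if not midline:
        raise ValueError("midline must not be empty")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Toy midline: 9 identical columns, one '+', one blank, 11 columns total.
toy_midline = "MKS+FEMS VG"
print(f"{percent_identity(toy_midline):.2f}")
```

As a consistency check on the scale of the numbers, an identity of 81.76% over an alignment of 159 columns corresponds to 130 identical columns.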


Sequences
CDS sequence
ATGAAATCAGATTTTGAAATGAGCATGGTAGGAGAACTTTCCTGTTTTCTAGGTCTACAGATCAAACAGAGAAGTGAGGGTATATTTATATCTCAAGAGAAGTATGCCAA
GAACGTGGTCAAAAAATTTGGTCTGGATCAGTGTCAACATAAAAGGACTCCAGCAGCGACACATGTTAAAGTTACTAAAGATGTTAATGGTATAGTAGTAGATCACAAAC
TGTACAGGAGCATGGTTGGAAGCCTTCTATATTTAACGGCAAGCAGACCTGACATTGCCTATGCTATTGGCATATGTGCTCGATTTCAGTCAGATCCTCGCATGACTTTG
GAATTTTGTATTCTTCTGACACAACCTCTACCTTGGTTGGATATTGTGATGCTGATTGGGCTGGTTCTCCAGATGATAGGAAAAGCACTTTTGGAGGTTGTTTCTTTGTA
A
mRNA sequence
ATGAAATCAGATTTTGAAATGAGCATGGTAGGAGAACTTTCCTGTTTTCTAGGTCTACAGATCAAACAGAGAAGTGAGGGTATATTTATATCTCAAGAGAAGTATGCCAA
GAACGTGGTCAAAAAATTTGGTCTGGATCAGTGTCAACATAAAAGGACTCCAGCAGCGACACATGTTAAAGTTACTAAAGATGTTAATGGTATAGTAGTAGATCACAAAC
TGTACAGGAGCATGGTTGGAAGCCTTCTATATTTAACGGCAAGCAGACCTGACATTGCCTATGCTATTGGCATATGTGCTCGATTTCAGTCAGATCCTCGCATGACTTTG
GAATTTTGTATTCTTCTGACACAACCTCTACCTTGGTTGGATATTGTGATGCTGATTGGGCTGGTTCTCCAGATGATAGGAAAAGCACTTTTGGAGGTTGTTTCTTTGTA
A
Protein sequence
MKSDFEMSMVGELSCFLGLQIKQRSEGIFISQEKYAKNVVKKFGLDQCQHKRTPAATHVKVTKDVNGIVVDHKLYRSMVGSLLYLTASRPDIAYAIGICARFQSDPRMTL
EFCILLTQPLPWLDIVMLIGLVLQMIGKALLEVVSL
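
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code applies to this gene; the function name `translate` is hypothetical, and only the first 36 nucleotides of the CDS are used to keep the example short.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 0.

    Whitespace and newlines are ignored so a wrapped sequence can be
    pasted directly. Trailing bases that do not form a full codon are
    dropped. With to_stop=True, translation ends before the first stop.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


# First 36 nt of the CDS listed above.
cds_prefix = "ATGAAATCAGATTTTGAAATGAGCATGGTAGGAGAA"
print(translate(cds_prefix))  # MKSDFEMSMVGE
```

Running the same function on the full CDS (with the wrapped lines pasted as one string) should give a sequence that can be compared with the protein sequence above; the final TAA codon is a stop and is omitted when `to_stop` is true.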