; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0012977 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0012977
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionGag-protease polyprotein
Genome locationchr08:15574309..15575575
RNA-Seq ExpressionPay0012977
SyntenyPay0012977
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025111.1 gag-protease polyprotein [Cucumis melo var. makuwa]1.3e-8776Show/hide
Query:  QDRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDM---TYDAEFDMLSRFNPNVAKDEEAKTKKFVKGLILDLQGIVRA
        +DRGT WW+T ERMLGGDV+KI  EQFKESFY +FFS          FLN E+GDM    YDAEFDMLSRF PNV KDE+A+T+KFV+GL LDLQGIVRA
Subjt:  QDRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDM---TYDAEFDMLSRFNPNVAKDEEAKTKKFVKGLILDLQGIVRA

Query:  FRPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCR
         RPTTH DALRL LDLSLHERA LSK AGRGS L QKRK+ESQPDLTPQ+NLR RGVFQRHRRELAAAGRT RELPVC SCGRVHG  CLV S VCFKCR
Subjt:  FRPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCR

Query:  QPGHTADACPQKLIETTPHQHHTSQ
        QP HTADACP+K IETTPHQ   SQ
Subjt:  QPGHTADACPQKLIETTPHQHHTSQ

KAA0035800.1 gag-protease polyprotein [Cucumis melo var. makuwa]3.0e-10890.18Show/hide
Query:  DRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDMT---YDAEFDMLSRFNPNVAKDEEAKTKKFVKGLILDLQGIVRAF
        DRGTAWWQT ERMLGGDVSKITCEQFKESFYA+FFS          FLN ERGDMT   YDAEFDMLSRF PNV KDEEAKTKKFVKGL LDLQGIVRAF
Subjt:  DRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDMT---YDAEFDMLSRFNPNVAKDEEAKTKKFVKGLILDLQGIVRAF

Query:  RPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCRQ
        RPTTHADALRLALDL+LHERAGLSKAAGRGSALGQKRKVESQPDLT QQNLRLRGVFQRHR ELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCRQ
Subjt:  RPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCRQ

Query:  PGHTADACPQKLIETTPHQHHTSQ
        PGHTADACPQKLIETTPHQHHTSQ
Subjt:  PGHTADACPQKLIETTPHQHHTSQ

TYK19353.1 gag-protease polyprotein [Cucumis melo var. makuwa]1.3e-8776Show/hide
Query:  QDRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDM---TYDAEFDMLSRFNPNVAKDEEAKTKKFVKGLILDLQGIVRA
        +DRGT WW+T ERMLGGDV+KI  EQFKESFY +FFS          FLN E+GDM    YDAEFDMLSRF PNV KDE+A+T+KFV+GL LDLQGIVRA
Subjt:  QDRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDM---TYDAEFDMLSRFNPNVAKDEEAKTKKFVKGLILDLQGIVRA

Query:  FRPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCR
         RPTTH DALRL LDLSLHERA LSK AGRGS L QKRK+ESQPDLTPQ+NLR RGVFQRHRRELAAAGRT RELPVC SCGRVHG  CLV S VCFKCR
Subjt:  FRPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCR

Query:  QPGHTADACPQKLIETTPHQHHTSQ
        QP HTADACP+K IETTPHQ   SQ
Subjt:  QPGHTADACPQKLIETTPHQHHTSQ

TYK29818.1 gag-pol polyprotein [Cucumis melo var. makuwa]7.7e-10487.11Show/hide
Query:  QDRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDMT---YDAEFDMLSRFNPNVAKDEEAKTKKFVKGLILDLQGIVRA
        +D+GTAWWQT ERMLG  VSKITCEQFKESFYA+FFS          FLN ERGDMT   YDA+FDMLSRF PNVAKDEEAKTKKFVKGL LDLQGIVRA
Subjt:  QDRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDMT---YDAEFDMLSRFNPNVAKDEEAKTKKFVKGLILDLQGIVRA

Query:  FRPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCR
        F+PTTHAD LRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGR+FRELPV PS GRVHGDCCLVGSGVCFKCR
Subjt:  FRPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCR

Query:  QPGHTADACPQKLIETTPHQHHTSQ
        QPGHTADACPQKLIETTPHQHHT Q
Subjt:  QPGHTADACPQKLIETTPHQHHTSQ

XP_008442300.1 PREDICTED: uncharacterized protein LOC103486213 [Cucumis melo]9.1e-11389.74Show/hide
Query:  MSIRSLVTRQDRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDMT---YDAEFDMLSRFNPNVAKDEEAKTKKFVKGLI
        M+IRSLVTR+DRGTAWWQT ERMLGGDVSKITCEQFKESFYA+FFS          FLN ERGDMT   YDAEFDMLSRF PNV KDEEAKTKKFVKGL 
Subjt:  MSIRSLVTRQDRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDMT---YDAEFDMLSRFNPNVAKDEEAKTKKFVKGLI

Query:  LDLQGIVRAFRPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLV
        LDLQGIVRAFRPTTHADALRLALDL+LHERAGLSKAAGRGSALGQKRKVESQPDLT QQNLRLRGVFQRHR ELAAAGRTFRELPVCPSCGRVHGDCCLV
Subjt:  LDLQGIVRAFRPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLV

Query:  GSGVCFKCRQPGHTADACPQKLIETTPHQHHTSQ
        GSGVCFKCRQPGHTADACPQKLIETTPHQHHTSQ
Subjt:  GSGVCFKCRQPGHTADACPQKLIETTPHQHHTSQ

TrEMBL top hitse value%identityAlignment
A0A1S3B610 uncharacterized protein LOC1034862134.4e-11389.74Show/hide
Query:  MSIRSLVTRQDRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDMT---YDAEFDMLSRFNPNVAKDEEAKTKKFVKGLI
        M+IRSLVTR+DRGTAWWQT ERMLGGDVSKITCEQFKESFYA+FFS          FLN ERGDMT   YDAEFDMLSRF PNV KDEEAKTKKFVKGL 
Subjt:  MSIRSLVTRQDRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDMT---YDAEFDMLSRFNPNVAKDEEAKTKKFVKGLI

Query:  LDLQGIVRAFRPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLV
        LDLQGIVRAFRPTTHADALRLALDL+LHERAGLSKAAGRGSALGQKRKVESQPDLT QQNLRLRGVFQRHR ELAAAGRTFRELPVCPSCGRVHGDCCLV
Subjt:  LDLQGIVRAFRPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLV

Query:  GSGVCFKCRQPGHTADACPQKLIETTPHQHHTSQ
        GSGVCFKCRQPGHTADACPQKLIETTPHQHHTSQ
Subjt:  GSGVCFKCRQPGHTADACPQKLIETTPHQHHTSQ

A0A5A7SLJ3 Gag-protease polyprotein6.4e-8876Show/hide
Query:  QDRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDM---TYDAEFDMLSRFNPNVAKDEEAKTKKFVKGLILDLQGIVRA
        +DRGT WW+T ERMLGGDV+KI  EQFKESFY +FFS          FLN E+GDM    YDAEFDMLSRF PNV KDE+A+T+KFV+GL LDLQGIVRA
Subjt:  QDRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDM---TYDAEFDMLSRFNPNVAKDEEAKTKKFVKGLILDLQGIVRA

Query:  FRPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCR
         RPTTH DALRL LDLSLHERA LSK AGRGS L QKRK+ESQPDLTPQ+NLR RGVFQRHRRELAAAGRT RELPVC SCGRVHG  CLV S VCFKCR
Subjt:  FRPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCR

Query:  QPGHTADACPQKLIETTPHQHHTSQ
        QP HTADACP+K IETTPHQ   SQ
Subjt:  QPGHTADACPQKLIETTPHQHHTSQ

A0A5A7SXN2 Gag-protease polyprotein1.5e-10890.18Show/hide
Query:  DRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDMT---YDAEFDMLSRFNPNVAKDEEAKTKKFVKGLILDLQGIVRAF
        DRGTAWWQT ERMLGGDVSKITCEQFKESFYA+FFS          FLN ERGDMT   YDAEFDMLSRF PNV KDEEAKTKKFVKGL LDLQGIVRAF
Subjt:  DRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDMT---YDAEFDMLSRFNPNVAKDEEAKTKKFVKGLILDLQGIVRAF

Query:  RPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCRQ
        RPTTHADALRLALDL+LHERAGLSKAAGRGSALGQKRKVESQPDLT QQNLRLRGVFQRHR ELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCRQ
Subjt:  RPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCRQ

Query:  PGHTADACPQKLIETTPHQHHTSQ
        PGHTADACPQKLIETTPHQHHTSQ
Subjt:  PGHTADACPQKLIETTPHQHHTSQ

A0A5D3D772 Gag-protease polyprotein6.4e-8876Show/hide
Query:  QDRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDM---TYDAEFDMLSRFNPNVAKDEEAKTKKFVKGLILDLQGIVRA
        +DRGT WW+T ERMLGGDV+KI  EQFKESFY +FFS          FLN E+GDM    YDAEFDMLSRF PNV KDE+A+T+KFV+GL LDLQGIVRA
Subjt:  QDRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDM---TYDAEFDMLSRFNPNVAKDEEAKTKKFVKGLILDLQGIVRA

Query:  FRPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCR
         RPTTH DALRL LDLSLHERA LSK AGRGS L QKRK+ESQPDLTPQ+NLR RGVFQRHRRELAAAGRT RELPVC SCGRVHG  CLV S VCFKCR
Subjt:  FRPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCR

Query:  QPGHTADACPQKLIETTPHQHHTSQ
        QP HTADACP+K IETTPHQ   SQ
Subjt:  QPGHTADACPQKLIETTPHQHHTSQ

A0A5D3E207 Gag-pol polyprotein3.7e-10487.11Show/hide
Query:  QDRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDMT---YDAEFDMLSRFNPNVAKDEEAKTKKFVKGLILDLQGIVRA
        +D+GTAWWQT ERMLG  VSKITCEQFKESFYA+FFS          FLN ERGDMT   YDA+FDMLSRF PNVAKDEEAKTKKFVKGL LDLQGIVRA
Subjt:  QDRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFS----------FLNFERGDMT---YDAEFDMLSRFNPNVAKDEEAKTKKFVKGLILDLQGIVRA

Query:  FRPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCR
        F+PTTHAD LRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGR+FRELPV PS GRVHGDCCLVGSGVCFKCR
Subjt:  FRPTTHADALRLALDLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCR

Query:  QPGHTADACPQKLIETTPHQHHTSQ
        QPGHTADACPQKLIETTPHQHHT Q
Subjt:  QPGHTADACPQKLIETTPHQHHTSQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTATACGGTCTCTCGTCACTCGACAGGATAGAGGCACCGCGTGGTGGCAGACTACTGAGAGGATGCTGGGTGGAGATGTCAGTAAGATAACTTGCGAGCAGTTCAA
GGAGAGCTTCTATGCTCAGTTCTTTTCTTTTCTGAACTTTGAGCGAGGCGACATGACGTATGACGCTGAGTTTGACATGCTATCCCGTTTTAACCCTAATGTTGCAAAGG
ATGAGGAGGCCAAGACCAAGAAGTTCGTCAAAGGTCTCATACTAGATCTCCAGGGTATCGTTCGAGCCTTCAGGCCAACTACCCATGCTGATGCTTTACGCCTGGCACTA
GACTTGAGTCTGCATGAGAGAGCTGGTCTGTCCAAGGCTGCAGGCAGAGGGTCAGCCCTTGGTCAGAAAAGGAAGGTTGAGTCGCAGCCTGACTTGACACCGCAACAAAA
TCTGAGGTTACGAGGTGTCTTCCAGCGGCATCGTCGAGAACTTGCAGCAGCCGGAAGAACCTTTAGAGAGCTACCTGTTTGTCCTAGCTGTGGAAGAGTTCATGGAGATT
GTTGCTTGGTCGGGAGTGGAGTTTGCTTCAAATGTAGGCAACCAGGGCATACCGCTGATGCTTGTCCTCAGAAACTCATTGAGACTACCCCGCACCAGCATCACACATCC
CAATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTATACGGTCTCTCGTCACTCGACAGGATAGAGGCACCGCGTGGTGGCAGACTACTGAGAGGATGCTGGGTGGAGATGTCAGTAAGATAACTTGCGAGCAGTTCAA
GGAGAGCTTCTATGCTCAGTTCTTTTCTTTTCTGAACTTTGAGCGAGGCGACATGACGTATGACGCTGAGTTTGACATGCTATCCCGTTTTAACCCTAATGTTGCAAAGG
ATGAGGAGGCCAAGACCAAGAAGTTCGTCAAAGGTCTCATACTAGATCTCCAGGGTATCGTTCGAGCCTTCAGGCCAACTACCCATGCTGATGCTTTACGCCTGGCACTA
GACTTGAGTCTGCATGAGAGAGCTGGTCTGTCCAAGGCTGCAGGCAGAGGGTCAGCCCTTGGTCAGAAAAGGAAGGTTGAGTCGCAGCCTGACTTGACACCGCAACAAAA
TCTGAGGTTACGAGGTGTCTTCCAGCGGCATCGTCGAGAACTTGCAGCAGCCGGAAGAACCTTTAGAGAGCTACCTGTTTGTCCTAGCTGTGGAAGAGTTCATGGAGATT
GTTGCTTGGTCGGGAGTGGAGTTTGCTTCAAATGTAGGCAACCAGGGCATACCGCTGATGCTTGTCCTCAGAAACTCATTGAGACTACCCCGCACCAGCATCACACATCC
CAATAG
Protein sequenceShow/hide protein sequence
MSIRSLVTRQDRGTAWWQTTERMLGGDVSKITCEQFKESFYAQFFSFLNFERGDMTYDAEFDMLSRFNPNVAKDEEAKTKKFVKGLILDLQGIVRAFRPTTHADALRLAL
DLSLHERAGLSKAAGRGSALGQKRKVESQPDLTPQQNLRLRGVFQRHRRELAAAGRTFRELPVCPSCGRVHGDCCLVGSGVCFKCRQPGHTADACPQKLIETTPHQHHTS
Q