CuGenDBv2

PI0020928 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0020928
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Gag/pol protein
Genome location: chr08:1005670..1010257
RNA-Seq Expression: PI0020928
Synteny: PI0020928
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits    e value    % identity    Alignment
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]    1.4e-86    71.19
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++T+R+ ++RW KANEKAR YILAS+S+VL+KK ESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE

Query:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVM KI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSS  KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]    1.4e-86    71.19
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++T+R+ ++RW KANEKAR YILAS+S+VL+KK ESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE

Query:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVM KI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSS  KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]    1.4e-86    71.19
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++T+R+ ++RW KANEKAR YILAS+S+VL+KK ESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE

Query:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVM KI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSS  KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]    1.4e-86    71.19
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++T+R+ ++RW KANEKAR YILAS+S+VL+KK ESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE

Query:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVM KI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSS  KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]    1.4e-86    71.19
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++T+R+ ++RW KANEKAR YILAS+S+VL+KK ESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE

Query:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVM KI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSS  KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG
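The % identity column can be checked against the alignment midline. The sketch below assumes the usual BLAST-style convention: a letter on the midline marks an identical column, `+` marks a conservative substitution, a space marks a mismatch or gap, and % identity is identical columns divided by alignment length. The function name `percent_identity` and the tuple layout are hypothetical, chosen only for illustration.

```python
def percent_identity(blocks):
    """Compute % identity from (query_segment, midline_segment) pairs.

    Assumption: each midline is aligned column-for-column with its query
    segment; trailing spaces lost in copying are restored by padding.
    """
    identical = 0
    length = 0
    for query, midline in blocks:
        # Pad the midline so stripped trailing spaces still count as columns.
        midline = midline.ljust(len(query))
        # Letters on the midline are identical residues; '+' and ' ' are not.
        identical += sum(1 for ch in midline if ch.isalpha())
        # Gap characters in the query still occupy an alignment column.
        length += len(query)
    if length == 0:
        raise ValueError("empty alignment")
    return 100.0 * identical / length


# Usage: the last block of the alignment above (query line, midline).
blocks = [
    ("QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG",
     "QK GEANVA S ++F +GS+SGTKS PSSS  KK +KKKGG+G"),
]
print(round(percent_identity(blocks), 2))
```

Passing all three blocks of a hit gives the figure for the whole alignment rather than for one segment.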

TrEMBL top hits    e value    % identity    Alignment
A0A5A7SMH8 Gag/pol protein    6.7e-87    71.19
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++T+R+ ++RW KANEKAR YILAS+S+VL+KK ESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE

Query:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVM KI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSS  KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG

A0A5A7TU93 Gag/pol protein    6.7e-87    71.19
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++T+R+ ++RW KANEKAR YILAS+S+VL+KK ESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE

Query:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVM KI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSS  KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG

A0A5A7TWB9 Gag/pol protein    6.7e-87    71.19
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++T+R+ ++RW KANEKAR YILAS+S+VL+KK ESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE

Query:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVM KI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSS  KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG

A0A5D3CPJ6 Gag/pol protein    6.7e-87    71.19
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++T+R+ ++RW KANEKAR YILAS+S+VL+KK ESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE

Query:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVM KI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSS  KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG

A0A5D3CSZ6 Gag/pol protein    6.7e-87    71.19
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++T+R+ ++RW KANEKAR YILAS+S+VL+KK ESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQE

Query:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVM KI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIQHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSS  KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSRTKKIQKKKGGKG

SwissProt top hits    e value    % identity    Alignment
P04146 Copia protein    2.1e-05    25.49
Query:  GDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQEMFGQPSIQIQHEAIK
        G+ YA WK  +  +L   D+  V+    P            +  D W KA   A+  I+  +SD       S +TARQI+++L  ++ + S+  Q    K
Subjt:  GDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQEMFGQPSIQIQHEAIK

Query:  YVYNARMKEGQSVRE--HVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSF
         + + ++    S+    H+ D ++   +A   GA I+E  ++S +L +LP  +
Subjt:  YVYNARMKEGQSVRE--HVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSF

Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGTCTTCCTCAATTATAGCACTGCTTAAAAATGAAAAATTAACCGGTGATAATTATGCGACGTGGAAATCGAACCTGAATATGATTCTTGTAATCGATGATCTACGCTT
CGTCTTAATGGAGGAATGTCCTCCTTCCCCTGTTCGAAATGCATCCCAGACCATCAGGGATGCACATGACCGCTGGACAAAGGCCAATGAAAAGGCCCGAGTCTACATCT
TGGCCAGTATGTCTGACGTACTGAGCAAGAAACGTGAGTCTATGGTCACTGCACGTCAGATCATGGACTCCCTCCAGGAGATGTTTGGACAACCGTCCATTCAAATCCAG
CACGAGGCTATTAAATACGTTTATAATGCCCGTATGAAGGAAGGCCAATCTGTTAGAGAACATGTTCTCGACATGATGGTCCAATTCAACGTGGCAGAAACGAACGGAGC
GGTCATTGACGAGCAAAGTCAGGTATCCTTTATCTTAGAATCTCTTCCGAAGAGCTTCCTTCAATTCCGTAGCAATGCGGTTATGAAGAAGATAGAGTATACCATGACTA
CCCTCCTTAACGAGTTGCAAACTTTTCAATCTCTTATGAAAAATAAGGGACAGAAAGAAGGAGAGGCAAATGTTGCCCATTCCAAGAGGTTTCAGAAGGGTTCATCCTCT
GGAACCAAGTCTGCACCTTCATCTTCTAGGACTAAGAAAATCCAGAAGAAGAAAGGAGGAAAGGGGAAAGTGATCCAACGCATTCGTGTAGGTGACATGCGAGTGAGGGT
ATCTCATGCAATGAGTTTGCATAAGACCGGACCACGAAATAGTAACCACTAG
mRNA sequence
ATGTCTTCCTCAATTATAGCACTGCTTAAAAATGAAAAATTAACCGGTGATAATTATGCGACGTGGAAATCGAACCTGAATATGATTCTTGTAATCGATGATCTACGCTT
CGTCTTAATGGAGGAATGTCCTCCTTCCCCTGTTCGAAATGCATCCCAGACCATCAGGGATGCACATGACCGCTGGACAAAGGCCAATGAAAAGGCCCGAGTCTACATCT
TGGCCAGTATGTCTGACGTACTGAGCAAGAAACGTGAGTCTATGGTCACTGCACGTCAGATCATGGACTCCCTCCAGGAGATGTTTGGACAACCGTCCATTCAAATCCAG
CACGAGGCTATTAAATACGTTTATAATGCCCGTATGAAGGAAGGCCAATCTGTTAGAGAACATGTTCTCGACATGATGGTCCAATTCAACGTGGCAGAAACGAACGGAGC
GGTCATTGACGAGCAAAGTCAGGTATCCTTTATCTTAGAATCTCTTCCGAAGAGCTTCCTTCAATTCCGTAGCAATGCGGTTATGAAGAAGATAGAGTATACCATGACTA
CCCTCCTTAACGAGTTGCAAACTTTTCAATCTCTTATGAAAAATAAGGGACAGAAAGAAGGAGAGGCAAATGTTGCCCATTCCAAGAGGTTTCAGAAGGGTTCATCCTCT
GGAACCAAGTCTGCACCTTCATCTTCTAGGACTAAGAAAATCCAGAAGAAGAAAGGAGGAAAGGGGAAAGTGATCCAACGCATTCGTGTAGGTGACATGCGAGTGAGGGT
ATCTCATGCAATGAGTTTGCATAAGACCGGACCACGAAATAGTAACCACTAG
Protein sequence
MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTIRDAHDRWTKANEKARVYILASMSDVLSKKRESMVTARQIMDSLQEMFGQPSIQIQ
HEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMKKIEYTMTTLLNELQTFQSLMKNKGQKEGEANVAHSKRFQKGSSS
GTKSAPSSSRTKKIQKKKGGKGKVIQRIRVGDMRVRVSHAMSLHKTGPRNSNH
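
The protein sequence can be reproduced from the CDS by translating it codon by codon. This is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; the helper name `translate` is hypothetical and is not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS to protein, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted as-is.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Unknown codons (for example, ones containing N) become 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Usage: the first 30 nucleotides of the CDS sequence above.
print(translate("ATGTCTTCCTCAATTATAGCACTGCTTAAA"))  # MSSSIIALLK
```

Pasting the full CDS into `translate` allows a direct comparison with the protein sequence listed above.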