CuGenDBv2

PI0028755 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0028755
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Gag/pol protein
Genome location: chr06:21242294..21246877
RNA-Seq Expression: PI0028755
Synteny: PI0028755
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
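
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the helper below (a hypothetical `parse_location`, not part of CuGenDB) splits such a string and computes the span length, assuming the coordinates are 1-based and inclusive; the page itself does not state the coordinate convention.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr06:21242294..21246877")
# Assumption: 1-based inclusive coordinates, so both endpoints count.
span = end - start + 1
print(chrom, start, end, span)
```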


Homology
GenBank top hits    e value    % identity    Alignment
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]    2.8e-90    73.25
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++TVR+ Y+RW KANEKAR YILAS+S+VL+KKHESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE

Query:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSSG KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]    2.8e-90    73.25
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++TVR+ Y+RW KANEKAR YILAS+S+VL+KKHESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE

Query:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSSG KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]    2.8e-90    73.25
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++TVR+ Y+RW KANEKAR YILAS+S+VL+KKHESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE

Query:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSSG KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]    2.8e-90    73.25
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++TVR+ Y+RW KANEKAR YILAS+S+VL+KKHESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE

Query:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSSG KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]    2.8e-90    73.25
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++TVR+ Y+RW KANEKAR YILAS+S+VL+KKHESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE

Query:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSSG KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG

TrEMBL top hits    e value    % identity    Alignment
A0A5A7SMH8 Gag/pol protein    1.3e-90    73.25
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++TVR+ Y+RW KANEKAR YILAS+S+VL+KKHESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE

Query:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSSG KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG

A0A5A7TU93 Gag/pol protein    1.3e-90    73.25
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++TVR+ Y+RW KANEKAR YILAS+S+VL+KKHESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE

Query:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSSG KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG

A0A5A7TWB9 Gag/pol protein    1.3e-90    73.25
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++TVR+ Y+RW KANEKAR YILAS+S+VL+KKHESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE

Query:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSSG KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG

A0A5D3CPJ6 Gag/pol protein    1.3e-90    73.25
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++TVR+ Y+RW KANEKAR YILAS+S+VL+KKHESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE

Query:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSSG KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG

A0A5D3CSZ6 Gag/pol protein    1.3e-90    73.25
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE
        M+S+ + +L  +KL G+NYA+WK+ +N +L+IDDLRFVL+EECP  P  NA++TVR+ Y+RW KANEKAR YILAS+S+VL+KKHESM+TAR+IMDSLQE
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQE

Query:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG
        MFGQ S QI+H+A+KY+YNARM EG SVREHVL+MMV FNVAE NGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI YT+TTLLNELQTF+SLMK KG
Subjt:  MFGQPSIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKG

Query:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG
        QK GEANVA S ++F +GS+SGTKS PSSSG KK +KKKGG+G
Subjt:  QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG

SwissProt top hits    e value    % identity    Alignment
P04146 Copia protein    6.4e-05    24.84
Query:  GDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQEMFGQPSIQIRHEAIK
        G+ YA WK  +  +L   D+  V+    P            +  D W KA   A+  I+  +SD       S +TARQI+++L  ++ + S+  +    K
Subjt:  GDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQEMFGQPSIQIRHEAIK

Query:  YVYNARMKEGQSVRE--HVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSF
         + + ++    S+    H+ D ++   +A   GA I+E  ++S +L +LP  +
Subjt:  YVYNARMKEGQSVRE--HVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSF

Arabidopsis top hits    e value    % identity    Alignment
No hits found
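
In the alignments above, the middle line of each block shows a letter where query and subject residues are identical and `+` where they are merely similar. A rough sketch of how an identity percentage could be derived from such a midline follows. The function name `percent_identity` is hypothetical, and the denominator used here (the aligned segment length, gaps included) is an assumption; the values reported on this page may have been computed over a different length.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percentage of aligned columns whose midline character is a letter."""
    # Letters mark identical residues; '+' (similar) and ' ' (mismatch) do not count.
    identities = sum(ch.isalpha() for ch in midline)
    return 100.0 * identities / aligned_length


# Toy example: two identities ('M' and 'S') over four aligned columns.
toy = percent_identity("M+S ", 4)
print(toy)

# Last segment of the first GenBank alignment, copied from the record above.
query_segment = "QKEGEANVAHS-KRFQKGSSSGTKSAPSSSGTKKIQKKKGGKG"
midline_segment = "QK GEANVA S ++F +GS+SGTKS PSSSG KK +KKKGG+G"
print(round(percent_identity(midline_segment, len(query_segment)), 2))
```

This covers a single segment only; a whole-alignment figure would need the identities and lengths summed across all segments first.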

Sequences
CDS sequence
ATGTCTTCCTCAATTATAGCACTGCTTAAAAATGAAAAATTAACCGGTGATAATTATGCGACGTGGAAATCGAACCTGAATATGATTCTTGTAATCGATGATCTACGCTT
CGTCTTAATGGAGGAATGTCCTCCTTCCCCTGTTCGAAATGCATCCCAGACCGTCAGGGATGCATATGACCGCTGGACAAAGGCCAATGAAAAGGCCCGAGTCTACATCT
TGGCCAGTATGTCTGACGTACTGAGCAAGAAACATGAGTCCATGGTCACTGCACGTCAGATCATGGACTCCCTCCAGGAGATGTTTGGACAACCGTCCATTCAAATCCGG
CACGAGGCTATTAAATACGTTTATAATGCCCGTATGAAGGAAGGCCAATCTGTTAGAGAACATGTTCTCGACATGATGGTCCAATTCAACGTGGCAGAAACGAACGGGGC
GGTCATTGACGAGCAAAGTCAGGTATCCTTTATCTTAGAATCTCTTCCGAAGAGCTTCCTTCAATTCCGTAGCAATGCGGTTATGAACAAGATAGAGTATACCATGACTA
CCCTCCTTAACGAGTTGCAGACTTTTCAGTCTCTTATGAAAAATAAGGGACAGAAAGAAGGAGAGGCAAATGTTGCCCATTCCAAGAGGTTTCAGAAGGGTTCATCCTCT
GGAACCAAGTCTGCACCTTCATCTTCTGGGACTAAGAAAATCCAGAAGAAGAAAGGAGGAAAGGGGAAAGCTCACTTTGTATTTGATACAAACGCAATGATCCAACGCGT
TCGTGTAGGTGACATGCGAGTGAGGGTATCCCATGCAATGAGTTTGCATAAGACTGGACCACGAAATAGTAACCACTAG
mRNA sequence
ATGTCTTCCTCAATTATAGCACTGCTTAAAAATGAAAAATTAACCGGTGATAATTATGCGACGTGGAAATCGAACCTGAATATGATTCTTGTAATCGATGATCTACGCTT
CGTCTTAATGGAGGAATGTCCTCCTTCCCCTGTTCGAAATGCATCCCAGACCGTCAGGGATGCATATGACCGCTGGACAAAGGCCAATGAAAAGGCCCGAGTCTACATCT
TGGCCAGTATGTCTGACGTACTGAGCAAGAAACATGAGTCCATGGTCACTGCACGTCAGATCATGGACTCCCTCCAGGAGATGTTTGGACAACCGTCCATTCAAATCCGG
CACGAGGCTATTAAATACGTTTATAATGCCCGTATGAAGGAAGGCCAATCTGTTAGAGAACATGTTCTCGACATGATGGTCCAATTCAACGTGGCAGAAACGAACGGGGC
GGTCATTGACGAGCAAAGTCAGGTATCCTTTATCTTAGAATCTCTTCCGAAGAGCTTCCTTCAATTCCGTAGCAATGCGGTTATGAACAAGATAGAGTATACCATGACTA
CCCTCCTTAACGAGTTGCAGACTTTTCAGTCTCTTATGAAAAATAAGGGACAGAAAGAAGGAGAGGCAAATGTTGCCCATTCCAAGAGGTTTCAGAAGGGTTCATCCTCT
GGAACCAAGTCTGCACCTTCATCTTCTGGGACTAAGAAAATCCAGAAGAAGAAAGGAGGAAAGGGGAAAGCTCACTTTGTATTTGATACAAACGCAATGATCCAACGCGT
TCGTGTAGGTGACATGCGAGTGAGGGTATCCCATGCAATGAGTTTGCATAAGACTGGACCACGAAATAGTAACCACTAG
Protein sequence
MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTARQIMDSLQEMFGQPSIQIR
HEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYTMTTLLNELQTFQSLMKNKGQKEGEANVAHSKRFQKGSSS
GTKSAPSSSGTKKIQKKKGGKGKAHFVFDTNAMIQRVRVGDMRVRVSHAMSLHKTGPRNSNH
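
The protein sequence above should correspond to a translation of the CDS sequence. The sketch below shows one way to check that, assuming the standard genetic code (NCBI translation table 1); the record does not state which table was used, and the `translate` helper is illustrative rather than anything provided by CuGenDB. Only the first 21 bases and the last 9 bases of the CDS are used here.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons containing characters other than T, C, A, G become 'X'.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the CDS above; the protein above begins MSSSIIA.
print(translate("ATGTCTTCCTCAATTATAGCA"))  # MSSSIIA
# Last 9 bases of the CDS above, ending in the stop codon TAG.
print(translate("AACCACTAG"))  # NH
```

Joining the CDS lines into one string and passing it to `translate` would allow a full comparison against the protein sequence.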