; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0026952 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0026952
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionGag/pol protein
Genome locationchr03:15458991..15459452
RNA-Seq ExpressionPI0026952
SyntenyPI0026952
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0063887.1 gag/pol protein [Cucumis melo var. makuwa]2.8e-6079.08Show/hide
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE
        MSSSIIALLK ++LTG+NYATWKS LNMILVI DLRFVLMEECPP P +NASQ+V+DAYD WTKAN+KA +Y+LAS+SD+LSKKHE MVT RQIMDSL+E
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE

Query:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS
        MFGQ SIQI+ EAIKYVYNARMKEGQSVREHVL M+V FNVAE N  I DE+S
Subjt:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS

TYK00843.1 gag/pol protein [Cucumis melo var. makuwa]4.9e-5776.47Show/hide
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE
        MSSSIIALLK E LTG+NYA WKS LNMILVI DLRF+LMEECPP P +NAS+++RDAY+R TKAN+KA +YILASMSD+LSKKHE MVT RQIMDS +E
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE

Query:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS
        +F Q SIQI+ EAIKYVYNA MKEGQSVREH LDM+V FNVAE NG +IDE+S
Subjt:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS

TYK15919.1 gag/pol protein [Cucumis melo var. makuwa]2.8e-6079.08Show/hide
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE
        MSSSIIALLK ++LTG+NYATWKS LNMILVI DLRFVLMEECPP P +NASQ+V+DAYD WTKAN+KA +Y+LAS+SD+LSKKHE MVT RQIMDSL+E
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE

Query:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS
        MFGQ SIQI+ EAIKYVYNARMKEGQSVREHVL M+V FNVAE N  I DE+S
Subjt:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia]2.4e-5672.55Show/hide
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE
        MS+SIIALL  +KL G+NY  WKSNLN ILVIDDLRFVL E+CP +PV NA+  VR+AYDRW K+N+KA+VYILAS+SDVL+KKHE  VTT++IMDSLQ 
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE

Query:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS
        MFGQ S+Q RHEA+K+VYN+RMKEG SVREHVL++MV FNVAE+NG +IDEQS
Subjt:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS

XP_022158568.1 uncharacterized protein LOC111025021 [Momordica charantia]7.1e-5672.55Show/hide
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE
        MS+S I LL ++KL GDNY  WKSNLN ILVIDDLRFVL EECPP+P  NA++TVRDAYDRW KANEKARVYILAS+S+VLSKKHE + TTR+IMDSLQ 
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE

Query:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS
        +FGQ S  + H+A+KYVYN RMKEG SVREHVL+MMV FNVAE N T+++E S
Subjt:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS

TrEMBL top hitse value%identityAlignment
A0A5A7VA67 Gag/pol protein1.3e-6079.08Show/hide
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE
        MSSSIIALLK ++LTG+NYATWKS LNMILVI DLRFVLMEECPP P +NASQ+V+DAYD WTKAN+KA +Y+LAS+SD+LSKKHE MVT RQIMDSL+E
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE

Query:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS
        MFGQ SIQI+ EAIKYVYNARMKEGQSVREHVL M+V FNVAE N  I DE+S
Subjt:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS

A0A5D3BM47 Gag/pol protein2.4e-5776.47Show/hide
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE
        MSSSIIALLK E LTG+NYA WKS LNMILVI DLRF+LMEECPP P +NAS+++RDAY+R TKAN+KA +YILASMSD+LSKKHE MVT RQIMDS +E
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE

Query:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS
        +F Q SIQI+ EAIKYVYNA MKEGQSVREH LDM+V FNVAE NG +IDE+S
Subjt:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS

A0A5D3D0D9 Gag/pol protein1.3e-6079.08Show/hide
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE
        MSSSIIALLK ++LTG+NYATWKS LNMILVI DLRFVLMEECPP P +NASQ+V+DAYD WTKAN+KA +Y+LAS+SD+LSKKHE MVT RQIMDSL+E
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE

Query:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS
        MFGQ SIQI+ EAIKYVYNARMKEGQSVREHVL M+V FNVAE N  I DE+S
Subjt:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS

A0A6J1DWG6 uncharacterized protein LOC1110250213.4e-5672.55Show/hide
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE
        MS+S I LL ++KL GDNY  WKSNLN ILVIDDLRFVL EECPP+P  NA++TVRDAYDRW KANEKARVYILAS+S+VLSKKHE + TTR+IMDSLQ 
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE

Query:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS
        +FGQ S  + H+A+KYVYN RMKEG SVREHVL+MMV FNVAE N T+++E S
Subjt:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS

A0A6J1DWL0 uncharacterized protein LOC1110247341.2e-5672.55Show/hide
Query:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE
        MS+SIIALL  +KL G+NY  WKSNLN ILVIDDLRFVL E+CP +PV NA+  VR+AYDRW K+N+KA+VYILAS+SDVL+KKHE  VTT++IMDSLQ 
Subjt:  MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQE

Query:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS
        MFGQ S+Q RHEA+K+VYN+RMKEG SVREHVL++MV FNVAE+NG +IDEQS
Subjt:  MFGQASIQIRHEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCCTCAATTATAGCACTGCTTAAAAATGAAAAATTAACCGGTGATAATTATGCAACGTGGAAATCGAACCTGAATATGATTCTTGTAATCGATGATCTACGCTT
CGTCTTAATGGAGGAATGTCCTCCTTCCCCTGTTCGAAATGCATCCCAGACCGTCAGGGATGCATATGACCGCTGGACAAAGGCCAATGAAAAGGCCCGAGTTTACATCT
TGGCCAGTATGTCTGACGTATTGAGCAAGAAACATGAGTCCATGGTCACTACACGTCAGATCATGGACTCCCTCCAGGAGATGTTTGGACAAGCGTCCATTCAAATCCGG
CACGAGGCTATTAAATACGTTTATAATGCCCGTATGAAGGAAGGCCAATCTGTTAGAGAACATGTTCTCGACATGATGGTCCAATTCAACGTGGCAGAAACGAACGGGAC
AATCATTGACGAGCAAAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCCTCAATTATAGCACTGCTTAAAAATGAAAAATTAACCGGTGATAATTATGCAACGTGGAAATCGAACCTGAATATGATTCTTGTAATCGATGATCTACGCTT
CGTCTTAATGGAGGAATGTCCTCCTTCCCCTGTTCGAAATGCATCCCAGACCGTCAGGGATGCATATGACCGCTGGACAAAGGCCAATGAAAAGGCCCGAGTTTACATCT
TGGCCAGTATGTCTGACGTATTGAGCAAGAAACATGAGTCCATGGTCACTACACGTCAGATCATGGACTCCCTCCAGGAGATGTTTGGACAAGCGTCCATTCAAATCCGG
CACGAGGCTATTAAATACGTTTATAATGCCCGTATGAAGGAAGGCCAATCTGTTAGAGAACATGTTCTCGACATGATGGTCCAATTCAACGTGGCAGAAACGAACGGGAC
AATCATTGACGAGCAAAGTTAG
Protein sequenceShow/hide protein sequence
MSSSIIALLKNEKLTGDNYATWKSNLNMILVIDDLRFVLMEECPPSPVRNASQTVRDAYDRWTKANEKARVYILASMSDVLSKKHESMVTTRQIMDSLQEMFGQASIQIR
HEAIKYVYNARMKEGQSVREHVLDMMVQFNVAETNGTIIDEQS