CuGenDBv2

ClCG05G010550 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G010550
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Gag-pol polyprotein
Genome location: CG_Chr05:11812951..11814932
RNA-Seq Expression: ClCG05G010550
Synteny: ClCG05G010550
Gene Ontology terms: NA
InterPro domains: NA
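The genome location above follows a `sequence:start..end` pattern. A minimal sketch of splitting it into its parts and measuring the span is given here. It assumes the coordinates are 1-based and inclusive, which the record does not state, and `parse_location` and `span_length` are hypothetical helper names rather than anything provided by CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CG_Chr05:11812951..11814932")
print(seqid, start, end, span_length(start, end))
```

Under that coordinate assumption, the script prints `CG_Chr05 11812951 11814932 1982`.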


Homology
GenBank top hits | e-value | % identity (alignment shown below each hit)
AAO73529.1 gag-pol polyprotein [Glycine max] | 8.3e-04 | 31
Query:  MSKFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQDLPMSVPMSKTKEMPTSLVDHNVKKKWICHYYGKECHIRPYYFTL
        M+K + M+N  +D L+ +L  GK+V ++RG+GFN H+        + +  FV A++   +           +    + +KKW CHY GK  HI+P+ + L
Subjt:  MSKFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQDLPMSVPMSKTKEMPTSLVDHNVKKKWICHYYGKECHIRPYYFTL

KAA0043382.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 3.7e-04 | 29.08
Query:  MSKFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQDLPMSVPMSK-TKEMPTSLVDHNVKKKWICHYYGKECHIRPYYFT
        +SK V MM   T  L+ +L  GK+  DKRG+GF++    + + +  +  H   + D    +   K T++   S+   N +K+WIC++ GK  HIRPY + 
Subjt:  MSKFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQDLPMSVPMSK-TKEMPTSLVDHNVKKKWICHYYGKECHIRPYYFT

Query:  LDALATNSFVYKKRIELTIHKKTKKERYVVK--RVVLTQKY
        L +L     V  +   +  H+   + +  ++  +V LT  Y
Subjt:  LDALATNSFVYKKRIELTIHKKTKKERYVVK--RVVLTQKY

KAA0045252.1 gag-proteinase polyprotein [Cucumis melo var. makuwa] | 3.7e-04 | 31.82
Query:  KFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQDLPMSVPMSKTKEMPTSLVDHNVK-KKWICHYYGKECHIRPYYFTL-
        K V M+NS T++L++IL+ G+   ++ G+GF+   ++        +N   +   +P SV       M T +V  + K  KWICHY G++ HIRP+ + L 
Subjt:  KFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQDLPMSVPMSKTKEMPTSLVDHNVK-KKWICHYYGKECHIRPYYFTL-

Query:  -DAL---ATNSFVYKKRIELTIHKKTKKERYV
         D L    T     K +   T   K K+ R V
Subjt:  -DAL---ATNSFVYKKRIELTIHKKTKKERYV

PNX91973.1 gag-protease polyprotein [Trifolium pratense] | 4.4e-05 | 37.86
Query:  MSKFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQ---DLPMSVPMSKTKEMPTSLVDHNVKKKWICHYYGKECHIRPYY
        MSK V M+NS TD L  IL  G++  + +GIGFN       K   S +  FV ++   D  MS  M    +           K WICHY GK+ HI+P+ 
Subjt:  MSKFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQ---DLPMSVPMSKTKEMPTSLVDHNVKKKWICHYYGKECHIRPYY

Query:  FTL
        F L
Subjt:  FTL

XP_008444307.1 PREDICTED: uncharacterized protein LOC103487675 [Cucumis melo] | 1.3e-04 | 33.33
Query:  KFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQDLPMSVPMSKTKEMPTSLVDHNVK-KKWICHYYGKECHIRPYYFTL
        K V M+NS T++L++IL+ G+   ++ G+GF+   ++        +N   +   +P SV       M T +V  + K  KWICHY G++ HIRP+ + L
Subjt:  KFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQDLPMSVPMSKTKEMPTSLVDHNVK-KKWICHYYGKECHIRPYYFTL

TrEMBL top hits | e-value | % identity (alignment shown below each hit)
A0A1S3BAW0 uncharacterized protein LOC103487675 | 6.2e-05 | 33.33
Query:  KFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQDLPMSVPMSKTKEMPTSLVDHNVK-KKWICHYYGKECHIRPYYFTL
        K V M+NS T++L++IL+ G+   ++ G+GF+   ++        +N   +   +P SV       M T +V  + K  KWICHY G++ HIRP+ + L
Subjt:  KFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQDLPMSVPMSKTKEMPTSLVDHNVK-KKWICHYYGKECHIRPYYFTL

A0A2K3MMD3 Gag-protease polyprotein | 2.1e-05 | 37.86
Query:  MSKFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQ---DLPMSVPMSKTKEMPTSLVDHNVKKKWICHYYGKECHIRPYY
        MSK V M+NS TD L  IL  G++  + +GIGFN       K   S +  FV ++   D  MS  M    +           K WICHY GK+ HI+P+ 
Subjt:  MSKFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQ---DLPMSVPMSKTKEMPTSLVDHNVKKKWICHYYGKECHIRPYY

Query:  FTL
        F L
Subjt:  FTL

A0A5A7TJC1 Gag-pol polyprotein | 1.8e-04 | 29.08
Query:  MSKFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQDLPMSVPMSK-TKEMPTSLVDHNVKKKWICHYYGKECHIRPYYFT
        +SK V MM   T  L+ +L  GK+  DKRG+GF++    + + +  +  H   + D    +   K T++   S+   N +K+WIC++ GK  HIRPY + 
Subjt:  MSKFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQDLPMSVPMSK-TKEMPTSLVDHNVKKKWICHYYGKECHIRPYYFT

Query:  LDALATNSFVYKKRIELTIHKKTKKERYVVK--RVVLTQKY
        L +L     V  +   +  H+   + +  ++  +V LT  Y
Subjt:  LDALATNSFVYKKRIELTIHKKTKKERYVVK--RVVLTQKY

A0A5A7TPF7 Gag-proteinase polyprotein | 1.8e-04 | 31.82
Query:  KFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQDLPMSVPMSKTKEMPTSLVDHNVK-KKWICHYYGKECHIRPYYFTL-
        K V M+NS T++L++IL+ G+   ++ G+GF+   ++        +N   +   +P SV       M T +V  + K  KWICHY G++ HIRP+ + L 
Subjt:  KFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQDLPMSVPMSKTKEMPTSLVDHNVK-KKWICHYYGKECHIRPYYFTL-

Query:  -DAL---ATNSFVYKKRIELTIHKKTKKERYV
         D L    T     K +   T   K K+ R V
Subjt:  -DAL---ATNSFVYKKRIELTIHKKTKKERYV

Q84VH6 Gag-pol polyprotein | 4.0e-04 | 31
Query:  MSKFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQDLPMSVPMSKTKEMPTSLVDHNVKKKWICHYYGKECHIRPYYFTL
        M+K + M+N  +D L+ +L  GK+V ++RG+GFN H+        + +  FV A++   +           +    + +KKW CHY GK  HI+P+ + L
Subjt:  MSKFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQDLPMSVPMSKTKEMPTSLVDHNVKKKWICHYYGKECHIRPYYFTL

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found
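The alignments above use the usual BLAST-style layout, where a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the identity of a single alignment block can be estimated from the query line and the middle line alone. The function name `percent_identity` is hypothetical, and dividing by the number of aligned columns (gaps included) is an assumption about how the percentage is defined.

```python
def percent_identity(query, midline):
    """Percent of aligned columns whose midline character is a letter."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment block")
    # A letter on the midline marks an identical residue; '+' and ' ' do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    # ASSUMPTION: the denominator is the number of aligned columns, gaps included.
    return 100.0 * identical / len(query)


# Final block of the PNX91973.1 alignment: query "FTL", midline "F L".
print(round(percent_identity("FTL", "F L"), 2))
```

The % identity reported for each hit covers the whole alignment, so a value computed for one block (66.67 for the three-column block used here) will generally differ from it.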

Sequences
CDS sequence
ATGTCAAAGTTTGTTTGCATGATGAACTCAAGCACCGATGATCTTAACATGATTCTTTCCTTTGGCAAGCAGGTTTTTGATAAGCGTGGCATTGGATTCAATCAGCATAG
GAAGCAGTTCTACAAAGGTGAATCATCCCTATTAAATCATTTTGTTCAAGCCCAGGATTTACCAATGTCGGTTCCAATGTCAAAGACAAAGGAAATGCCAACATCTCTAG
TGGATCATAATGTAAAGAAGAAATGGATTTGTCATTATTATGGCAAAGAATGTCATATTCGTCCCTATTATTTCACCCTGGATGCCCTTGCTACAAATTCATTTGTCTAT
AAGAAGAGAATTGAGCTTACGATACATAAGAAAACTAAGAAGGAAAGGTATGTTGTTAAACGAGTAGTTCTAACACAAAAATACCTCATCGATCATGTCATTGAGTTCAC
CAGGTATGTTTCTATGGGTATTTCATATAAGTTCTCCGCCTTTCCTAAAGATCTTGAAGGTAATACATCTACAACTAAATCTACTTTTTCTCTTGAGAAGCCTAGTAATT
TGACTTCTCTTGACTTGGTTGTGGTTACTGAAGAAACTAAGGTTGTGAGTGGTGACTAA
mRNA sequence
ATGTCAAAGTTTGTTTGCATGATGAACTCAAGCACCGATGATCTTAACATGATTCTTTCCTTTGGCAAGCAGGTTTTTGATAAGCGTGGCATTGGATTCAATCAGCATAG
GAAGCAGTTCTACAAAGGTGAATCATCCCTATTAAATCATTTTGTTCAAGCCCAGGATTTACCAATGTCGGTTCCAATGTCAAAGACAAAGGAAATGCCAACATCTCTAG
TGGATCATAATGTAAAGAAGAAATGGATTTGTCATTATTATGGCAAAGAATGTCATATTCGTCCCTATTATTTCACCCTGGATGCCCTTGCTACAAATTCATTTGTCTAT
AAGAAGAGAATTGAGCTTACGATACATAAGAAAACTAAGAAGGAAAGGTATGTTGTTAAACGAGTAGTTCTAACACAAAAATACCTCATCGATCATGTCATTGAGTTCAC
CAGGTATGTTTCTATGGGTATTTCATATAAGTTCTCCGCCTTTCCTAAAGATCTTGAAGGTAATACATCTACAACTAAATCTACTTTTTCTCTTGAGAAGCCTAGTAATT
TGACTTCTCTTGACTTGGTTGTGGTTACTGAAGAAACTAAGGTTGTGAGTGGTGACTAA
Protein sequence
MSKFVCMMNSSTDDLNMILSFGKQVFDKRGIGFNQHRKQFYKGESSLLNHFVQAQDLPMSVPMSKTKEMPTSLVDHNVKKKWICHYYGKECHIRPYYFTLDALATNSFVY
KKRIELTIHKKTKKERYVVKRVVLTQKYLIDHVIEFTRYVSMGISYKFSAFPKDLEGNTSTTKSTFSLEKPSNLTSLDLVVVTEETKVVSGD
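The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code, which the record does not state explicitly. `CODON_TABLE` and `translate` are illustrative names, and only the first ten and last three codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS listed above.
print(translate("ATGTCAAAGTTTGTTTGCATGATGAACTCA"))
# Last 9 nt of the CDS, including the TAA stop codon.
print(translate("GGTGACTAA"))
```

The two calls print `MSKFVCMMNS` and `GD`, matching the start and end of the protein sequence shown above.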