; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG08G004040 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG08G004040
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionGag-pol polyprotein
Genome locationCG_Chr08:12790655..12793959
RNA-Seq ExpressionClCG08G004040
SyntenyClCG08G004040
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051793.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.2e-0439.53Show/hide
Query:  SIEKSYHMLYQQWLEDVKVLDVQKERIKTLLADNHRLVTTVSKLKCELNTTKSELESMHKSVKMLNSRKNELDKILSQDQDYSISK
        S +  Y  L  +W ED +V  VQKERI+ LL DNH L++ +  LK +L   ++E +S+ KSV++L  +   LD +LS+ +  S  K
Subjt:  SIEKSYHMLYQQWLEDVKVLDVQKERIKTLLADNHRLVTTVSKLKCELNTTKSELESMHKSVKMLNSRKNELDKILSQDQDYSISK

TrEMBL top hitse value%identityAlignment
A0A1S4E1H2 uncharacterized protein LOC1079915296.4e-0444.16Show/hide
Query:  KSIEKSYHMLYQQWLEDVKVLDVQKERIKTLLADNHRLVTTVSKLKCELNTTKSELESMHKSVKMLNSRKNELDKIL
        K+ E S   L   W ED +   +QKERI+ LL +N  L++ +S LK +L    +E + + KSVKMLNSR + LD IL
Subjt:  KSIEKSYHMLYQQWLEDVKVLDVQKERIKTLLADNHRLVTTVSKLKCELNTTKSELESMHKSVKMLNSRKNELDKIL

A0A5A7UHW4 Gag-pol polyprotein6.4e-0435.58Show/hide
Query:  WLEDVKVLDVQKERIKTLLADNHRLVTTVSKLKCELNTTKSELESMHKSVKMLNSRKNELDKILSQDQDYSISKHNEVMHLSKRFDRSYVNKKKISCTIN
        W ED   + +QKERI+ L+ +N RL++ +  LK +L   + E +   KSVKMLNSR   LD IL+  Q+  ++K+      S R   +    K +  ++N
Subjt:  WLEDVKVLDVQKERIKTLLADNHRLVTTVSKLKCELNTTKSELESMHKSVKMLNSRKNELDKILSQDQDYSISKHNEVMHLSKRFDRSYVNKKKISCTIN

Query:  KKID
         K D
Subjt:  KKID

A0A5A7VJQ7 Gag-pol polyprotein6.4e-0439.29Show/hide
Query:  KSIEKSYHMLYQQWLEDVKVLDVQKERIKTLLADNHRLVTTVSKLKCELNTTKSELESMHKSVKMLNSRKNELDKILSQDQDYS
        K+ + S   L   W ED +   +QKERI+ L+ +N RL++ +S LK +L   ++E + + KSVKMLNS    LD IL+ + + S
Subjt:  KSIEKSYHMLYQQWLEDVKVLDVQKERIKTLLADNHRLVTTVSKLKCELNTTKSELESMHKSVKMLNSRKNELDKILSQDQDYS

A0A5D3D857 Receptor-like protein 128.4e-0445.07Show/hide
Query:  YHMLYQQWLEDVKVLDVQKERIKTLLADNHRLVTTVSKLKCELNTTKSELESMHKSVKMLNSRKNELDKIL
        +  L   W ED ++L   +ER  TL  DNHRL+TT+S LK E    +SE E++ KSV+ML+     LD IL
Subjt:  YHMLYQQWLEDVKVLDVQKERIKTLLADNHRLVTTVSKLKCELNTTKSELESMHKSVKMLNSRKNELDKIL

A0A5D3DD57 Gag-pol polyprotein5.8e-0539.53Show/hide
Query:  SIEKSYHMLYQQWLEDVKVLDVQKERIKTLLADNHRLVTTVSKLKCELNTTKSELESMHKSVKMLNSRKNELDKILSQDQDYSISK
        S +  Y  L  +W ED +V  VQKERI+ LL DNH L++ +  LK +L   ++E +S+ KSV++L  +   LD +LS+ +  S  K
Subjt:  SIEKSYHMLYQQWLEDVKVLDVQKERIKTLLADNHRLVTTVSKLKCELNTTKSELESMHKSVKMLNSRKNELDKILSQDQDYSISK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATAAGTCCATTGAAAAGTCTTATCATATGTTATATCAACAATGGCTTGAGGATGTAAAAGTCTTGGATGTTCAAAAAGAAAGAATCAAGACTCTTCTTGCTGACAA
TCATAGACTCGTGACCACTGTCTCTAAGCTTAAATGCGAGTTGAACACGACCAAAAGTGAATTGGAATCTATGCATAAATCCGTAAAGATGTTGAATTCTAGAAAAAATG
AGCTTGATAAGATTTTGTCTCAAGATCAAGACTATAGCATCTCTAAGCACAACGAAGTTATGCACTTGTCCAAGAGATTTGATCGATCCTATGTTAACAAGAAGAAAATT
AGTTGTACGATAAACAAGAAAATTGATAAGGAAATGTATGAAGTTAGACAGATTGTTATTACGACATTGGAAGACCTTAGATGTCATGTCACTAAGTCCACCAGTACTTC
TCTAGAGAAATCTAGAAATTTGACTCCTTTGCATGTGAGTTCTGAAGAAACTAGAGTTGTGAATGCTGACCAAATTACCTATAATGTTGACAGTACTATAATGGAGTCTA
TGAATTTTGTTGATCATGTTGTACTTATTGAGGTTGTTGACCTTGTTAATCATGTTGATTATCATGTTGAGTTGTCTAATATTTTGATTATTGCTCTATCTAAAACCTCT
ATTGGTAATGTTGATTTGATTGTCTCCTCTGTTATTGATCATACTACTCCTACTGAAACTATTGTTCTCTGTTGTTGCTTTCATTTTGATGTCCTCATACAGCCTATCTT
TAGTAGTGTTGCGATTAATTTGGTGAATTATATTGATCCCATTGTGCCCACTAAGTTTGTTGTTACTTCTACTGAAACTATGATTGAACATATTATGGCGGGTGAGATAC
TTATTGAGCTTTATGTGTTGTGTGAAGTCATGATTGATAATACTATTTTGGATGAATCTTTGGTTCTGTGCCATTTCCTCATTCTCATGATGTCGGAACAAATGTGCAAA
ATTATAAAGGAACACATTAAGACACATAAGTCTCTTGACCTTGAAAAGGACATTGATAATGCTTCTGCCGATACTGAAGATCTGAATGCTACTACAAATTGTTCTATTGT
TTTAGATGTGACTGAGGAAATGAATGTTGATAGTGGTTTGGTTGATCAAACAGTTGTAACTACTGATGGTCAAGATGCTTGTAAGAACACTATTGTTGAAAATGTTGACA
GACGTCTCAAATATTGGTCGCATGATGGTGTTGATAAGTCTACTGTTAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCATAAGTCCATTGAAAAGTCTTATCATATGTTATATCAACAATGGCTTGAGGATGTAAAAGTCTTGGATGTTCAAAAAGAAAGAATCAAGACTCTTCTTGCTGACAA
TCATAGACTCGTGACCACTGTCTCTAAGCTTAAATGCGAGTTGAACACGACCAAAAGTGAATTGGAATCTATGCATAAATCCGTAAAGATGTTGAATTCTAGAAAAAATG
AGCTTGATAAGATTTTGTCTCAAGATCAAGACTATAGCATCTCTAAGCACAACGAAGTTATGCACTTGTCCAAGAGATTTGATCGATCCTATGTTAACAAGAAGAAAATT
AGTTGTACGATAAACAAGAAAATTGATAAGGAAATGTATGAAGTTAGACAGATTGTTATTACGACATTGGAAGACCTTAGATGTCATGTCACTAAGTCCACCAGTACTTC
TCTAGAGAAATCTAGAAATTTGACTCCTTTGCATGTGAGTTCTGAAGAAACTAGAGTTGTGAATGCTGACCAAATTACCTATAATGTTGACAGTACTATAATGGAGTCTA
TGAATTTTGTTGATCATGTTGTACTTATTGAGGTTGTTGACCTTGTTAATCATGTTGATTATCATGTTGAGTTGTCTAATATTTTGATTATTGCTCTATCTAAAACCTCT
ATTGGTAATGTTGATTTGATTGTCTCCTCTGTTATTGATCATACTACTCCTACTGAAACTATTGTTCTCTGTTGTTGCTTTCATTTTGATGTCCTCATACAGCCTATCTT
TAGTAGTGTTGCGATTAATTTGGTGAATTATATTGATCCCATTGTGCCCACTAAGTTTGTTGTTACTTCTACTGAAACTATGATTGAACATATTATGGCGGGTGAGATAC
TTATTGAGCTTTATGTGTTGTGTGAAGTCATGATTGATAATACTATTTTGGATGAATCTTTGGTTCTGTGCCATTTCCTCATTCTCATGATGTCGGAACAAATGTGCAAA
ATTATAAAGGAACACATTAAGACACATAAGTCTCTTGACCTTGAAAAGGACATTGATAATGCTTCTGCCGATACTGAAGATCTGAATGCTACTACAAATTGTTCTATTGT
TTTAGATGTGACTGAGGAAATGAATGTTGATAGTGGTTTGGTTGATCAAACAGTTGTAACTACTGATGGTCAAGATGCTTGTAAGAACACTATTGTTGAAAATGTTGACA
GACGTCTCAAATATTGGTCGCATGATGGTGTTGATAAGTCTACTGTTAATTAA
Protein sequenceShow/hide protein sequence
MHKSIEKSYHMLYQQWLEDVKVLDVQKERIKTLLADNHRLVTTVSKLKCELNTTKSELESMHKSVKMLNSRKNELDKILSQDQDYSISKHNEVMHLSKRFDRSYVNKKKI
SCTINKKIDKEMYEVRQIVITTLEDLRCHVTKSTSTSLEKSRNLTPLHVSSEETRVVNADQITYNVDSTIMESMNFVDHVVLIEVVDLVNHVDYHVELSNILIIALSKTS
IGNVDLIVSSVIDHTTPTETIVLCCCFHFDVLIQPIFSSVAINLVNYIDPIVPTKFVVTSTETMIEHIMAGEILIELYVLCEVMIDNTILDESLVLCHFLILMMSEQMCK
IIKEHIKTHKSLDLEKDIDNASADTEDLNATTNCSIVLDVTEEMNVDSGLVDQTVVTTDGQDACKNTIVENVDRRLKYWSHDGVDKSTVN