Core Workloads

Sean Busbey edited this page on Nov 30, 2020 · 6 revisions

YCSB includes a set of core workloads that define a basic benchmark for cloud systems. You can also define your own workloads, as described in Implementing New Workloads. The core workloads are a useful first step, though: running them against a variety of systems lets you compare the performance tradeoffs between those systems.

There are six core workloads:

Workload A: Update heavy workload

This workload has a 50/50 mix of reads and writes. Application example: a session store recording recent actions. Updates in this workload do not presume that you read the original record first. The assumption is that every update targets a record that already exists and often writes only a subset of that record's fields. Some data stores must read the underlying record to reconcile what the final record should look like, but not all do.

Workload B: Read mostly workload

This workload has a 95/5 read/write mix. Application example: photo tagging; adding a tag is an update, but most operations read tags. As with Workload A, these writes do not presume that you read the original record before writing to it.

Workload C: Read only

This workload is 100% read. Application example: user profile cache, where profiles are constructed elsewhere (e.g., Hadoop).
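The read/update proportions of workloads A, B, and C can be illustrated with a small weighted-choice sketch. This is not YCSB's own operation chooser; the names `MIXES` and `choose_operations` are hypothetical, and only the proportions come from the descriptions above.

```python
import random
from collections import Counter

# Read/update proportions as described in the text for workloads A, B, and C.
MIXES = {
    "A": {"read": 0.50, "update": 0.50},
    "B": {"read": 0.95, "update": 0.05},
    "C": {"read": 1.00},
}


def choose_operations(workload, n, seed=0):
    """Draw n operation names according to the workload's proportions."""
    mix = MIXES[workload]
    rng = random.Random(seed)  # seeded so that runs are repeatable
    return rng.choices(list(mix), weights=list(mix.values()), k=n)


if __name__ == "__main__":
    for name in MIXES:
        counts = Counter(choose_operations(name, 10_000))
        print(name, dict(counts))
```

Running the script prints the observed operation counts per workload, which should sit close to the stated proportions.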

Workload D: Read latest workload

In this workload, new records are inserted, and the most recently inserted records are the most popular. Application example: user status updates; people want to read the latest.
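A minimal sketch of the "read latest" access pattern follows. It assumes a geometric skew toward the newest key, which is a stand-in chosen for brevity and not necessarily the distribution YCSB uses. The function names, the `skew` parameter, and the 5% insert fraction are all hypothetical.

```python
import random
import statistics


def pick_recent_key(latest_key, rng, skew=0.2):
    """Pick a key in [0, latest_key], favouring keys close to latest_key.

    The offset back from the newest key follows a (truncated) geometric
    distribution, so offset 0 (the newest record) is the most likely.
    """
    offset = 0
    while offset < latest_key and rng.random() > skew:
        offset += 1
    return latest_key - offset


def simulate(initial_records=1000, operations=10_000,
             insert_fraction=0.05, seed=0):
    """Mix inserts and reads; return the age of each read and the newest key.

    insert_fraction is an assumed value for illustration only.
    """
    rng = random.Random(seed)
    latest_key = initial_records - 1
    ages = []  # distance of each read from the newest record at that moment
    for _ in range(operations):
        if rng.random() < insert_fraction:
            latest_key += 1  # a newly inserted record becomes the newest
        else:
            ages.append(latest_key - pick_recent_key(latest_key, rng))
    return ages, latest_key


if __name__ == "__main__":
    ages, latest = simulate()
    print("median read age:", statistics.median(ages))
    print("records after run:", latest + 1)
```

The median read age stays small even as the key space grows, which is the property the workload is meant to exercise.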

Workload E: Short ranges

In this workload, short ranges of records are queried, instead of individual records. Application example: threaded conversations, where each scan is for the posts in a given thread (assumed to be clustered by thread id).
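To make the scan operation concrete, here is a rough sketch of a short range query over an ordered key space, using the threaded-conversation example. `SortedStore` and its methods are hypothetical and stand in for whatever ordered datastore is under test. Keys are assumed to be `(thread_id, post_id)` tuples so that the posts of a thread are clustered together.

```python
import bisect


class SortedStore:
    """Toy ordered key-value store supporting short range scans."""

    def __init__(self):
        self._keys = []    # keys kept in sorted order
        self._values = {}  # key -> record

    def insert(self, key, value):
        if key not in self._values:
            bisect.insort(self._keys, key)
        self._values[key] = value

    def scan(self, start_key, record_count):
        """Return up to record_count records with key >= start_key, in order."""
        i = bisect.bisect_left(self._keys, start_key)
        return [(k, self._values[k]) for k in self._keys[i:i + record_count]]


if __name__ == "__main__":
    store = SortedStore()
    for thread_id in (1, 2, 3):
        for post_id in range(3):
            store.insert((thread_id, post_id), f"post {post_id} in thread {thread_id}")
    # Read the three posts of thread 2 with a single short scan.
    for key, text in store.scan((2, 0), 3):
        print(key, text)
```

Because the scan is defined by a start key and a record count, a count larger than the thread would spill into the next thread's posts.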

Workload F: Read-modify-write

In this workload, the client reads a record, modifies it, and writes back the changes. Application example: a user database, where user records are read and modified by the user or to record user activity. Unlike the updates in workloads A and B, every write here is preceded by a read of the record from the underlying datastore, so all datastores must read the record before accepting a write for it. At the moment the write uses a random delta rather than a value derived from the current record (say, incrementing a counter). That can make the workload a bit harder to follow, since the initial read seems unnecessary.
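The difference between the blind updates of workloads A and B and the read-modify-write of workload F can be sketched against a plain dictionary. This is an illustration only, not YCSB's client code; `blind_update` and `read_modify_write` are hypothetical names, and the store is assumed to map keys to dictionaries of fields.

```python
import random


def blind_update(store, key, fields):
    """Workload A/B style: write a subset of fields without reading first."""
    store[key].update(fields)


def read_modify_write(store, key, rng):
    """Workload F style: read the record, then write a changed field back.

    As the text notes, the new value is random rather than derived from
    the record that was just read.
    """
    record = dict(store[key])             # read: copy of the current record
    field = rng.choice(sorted(record))    # pick one field to change
    new_value = rng.randrange(1_000_000)  # random delta, unrelated to the read
    store[key][field] = new_value         # write the change back
    return record, {field: new_value}


if __name__ == "__main__":
    store = {"user1": {"field0": 1, "field1": 2}}
    blind_update(store, "user1", {"field0": 10})
    before, delta = read_modify_write(store, "user1", random.Random(0))
    print("read:", before)
    print("wrote:", delta)
```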

Running the workloads

All six workloads use a similar data set. Workloads D and E insert records during the test run. Thus, to keep the database size consistent, we recommend the following sequence:

1. Load the database, using workload A’s parameter file (workloads/workloada) and the “-load” switch to the client.
2. Run workload A (using workloads/workloada and “-t”) for a variety of throughputs.
3. Run workload B (using workloads/workloadb and “-t”) for a variety of throughputs.
4. Run workload C (using workloads/workloadc and “-t”) for a variety of throughputs.
5. Run workload F (using workloads/workloadf and “-t”) for a variety of throughputs.
6. Run workload D (using workloads/workloadd and “-t”) for a variety of throughputs. This workload inserts records, increasing the size of the database.
7. Delete the data in the database.
8. Reload the database, using workload E’s parameter file (workloads/workloade) and the “-load” switch to the client.
9. Run workload E (using workloads/workloade and “-t”) for a variety of throughputs. This workload inserts records, increasing the size of the database.
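The sequence above can be written down as data, which makes it easy to script. The sketch below only builds argument lists and does not launch anything. The `-load` and `-t` switches and the parameter file paths come from the steps above; the `-P` and `-target` flags are not named in this page and are assumed here from the YCSB client's usual command line, so check them against your YCSB version. `PLAN`, `client_args`, and the target throughputs are hypothetical.

```python
# Each step: (description, parameter file or None, client switch or None).
PLAN = [
    ("load", "workloads/workloada", "-load"),
    ("run A", "workloads/workloada", "-t"),
    ("run B", "workloads/workloadb", "-t"),
    ("run C", "workloads/workloadc", "-t"),
    ("run F", "workloads/workloadf", "-t"),
    ("run D", "workloads/workloadd", "-t"),   # inserts records
    ("delete data", None, None),              # manual, datastore-specific step
    ("reload", "workloads/workloade", "-load"),
    ("run E", "workloads/workloade", "-t"),   # inserts records
]


def client_args(step, target=None):
    """Render client arguments for one step; None for manual steps.

    -P (parameter file) and -target (throughput) are assumed flag names.
    """
    _, param_file, switch = step
    if switch is None:
        return None
    args = [switch, "-P", param_file]
    if target is not None and switch == "-t":
        args += ["-target", str(target)]
    return args


if __name__ == "__main__":
    targets = [500, 1000]  # hypothetical throughputs, in operations per second
    for step in PLAN:
        if step[2] == "-t":
            for target in targets:
                print(step[0], client_args(step, target))
        else:
            print(step[0], client_args(step))
```

Keeping workload D after A, B, C, and F, and reloading before E, preserves the ordering rationale given above: the workloads that insert records run last on each loaded data set.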
