Distributed database project using CockroachDB

adamwth/dist-db

CS4224 Team A (CockroachDB)

Requirements

  • Python 3.6 (assumes the python3 and pip3 aliases are available)
  • CockroachDB v19.2.9 cluster on the xcnc machines
    • Run in insecure mode; the scripts connect as the root user

Environment Setup

The only dependency is psycopg2==2.8.6 (listed in requirements.txt).

It can be installed with pip3 install -r requirements.txt --user.

Scripts

setup.py

Sets up the cluster for an experiment by dropping any existing tables and loading the initial data using IMPORT INTO.

For CockroachDB v19.2.9, this requires the CSV data files to be present on every node under {node store}/extern/data-files/.
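The import step can be sketched as follows. This is illustrative only: the table name, columns and file name below are hypothetical (the real schema is in create-tables.sql), and it relies on CockroachDB v19.2 resolving nodelocal:/// paths against the {node store}/extern/ directory on each node.

```python
def build_import_stmt(table, columns, csv_file):
    """Build an IMPORT INTO statement for a CSV file placed under
    {node store}/extern/data-files/ on every node. nodelocal:/// paths
    are resolved against the extern directory in CockroachDB v19.2."""
    cols = ", ".join(columns)
    return (
        f"IMPORT INTO {table} ({cols}) "
        f"CSV DATA ('nodelocal:///data-files/{csv_file}')"
    )

# Hypothetical example: import a warehouse table with two columns.
stmt = build_import_stmt("warehouse", ["w_id", "w_name"], "warehouse.csv")
# The statement would then be executed through psycopg2, e.g.
# conn.cursor().execute(stmt)
```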

Execution parameters

Run with python3 setup.py -hn <host number> -p <port>.

  • Example: python3 setup.py -hn 0 -p 26257 (run setup on node at xcnc0.comp.nus.edu.sg:26257)

client.py

Runs the transactions in the given transaction file, writing metrics to stderr and transaction output to stdout on completion.

Execution parameters

Run with python3 client.py <transaction file> -hn <host number> -p <port>

  • Example: python3 client.py xact-files/1.txt -hn 0 -p 26257 (runs the transactions in 1.txt against the node at xcnc0.comp.nus.edu.sg:26257)
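Since CockroachDB aborts contending transactions with retryable errors, and the stats output records transactions that were retried excessively, the client presumably wraps each transaction in a retry loop. A generic sketch of such a loop (RetryableError stands in for psycopg2's serialization-failure error, SQLSTATE 40001; the real client's logic may differ):

```python
class RetryableError(Exception):
    """Stand-in for a retryable error such as psycopg2's
    serialization failure (SQLSTATE 40001)."""

def run_with_retry(txn_fn, max_retries=10):
    """Run txn_fn, retrying on RetryableError up to max_retries times.
    Returns (result, attempts) so the caller can report transactions
    that were retried excessively."""
    for attempt in range(1, max_retries + 1):
        try:
            return txn_fn(), attempt
        except RetryableError:
            if attempt == max_retries:
                raise  # give up after max_retries attempts
```

The attempt count returned alongside the result is what would feed the "retried excessively" reporting in {i}_stats.out.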

output_state.py

Outputs the state of the database (the 15 statistics listed in the project description) into the given file.

Execution parameters

Run with python3 output_state.py <output file> -hn <host number> -p <port>

  • Example: python3 output_state.py experiment1.out -hn 0 -p 26257 (output state as comma separated values into experiment1.out, querying node at xcnc0.comp.nus.edu.sg:26257)
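The comma-separated output could be produced along these lines (a minimal sketch; the 15 real statistics come from the project description, and the rows passed in here are hypothetical):

```python
import csv
import io

def state_csv(rows):
    """Render database-state statistics as comma-separated values.
    `rows` is a list of (name, value) rows; the real script would
    populate it from the 15 statistics queried off the cluster."""
    buf = io.StringIO()
    csv.writer(buf).writerows(rows)
    return buf.getvalue()
```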

aggregate-metrics.py

Execution parameters

Run with python3 aggregate-metrics.py.

This script generates up to three files, depending on which functions are called in its main function.

  1. Calling write_aggregate_metrics(experiment_folders) generates the throughput.csv and all_metrics.csv files in the root directory. The former contains the aggregated throughput metric (min, max, avg) with one row per experiment, while the latter contains the aggregated metrics (min, max, avg) of all performance benchmarks, also with one row per experiment.
    • throughput.csv schema: experiment_number,min,avg,max
    • all_metrics.csv schema: experiment_number,measurement_a_min,measurement_a_avg,measurement_a_max,measurement_b_min,...,measurement_g_max
  2. Calling write_clients_csv(experiment_folders, nc_by_folder) generates the clients.csv file in the root directory. This is the file requested in the project brief, with the following schema:
    • experiment_number,client_number,measurement_a,measurement_b,...,measurement_g
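The aggregation behind each row can be sketched as follows (a minimal illustration, assuming one value per client for a given measurement; the throughput numbers below are made up):

```python
def aggregate(values):
    """Collapse one value per client into the (min, avg, max) triple
    that fills a row of throughput.csv or all_metrics.csv."""
    return min(values), sum(values) / len(values), max(values)

# Hypothetical throughput readings from three clients in one experiment:
throughputs = [410.0, 455.0, 432.0]
row = aggregate(throughputs)  # one (min, avg, max) row for that experiment
```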

Running an experiment

The run-experiment.sh script runs clients in parallel, each reading its corresponding transaction file, with hosts assigned to clients in a round-robin manner.

Configuration

The number of host instances and their addresses can be configured in the run-experiment.sh script.

  • Example (to run on 3 nodes at xcnc0.comp.nus.edu.sg:26257, xcnc1.comp.nus.edu.sg:26257, xcnc2.comp.nus.edu.sg:26257):
####### CONFIGURATION OF MACHINES ##############
hosts[0]=0
ports[0]=26257
hosts[1]=1
ports[1]=26257
hosts[2]=2
ports[2]=26257
################################################

The run-experiment.sh script takes in an argument indicating the number of clients to run. It expects all transaction files to be under xact-files/ in the same directory.

For each client number {i}, it calls client.py with {i}.txt, redirecting stdout to {i}_output.out and stderr to {i}_stats.out. Additionally, the metrics are saved in comma-separated form to {i}.metrics.
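The round-robin host assignment described above can be sketched as follows (the real logic lives in run-experiment.sh; this Python version is illustrative only, and assumes clients are numbered from 1):

```python
def host_for_client(client_number, hosts):
    """Map client {i} (numbered 1..num_clients) onto the configured
    hosts in round-robin order."""
    return hosts[(client_number - 1) % len(hosts)]

# With the three-node configuration above, clients 1..4 would be
# assigned to xcnc0, xcnc1, xcnc2 and then xcnc0 again.
hosts = ["xcnc0.comp.nus.edu.sg", "xcnc1.comp.nus.edu.sg", "xcnc2.comp.nus.edu.sg"]
```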

Procedure

  1. Adjust configuration of run-experiment.sh
  2. Reset state of database by running setup.py
    • python3 setup.py -hn <host number> -p <port>
  3. Run transactions on clients in parallel
    • ./run-experiment.sh <number of clients>
  4. Wait until all clients have completed (i.e. all python3 processes spawned by run-experiment.sh have terminated)
  5. Retrieve database state with output_state.py
    • python3 output_state.py <output file name> -hn <host number> -p <port>

Outputs

In the same folder as run-experiment.sh, you should see the following files for each client ({i} from 1 to the number of clients):

  • {i}_output.out: Output of transactions
  • {i}_stats.out: Metrics of client and any transactions that were retried excessively
  • {i}.metrics: Metrics of client in comma separated values form

Analyzing output

Each experiment produces *_output.out, *_stats.out and *.metrics files for every client. At the end of each experiment, move all of these files into a folder whose name corresponds to the experiment number:

  • Experiment 1: run-20-node-4
  • Experiment 2: run-20-node-5
  • Experiment 3: run-40-node-4
  • Experiment 4: run-40-node-5

With the files inside these four folders, run the aggregate-metrics.py script to generate the clients.csv and throughput.csv files. You will need to open aggregate-metrics.py and make sure that both write_aggregate_metrics(experiment_folders) and write_clients_csv(experiment_folders, nc_by_folder) are uncommented. Refer to aggregate-metrics.py for further details on what each function does.

Other Important Files

transaction.py

This file contains the implementations of the transactions needed to run the experiments.

*.sql

There are several .sql files in the root directory. Their uses are categorized as follows:

Setup

  • create-tables.sql
  • drop-tables.sql
  • load-data.sql

Transactions

  • popular-item.sql
  • top-balance.sql
  • related-customer.sql

test-xact-files/*.txt

Contains a single transaction of each type, used to test the client implementation.
