Use rule target kronecker.txt and remove from macro_bench #209

Open · wants to merge 1 commit into base: master

Conversation

@shakthimaan
Contributor

The graph500seq benchmarks were failing with "kronecker.txt: No such file or directory", as mentioned in #207. This PR:

  • Uses a target rule to build kronecker.txt before running kernel2 and kernel3 (a sketch of such a rule is shown after this list).
  • Removes graph500seq benchmarks from macro_bench tag.
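For context, a sketch of what such a rule could look like in the graph500seq dune file is below. This assumes kronecker.exe writes kronecker.txt itself when given the scale and edgefactor arguments (if it instead prints to stdout, a with-stdout-to action would be used); the actual rule in this commit may differ.

    (rule
     (targets kronecker.txt)
     (deps kronecker.exe)
     (action (run ./kronecker.exe 12 10)))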

@kayceesrk
Contributor

What's the thought process behind removing the macro_benchmark tag? Do these benchmarks not run long enough?

@shakthimaan
Contributor Author

shakthimaan commented Mar 25, 2021

> What's the thought process behind removing the macro_benchmark tag? Do these benchmarks not run long enough?

Yes, they do not run long enough, and as a result the benchmark graphs are getting skewed. Hence the request to remove them from macro_bench.

@kayceesrk
Contributor

If that's the case, then the timing tags (1s_10s, etc.) should also be removed?

@Sudha247
Contributor

This is one of the results I got a while ago for the graph500 benchmarks:

{"name":"kernel1.12_10","command":"taskset --cpu-list 5 ./kernel1.exe 12 10","time_secs":0.0026328563690185547,"user_time_secs":0.002383,"sys_time_secs":0.0,"maxrss_kB":5600,"ocaml":{"version":"4.10.0+multicore","c_compiler":"gcc","architecture":"amd64","word_size":"64","system":"linux","stats":"false","function_sections":"true","supports_shared_libraries":"true"},"gc":{"allocated_words":1874,"minor_words":1808,"promoted_words":0,"major_words":66,"minor_collections":0,"major_collections":0,"heap_words":4096,"top_heap_words":4096,"mean_space_overhead":0.0},"codesize":206811.0,"ocaml_url":"https://github.com/Sudha247/ocaml-multicore/archive/3deeb2604320a7337cc75f20116ea237d65fc400.tar.gz"}

It might be the case that the benchmark was not running due to the absent input file and this PR fixes that. However, I did not see an error message saying the benchmark failed to run; possibly the error message was simply not printed. We may have to revisit the timing tags for this. Also, regarding the macro_bench tag, it would be good to confirm the results are not fluctuating (which I observed a few times) before adding it back as a macro benchmark.

@kayceesrk
Contributor

Can we confirm the running times before removing the macro_bench tag? If the benchmarks do run for the amount of time the results say they do, then I'd prefer to keep the macro_bench tag.

If there is variance between different runs, it may be due to the random number generation. We should fix the seed to a constant value if that is the case.
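Pinning the seed is a small change in OCaml, assuming the kernels use the standard Random module and currently seed from the system (both are assumptions about the graph500seq code); a minimal sketch:

    (* Sketch: initialise the PRNG with a fixed seed so every run sees the
       same pseudo-random sequence. Random.self_init () would instead pick
       a different seed on each run, which is a likely source of variance. *)
    let () = Random.init 42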

@Sudha247
Contributor

I'm happy with retaining the tag if the benchmarks are running long enough. So, I think what needs to be done is:

  • Running the benchmarks with the target rule in this PR and collecting times for the graph500 benchmarks; I believe this would give us accurate timing data. Once we have the results, the timing tags may need to be adjusted accordingly.
  • Running the benchmarks several times to be sure there isn't much variation between runs. If there isn't, all good; if there is, it could become a source of noise in the overall data and we would need to find a way to fix it.

@kayceesrk
Contributor

My assessment is that we should leave the macro_bench tag in. kernel1 runs for about 4s (with changes; see below), kernel2 runs for 25 seconds, and kernel3 runs for 80 seconds. They also do a fair number of major and minor collections.

It looks like kernel1 does not actually run the main function, but only defines it; see https://github.com/ocaml-bench/sandmark/blob/master/benchmarks/graph500seq/kernel1.ml#L106-L113. It is unsurprising that it finishes immediately.
An easy fix would be to rename kernel1.ml to graph_construction.ml and implement a new kernel1.ml that calls the linkKronecker function. Reading of the command-line arguments for scale and edgefactor should move from graph_construction.ml into the new kernel1.ml, which of course means that linkKronecker would take scale and edgefactor as arguments.
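As an illustration of that split (the module name Graph_construction and a linkKronecker signature taking two integers are assumptions here, not the current code), the new kernel1.ml could be as small as:

    (* Hypothetical kernel1.ml after the split: read scale and edgefactor
       from the command line and hand them to the construction code in
       graph_construction.ml, so the graph construction actually runs. *)
    let () =
      let scale = int_of_string Sys.argv.(1) in
      let edgefactor = int_of_string Sys.argv.(2) in
      Graph_construction.linkKronecker scale edgefactor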

It seems weird that all the kernels take command-line arguments, currently set to 12 10 in run_config.json, and in addition read kronecker.txt. Currently, kronecker.txt is generated by kronecker.exe 12 10. The interface seems brittle; surely we shouldn't have to pass 12 10 as arguments to each of the executables. What would happen if I generate kronecker.txt with arguments 15 15 and then run the kernels with 12 10? Something must go wrong. If not, why not? The whole thing smells a little fishy.

Kernel2 and kernel3 write to the standard output. We should redirect the output to /dev/null in run_config.json.

We should use snake_case rather than camelCase, to be consistent.
