LCPC16 Submission Experiments

This is a repository containing all the artifacts and code versions we used to perform the evaluation section for the LCPC16 submission. In addition to the experiments mentioned in the paper, it contains experiments that were not discussed either because they turned out to be only tangentially relevant to our work, or we did not have the space in the paper. In addition, in this README and others in sub-directories, we explain in more details some of our choices in the experiment design. By making all of of the experiments publicly available as well as explaining in more details our experiment design, we aim to make our research more reproducible and easier to build upon for future work by our own or other researchers.

The repository follows the Wu Wei conventions and is compatible with its commandline tools (https://github.com/Sable/wu-wei-benchmarking-toolkit/).

Each benchmark has multiple implementations that were either created by hand or automatically using the Mc2Mc tool (https://github.com/Sable/Mc2Mc). Each benchmark has its own README that explains the results we obtained for the different versions and the interpretation we made of the different results.

Installation instructions

Install the Wu-Wei toolchain (https://github.com/Sable/wu-wei-benchmarking-toolkit/wiki/Installation).

Clone this repository:

 git clone [email protected]:Sable/lcpc16-analysis.git

Run all the experiments that were done for the LCPC paper:
```
 wu run paper-experiment
```
Report all results:
```
 wu report
```

Note: if you meet problems, please check the troubleshooting section on the bottom before you ask questions.

Part I: Analysis of the performance on the different versions of MATLAB

Code versions:

- matlab         : original MATLAB code
- matlab-mixed   : mixed MATLAB code from original and vectorized MATLAB code
- matlab-plus    : vectorized MATLAB code with checks
- matlab-plus-no : vectorized MATLAB code with no checks

Compilers:

- MATLAB R2013a
- MATLAB R2015b

9 benchmarks

- Backprop
- Blackscholes
- Capr
- Crni
- FFT
- Monte Carlo simulation
- NW
- PageRank
- SPMV

R2015 release note

MATLAB Execution Engine: Run programs faster with redesigned

architecture (Performance section) Link: http://www.mathworks.com/help/matlab/release-notes.html

"The new MATLAB execution engine includes performance improvements

to function calls, object-oriented operations, and many other MATLAB operations. These performance improvements result in significantly faster execution of many MATLAB programs with an average speed-up of 40% among 76 performance-sensitive applications from users. Of these tested applications, 13 ran at least twice as fast and only 1 slowed down by more than 10%. The new execution engine uses just-in-time compilation of all MATLAB code which makes performance more uniform and predictable. The new engine offers improved quality and provides a platform for future performance optimizations and language enhancements."

Part II: Analysis of the performance on Octave

The current Octave version is 4.0 which doesn't support the JIT. The benchmark 'pagerank' is tested with the medium-size input. To my surprise, the performance of Octave is much worse in loops compared with MATLAB. With vectorization, Octave can achieve impressive speedup (65x~73x). The reason that the 'pagerank' doesn't have much speedup is because of MATLAB's JITs. With the competitive JITs, the advantage of the vectorization for loops is decreased sometimes even worse.

implementation	environment	mean
matlab	matlab-vm	1.6423s
matlab-vector	matlab-vm	1.5448s
matlab-plus	matlab-vm	1.5272s
matlab	octave	728.1882s
matlab-vector	octave	11.1293s
matlab-plus	octave	9.8707s

Speedup summary

In MATLAB: (baseline matlab)

matlab	vector	plus
1.0000x	1.0631x	1.0754x

In Octave (baseline matlab)

matlab	vector	plus
1.0000x	65.4298x	73.7727x

Why is vectorization slowing down in some cases?

After looking at the three cases that slow down, I think we can see some possible explanations. First read the three messages where I added some comments.

Overall, I would expect vectorization to always be a good idea when we have something like

    for i = 1:n
       b(i) = f(a(i))
    end

vectorized to

    b = f(a)

In both cases we have to create a new vector b. The only problem might be if a copy of a needs to be made to call the library function f. I would hope that this is not the case.

I would expect vectorization not to be a good idea when the original loop was only reading values from an array and writing to a scalar. The capr example illustrates this.

Whenever we use colon to refer to parts of an array, it is important to optimize its use as much as possible. The fft might be an example, if the code I proposed is faster.

Troubleshooting

Q1: What if you try "wu list" and the system returns "....wu/setup.json not found, aborting"?

Answer:

1. Go to the root of the repository;
2. "vim .wu/setup.json"
3. input with
    {
            "platform": "lynx"
    }
4. save, exit and try "wu list" again

Hint: please check the directory "platforms/" to find a right platform (i.e. a folder name). More details about creating a new configuration can be found at Wu-Wei's wiki.

Q2: What if compilers are not correctly configured?

Answer:

1. Find where compilers are, for example MATLAB-2013a (e.g. which matlab) 
2. Type: vim environments/matlab-vm-2013a-jit/environment.json
3. Replace the "executable-path" field with the path to MATLAB-2013a 
4. Save and exit

Q3: What if the "wu run paper-experiment" fails in Octave?

Answer:

1. Go to "environments/octave-4.0"
2. Compile c code with `mkoctfile --mex createMatrixRandJS.c` and `mkoctfile --mex createRandomPageMatrices.c`
3. Move the generated libraries "\*.mex" into the subdirectory "linux/" or "osx/"
4. Try the command "wu run paper-experiment" again

Name		Name	Last commit message	Last commit date
Latest commit History 74 Commits
.wu		.wu
benchmarks		benchmarks
compilers		compilers
environments		environments
experiments		experiments
implementations		implementations
platforms		platforms
reports		reports
summary		summary
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
mixed.txt		mixed.txt
propogate.sh		propogate.sh
run-all.sh		run-all.sh
run-check.sh		run-check.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LCPC16 Submission Experiments

Installation instructions

Part I: Analysis of the performance on the different versions of MATLAB

Part II: Analysis of the performance on Octave

Speedup summary

Why is vectorization slowing down in some cases?

Troubleshooting

About

Releases

Packages

Contributors 2

Languages

License

Sable/lcpc16-analysis

Folders and files

Latest commit

History

Repository files navigation

LCPC16 Submission Experiments

Installation instructions

Part I: Analysis of the performance on the different versions of MATLAB

Part II: Analysis of the performance on Octave

Speedup summary

Why is vectorization slowing down in some cases?

Troubleshooting

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages