parallel_forest v0.1

Ensemble learning on IPython.parallel. The codes are under developing state and constantly updated, please check the update if you get any error :). Please feel free to write the wiki.

Requirements

You need

[Python 2.7.x] (https://www.python.org/downloads/)
[IPython.parallel 3.2.3] (http://ipython.org/ipython-doc/dev/parallel/)
[numpy 1.10.4] (http://www.numpy.org/)
pyzmq 15.2.0

To run [demo file] (https://github.com/wasit7/parallel_forest/blob/master/demo/spiral.py), this additional package is needed

matplotlib 1.5.1

To view [notebook file] (https://github.com/wasit7/parallel_forest/blob/master/nb/parallel%20forest.ipynb) locally, these additional packages are needed

Jinja2 2.8
tornado 4.3
jsonschema 2.5.1
matplotlib 1.5.1

Setup

$ python setup.py install

or install with pip by call the following command at project's root directory

$ pip install .

Demo

In order to run the [demo file] (https://github.com/wasit7/parallel_forest/blob/master/demo/spiral.py), you need to start the IPython's controller and engines. You can setup your own cluster or simply run the following command to start one controller and four engines at your computer

$ ipcluster start

After that you can run the demo file by

$ cd demo
$ python spiral.py

Sample

Have a look in the parallel_forest.ipynb

deprecated setup

Creating the dataset

By executing spiral_pickle.py then the data is recorded into a pickle file. You have to rename the spiral.pic to datasetXX.pic, where [XX] is index of an engine you have to change the index according to the number of engine you have. For example, a number of engines is 8 the number [XX] are 00, 01, 02, ..., 07.
The recorded data may be loaded and verified by executing spiral_pickle_r.py

Starting ipcluster. Here you can check the instruction in the Ipython website. If you are using TU-Scicloud please check the tutorial [here] (https://www.youtube.com/watch?v=7uZpEoLWhb4)
When the data and the cluster are ready, we can start training by running scmain.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!