Skip to content

Framework-free pre-trained model data links for neural network.

Notifications You must be signed in to change notification settings

shiba24/pretrained-model-collections

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 

Repository files navigation

Pre-trained model links of latest neural network papers

A list of available pre-trained neural network, in particular caffe and chainer models. Like model zoo of caffe, but this repository aims to offer you framework-free collections of the latest paper implements.

Recently many deep-learning frameworks exist such as caffe, theano, chainer, tensorflow, keras and so on. However, each pre-trained models are sometimes not compatible each other (i.e. cannot be loaded with other frameworks). This kind of exclusion of models for specific frameworks might prevent open improvement for deep learning communities including students, academic fields, and industries.

For the users who haven't installed caffe, I made up some files where weight and bias are saved as pickle. You can re-construct neural networks of your own framework, if you know the model structure. See also Loading Caffe model without Caffe

Notice

  1. Please be careful about the LICENSE for each model of your own usage.
  2. This is in progress and we need your help! Thank you.

Contributing

If you want to inform and add the new pre-trained models available, feel free for Issues and PRs. Also if you have installed caffe and want to contribute, please see Caffe model weight and bias export for non-caffe users.

TODOs

  • Add more models and implementations
  • List the LICENSE for each model
  • Scripts for chainer2pkl

Table of Contents

Models

Models are sorted for each tasks or competitions.

ILSVRC

Pascal datasets

VOC 2010 datasets

  • Fully Convolutional Network
    • now in prep

FLIC dataset for human pose estimation

Caffe

Loading Caffe model with Caffe

See the Caffe documentation for loading caffe models with caffe.

Loading Caffe model without Caffe

There are few ways for loading caffe model without caffe. Currently my best practices are two below:

1. Loading from chaner.CaffeFunction

Chainer is compatible to Caffe model to some extent. Please see Caffe function in Chainer

2. Loading pickles

Chainer doesn't support yet some functions and layers (e.g. Deconvolution layer) and might output some error messages. In that case, it might be possible to [load pkls](#Loading pkl) exported from caffe. The pkl file should be exported by those who DO have caffe! (To caffe users: please see Caffe model weight and bias export for non-caffe users)

Chainer

Loading Chainer model with Chainer

See the Chainer documentation for loading chainer model with chainer. See also how to copy model to another model.

Loading Chainer model without Chainer

The best practice

Currently the best practice might be installing chainer on your own environment because of its easy installation. pip install chainer For more details, please see chainer installation guide.

chainer2pkl

Now I am implementing chainer2pkl.py for exporting W and b of chainer model. Please wait for a while.

Theano

Loading Theano model with Theano

See the Theano documentation for loading theano models with theano.

Loading Theano model without Theano

Now in prep

Tips

Who has not installed caffe

If you want to use caffe pre-trained models in chainer or keras, please see Loading pkl. You need .pkl file exported or downloaded beforehand.

Caffe model weight and bias export for non-caffe users

Why exporting pickles for non-caffe users?

Recently the framework that has plenty of pre-trained models is Caffe, particularly in the academic papers.

However, the installation of caffe is a bit complicated. Those who use other frameworks might not want to install caffe only for the pre-trained models. This kind of exclusive possession of models for specific frameworks prevent open improvement for deep learning communities including students, academic fields, and industries.

So, I wrote caffe2pkl.py. This script works only caffe installed evironment and makes pickles of weights and biases.

Usage

python caffe2pkl.py --prototxt PATH_TO_PROTOTXT --model PATH_TO_MODEL

Loading pkl

In python,

with open(PATH_TO_PKL, 'rb') as d_pickle:
    data = six.moves.cPickle.load(d_pickle)

data is dict and has *_W and *_b keys for each layer. W ans b is np.ndarray.

Who wants to copy pre-trained chainer model

See the function copy_model in this web site

Although this is in Japanese, but you will find the function very useful. (No need to understanding the Japanese explanation exactly!)

Licence

MIT LICENSE

Author

shiba24

About

Framework-free pre-trained model data links for neural network.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages