The following paper implements a GPU-accelerated version of NUTS and reports speedups on a logistic regression model.
Tran, Dustin, Matthew W. Hoffman, Dave Moore, Christopher Suter, Srinivas Vasudevan, and Alexey Radul. "Simple, distributed, and accelerated probabilistic programming." In Advances in Neural Information Processing Systems, pp. 7598-7609. 2018. http://papers.nips.cc/paper/7987-simple-distributed-and-accelerated-probabilistic-programming
Can we try to run the vectorised HMC in #117 on the same model, and check the speedups?
Just to sync our understanding: the paper has NUTS on GPUs, but not the batch-mode multiple-chain sampling in #117, right?
It runs one chain, with the log density being parallelised on multiple CPUs and GPUs.
I think we only need to compare vectorised HMC against non-vectorised HMC, on both GPU and CPU. I listed the paper because it is related work.
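To make the comparison concrete, here is a minimal NumPy sketch of what "vectorised" means here: one leapfrog trajectory applied to a batch of chains at once, versus the same trajectory run chain by chain. It is not the AdvancedHMC.jl code from #117. The function names, the standard normal prior, the synthetic data, and the step size are all assumptions made for illustration, and it uses a fixed-length leapfrog rather than NUTS.

```python
import numpy as np

def grad_log_density(theta, X, y):
    """Gradient of a logistic-regression log posterior (assumed N(0, I) prior).

    theta: (n_chains, d), X: (n, d), y: (n,) with entries in {0, 1}.
    Returns an array of shape (n_chains, d), one gradient per chain.
    """
    logits = theta @ X.T                    # (n_chains, n)
    p = 1.0 / (1.0 + np.exp(-logits))       # predicted probabilities
    return (y - p) @ X - theta              # likelihood term plus prior term

def leapfrog(theta, r, eps, n_steps, X, y):
    """Run n_steps leapfrog steps on every chain in the batch simultaneously."""
    r = r + 0.5 * eps * grad_log_density(theta, X, y)   # initial half step
    for i in range(n_steps):
        theta = theta + eps * r                          # full position step
        g = grad_log_density(theta, X, y)
        # full momentum step, except a half step at the very end
        r = r + (eps if i < n_steps - 1 else 0.5 * eps) * g
    return theta, r

# Synthetic data (hypothetical sizes, chosen only for the sketch).
rng = np.random.default_rng(0)
n, d, n_chains = 100, 3, 4
X = rng.normal(size=(n, d))
y = (rng.uniform(size=n) < 0.5).astype(float)
theta0 = rng.normal(size=(n_chains, d))
r0 = rng.normal(size=(n_chains, d))

# Vectorised: all chains advance in one call.
theta_batch, r_batch = leapfrog(theta0, r0, 0.01, 10, X, y)

# Non-vectorised: one chain per call, results stacked afterwards.
per_chain = [leapfrog(theta0[i:i + 1], r0[i:i + 1], 0.01, 10, X, y)
             for i in range(n_chains)]
theta_loop = np.concatenate([t for t, _ in per_chain])
r_loop = np.concatenate([m for _, m in per_chain])

# Both routes should give the same trajectories up to floating-point error.
same = np.allclose(theta_batch, theta_loop) and np.allclose(r_batch, r_loop)
```

Timing the two routes (for example with `time.perf_counter`) on CPU, and then with a GPU array library in place of NumPy, would give the kind of speedup numbers asked for above.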
Related DynamicHMC.jl issue: tpapp/DynamicHMC.jl#110
We could also check the difference between AHMC and DHMC on GPU using the example there.
This is already supported.