Skip to content

Stochastic variational inference for latent dirichlet allocation models (whew)

License

Unknown, MIT licenses found

Licenses found

Unknown
LICENSE
MIT
LICENSE.md
Notifications You must be signed in to change notification settings

nerskin/lda.svi

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

49 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

lda.svi

This R package fits latent dirichlet allocation models to data using the stochastic variational inference method introduced in this paper by Matthew Hoffman and coauthors. This method purports to allow LDA models to be fit considerably faster, and using considerably less memory, than with the popular batch variational Bayes method. As far as I can tell this is the only R package implementing this method. The key functions are implemented in C++ for speed.

A somewhat more helpful reference for the method is this paper, which motivates the algorithms and places it in a broader context.

Non-R Dependencies

  • A C++ compiler supporting C++11.

Philosophy

The interface is designed with Hadley Wickham's tidy data principles in mind, and therefore fits in nicely with the tidytext package by Julia Silge and David Robinson, which I recommend for preprocessing text and postprocessing model output.

Installation

I might submit this to the CRAN at some point, but in the meantime you can install it by running

#install.packages('devtools')
devtools::install_github("nerskin/lda.svi")

About

Stochastic variational inference for latent dirichlet allocation models (whew)

Resources

License

Unknown, MIT licenses found

Licenses found

Unknown
LICENSE
MIT
LICENSE.md

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published