Top-Starred R GitHub Repos to Follow

Trending from 2017-07-01 to 2017-07-31

  1. catboost/catboost 1672
    CatBoost is an open-source gradient boosting on decision trees library with categorical features support out of the box for Python, R
  2. jonocarroll/ggshape 57
    Arrange 'ggplot' facets in arbitrary shapes
  3. dirkschumacher/llr 56
  4. WinVector/seplyr 29
    Standard Evaluation Interfaces for Common Dplyr Verbs
  5. PaulTaykalo/xcode-time-tracker 22
  6. xuzhougeng/Learn-Bioinformatics 22
  7. benmarwick/rrtools 21
    rrtools: Tools for Writing Reproducible Research in R
  8. jhoupt/DBDA2Estan 21
    Stan implementations of models in Doing Bayesian Data Analysis, 2nd Edition
  9. egnha/nofrills 20
    Low-cost anonymous functions
  10. dirkschumacher/awesome-r-organizations 20
    A community curated list of awesome companies/organizations that contribute open source R software/packages
  11. hrbrmstr/tidyweb 18
    Easily Install and Load Modern Web-Scraping Packages
  12. ThinkRstat/tweetstorm 15
  13. jumpingrivers/podcasts 15
    A collection of Data Science and Statistics podcasts
  14. thomasp85/reqres 11
    Powerful classes for http requests and responses
  15. seandavi/Bioc2017BigDataWorkshopSession 10
    Tutorial for working with cloud infrastructure and AWS from R
  16. dreamRs/billboarder 9
    🚧 Htmlwidget for billboard.js
  17. gilliganondata/ga-view-audit 9
    Script that cycles through a list of views (view IDs) and makes a snapshot of custom dimension, custom metrics, and goals that it then pushes to an Excel file.
  18. LucyMcGowan/gifr 8
    Making gifs in R. Totally open to a name change
  19. gsimchoni/kandinsky 8
    Turn any dataset into a Kandinsky painting
  20. dreamRs/addinit 8
    Initialize an 'RStudio' Project
  21. bhaskarvk/docker 8
    R Package For Accessing Docker via Docker APIs
  22. seandavi/MachineLearningIntro 8
    Machine learning use cases for teaching
  23. rstats-db/RMariaDB 7
    An R interface to MariaDB
  24. zonination/gisstemp 7
  25. jr-packages/efficientTutorial 6
    Slides and tutorials for useR!2017 Efficient R tutorial
  26. stephaniehicks/methylCC 6
    R/BioC package to estimate the cell composition of whole blood in DNA methylation samples in microarray or sequencing platforms
  27. mdsumner/sfraster 5
  28. stillmatic/quiltr 5
    R interface for Quilt Data Package Manager
  29. jeknov/EMNLP_17_submission 5
    The dataset and statistical analysis code released with the submission of EMNLP 2017 paper "Why We Need New Evaluation Metrics for NLG"
  30. openanalytics/poissontris 5

All-time top-starred

  1. kanaka/mal 3856
    mal - Make a Lisp
  2. facebookincubator/prophet 3599
    Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
  3. johnmyleswhite/ML_for_Hackers 3059
    Code accompanying the book "Machine Learning for Hackers"
  4. tidyverse/ggplot2 2549
    An implementation of the Grammar of Graphics in R
  5. rstudio/shiny 2399
    Easy interactive web applications with R
  6. qinwf/awesome-R 2276
    A curated list of awesome R packages, frameworks and software.
  7. swirldev/swirl_courses 2275
    🎓 A collection of interactive courses for the swirl R package.
  8. h2oai/h2o-3 2251
    Open Source Fast Scalable Machine Learning API For Smarter Applications (Deep Learning, Gradient Boosting, Random Forest, Generalized Linear Modeling (Logistic Regression, Elastic Net), K-Means, PCA, Stacked Ensembles...)
  9. twitter/AnomalyDetection 2137
    Anomaly Detection with R
  10. tidyverse/dplyr 1894
    Dplyr: A grammar of data manipulation
  12. zonination/investing 1591
    Investing Returns on the Market as a Whole
  13. hadley/devtools 1491
    Tools to make an R developer's life easier
  14. yihui/knitr 1395
    A general-purpose tool for dynamic report generation in R
  15. jeroenjanssens/data-science-at-the-command-line 1271
    Data Science at the Command Line
  16. szilard/benchm-ml 1269
    A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
  17. toddwschneider/nyc-taxi-data 1002
    Import public NYC taxi and Uber trip data into PostgreSQL / PostGIS database, analyze with R
  18. hadley/adv-r 981
    Advanced R programming: a book
  19. ropensci/plotly 980
    An interactive graphing library for R
  20. rstudio/rmarkdown 871
    Dynamic Documents for R
  21. rich-iannone/DiagrammeR 829
    Graph and network visualization using tabular data in R
  22. hadley/rvest 813
    Simple web scraping for R
  23. rstudio/tensorflow 793
    TensorFlow for R
  24. hadley/r4ds 792
    R for data science
  25. jrnold/ggthemes 791
    ggplot themes and scales
  26. ramnathv/slidify 772
    Generate reproducible html5 slides from R markdown
  27. ujjwalkarn/DataScienceR 748
    A curated list of R tutorials for Data Science, NLP and Machine Learning
  28. IRkernel/IRkernel 727
    R kernel for Jupyter
  29. mlr-org/mlr 719
    mlr: Machine Learning in R
  30. genomicsclass/labs 703
    Rmd source files for the HarvardX series PH525x