Skip to content

xiaodaigh/data_manipulation_benchmarks

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Julia vs R data manipulation benchmark suite

A comparison of data manipulation prowess using synthetic data and the GE Flight Quest data

Set up instructions

  1. Change the settings.csv's data_path to a path that you can write to
  2. Download the 7z file (https://www.kaggle.com/c/flight/download/InitialTrainingSet_rev1.7z) and
  3. Extract it into the folder data_path/InitialTrainingSet_rev1

Synthetic benchmarks

Adapted from data.tables' official benchmarks

"Real-life" benchmarks

Uses GE Flight Quest data, the largest tabular dataset on Kaggle at the time of writing

Companion post

Speed of data manipulations in Julia vs R

About

A set of data manipulation benchmarking code for Julia and R

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •