Skip to content

huge variance in time to load iris  #79

Open
@davidbp

Description

@davidbp

Hello

I have observed a 10x difference when loading the iris dataset in 2 different machines.

Loading times are a bit unreasonable, is there anything I can do to speed this up?

ulia> using RDatasets

julia> @time iris = dataset("datasets", "iris"); # a DataFrame
100.068931 seconds (75.23 M allocations: 4.053 GiB, 3.19% gc time)

julia> 102.497734 seconds (75.35 M allocations: 4.062 GiB, 3.33% gc time)
       (v1.2) pkg> status RDatasets
           Status `~/.julia/environments/v1.2/Project.toml`
         [a93c6f00] DataFrames v0.19.4
         [ce6b1742] RDatasets v0.6.4

julia> versioninfo()
Julia Version 1.2.0
Commit c6da87ff4b (2019-08-20 00:03 UTC)
Platform Info:
  OS: macOS (x86_64-apple-darwin18.6.0)
  CPU: Intel(R) Core(TM) i5-4278U CPU @ 2.60GHz
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, haswell)
Environment:
  JULIA_EDITOR = subl

(v1.2) pkg> status RDatasets
    Status `~/.julia/environments/v1.2/Project.toml`
  [336ed68f] CSV v0.5.14
  [a93c6f00] DataFrames v0.19.4
  [ce6b1742] RDatasets v0.6.4

In the other machine I get:

julia> using RDatasets
[ Info: Recompiling stale cache file /home/david/.julia/compiled/v1.1/RDatasets/JyIbx.ji for RDatasets [ce6b1742-4840-55fa-b093-852dadbb1d8b]

julia> @time iris = dataset("datasets", "iris"); 
 10.544570 seconds (37.27 M allocations: 1.767 GiB, 8.98% gc time)

julia> versioninfo()
Julia Version 1.1.0
Commit 80516ca202 (2019-01-21 21:24 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
  CPU: Intel(R) Core(TM) i7-4600U CPU @ 2.10GHz
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, haswell)

(v1.1) pkg> status RDatasets
    Status `~/.julia/environments/v1.1/Project.toml`
  [336ed68f] CSV v0.5.14
  [a93c6f00] DataFrames v0.18.4
  [ce6b1742] RDatasets v0.6.1

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions