Skip to content

0.3.0

Compare
Choose a tag to compare
@longxiaofei longxiaofei released this 08 Aug 11:09
· 398 commits to main since this release

About Pygwalker 0.3.0

In the past, pygwalker has been using the data computing capabilities provided by graphic-walker, and all calculations were done in the browser.

However, when we need to analyze slightly larger datasets, the browser may crash or become unresponsive.

In the new pygwalker, graphic-walker only loads the data it needs; the calculation of the data is performed on the python side using duckdb. Duckdb is a very great database system; it provides a very simple invoke method and stronger computing power.♪(・ω・)ノ

Now, you can use pygwalker to explore larger datasets.

Enable kernel computation mode (duckDB) with

walker = pyg.walk(df, use_kernel_calc=True)

performance before (computation in javascript+webworker)

dataset with 300K rows

no_duckdb.mov

performance now (based on duckDB)

duckdb.mov

Feat

  • add new calculation, use duckdb as computing engine in the kernel. #193
  • update graphic-walker, add new filter tool and support limit feature. #193

Refactor

  • refactor data parser #185

@longxiaofei

Full Changelog: 0.2.0...0.3.0