Releases: databrickslabs/dbldatagen
Hotfix for serverless support
- fixed issues running on Databricks serverless
release 0.4.0
This release adds the following new features:
- various bug fixes
- support for Constraints
- support for standard datasets
The new standard dataset feature allows creation of synthetic data sets in just a couple of lines of code for benchmarking / optimization and other purposes
release/v.0.3.6.1: Feature hotfixes (#274)
Hot fixes post v0.3.6
- Updates to documentation
- updates to enable dbldatagen work better with Databricks Connect
- bumped version
Release v0.3.6
This release includes fixes for use of dbldatagen
on the Databricks shared clusters
release/v0.3.5
This release adds the following enhancements:
- doc updates and additional content
- additional support for creating struct and JSON valued columns
- modifications to data spec code generation from existing datasets
- enhancements to reduce scenarios where manual use of base columns are needed
Hotfix v0.3.4post2
Fix for use of values
in complex typed columns (array, struct, or map)
Hotfix v0.3.4post1
Hotfix to address logger modifying root logger behavior
release/v0.3.4
The new release - labelled v0.3.4
adds improvements to build ordering, some bug fixes along with a new feature for automatically generating code to generate your synthetic data set from an existing schema or dataset (once it has been loaded as a Spark dataframe).
The new code generation feature has been marked as experimental
-- meaning that it is subject to change in coming releases.
The generated code is intended to be stub code that users will modify or refine to generate synthetic data that more closely matches their data set.
Feel free to file issues, or add discussion items with suggestions or feedback on improving this feature as it evolves.
release/v0.3.3post2
Corrected release versioning as PyPi will still serve latest release
- includes corrections for logger initialization
- includes hotfix for template expansion
- includes use of pandas optimizations when processing column definitions with multiple output columns
Release v0.3.2 postfix 2
Hotfix for template expansion for v0.3.2