Skip to content

v3.2.0

Compare
Choose a tag to compare
@Conchylicultor Conchylicultor released this 10 Jul 21:39
· 3745 commits to master since this release

Future breaking change:

  • The tfds.features.text encoding API is deprecated. Please use tensorflow_text instead.

New features

API:

  • Add a tfds.ImageFolder and tfds.TranslateFolder to easily create custom datasets with your custom data.
  • Add a tfds.ReadConfig(input_context=) to shard dataset, for better multi-worker compatibility (#1426).
  • The default data_dir can be controlled by the TFDS_DATA_DIR environment variable.
  • Better usability when developing datasets outside TFDS
    • Downloads are always cached
    • Checksum are optional
  • Added a tfds.show_statistics(ds_info) to display FACETS OVERVIEW. Note: This require the dataset to have been generated with the statistics.
  • Open source various scripts to help deployment/documentation (Generate catalog documentation, export all metadata files,...)

Documentation:

  • Catalog display images (example)
  • Catalog shows which dataset have been recently added and are only available in tfds-nightly nights_stay

Breaking compatibility change:

  • Fix deterministic example order on Windows when path was used as key (this only impact a few datasets). Now example order should be the same on all platforms.
  • Remove tfds.load('image_label_folder') in favor of the more user-friendly tfds.ImageFolder

Other:

  • Various performances improvements for both generation and reading (e.g. use __slot__, fix parallelisation bug in tf.data.TFRecordReader,...)
  • Various fixes (typo, types annotations, better error messages, fixing dead links, better windows compatibility,...)

Thanks to all our contributors who help improving the state of dataset for the entire research community!