v0.1.18
What's Changed
- Add Single Item as a view type with pagination by @dsmilkov in #846
- Add monaco and enable column-level diffing. by @nsthorat in #845
- Add parallelism to dataset.map with dask. by @nsthorat in #847
- Upgrade Cohere embeddings to v3-light by @brilee in #833
- Integrate Presidio into PII detection by @brilee in #839
- Simplify the UI for choosing media fields by @nsthorat in #844
Other Changes
- Fix the build_docs.sh and watch_docs.sh scripts to use the latest version of Lilac by @dsmilkov in #829
- Add backend support for sampling jsonl files by @brilee in #826
- Fix the HF deploy script for Windows. by @nsthorat in #831
- Fix the flaky hdbscan test by setting a UMAP random_state from the unit test. by @nsthorat in #832
- Remove redundant dataset_cache call by @brilee in #835
- Invalidate the query after the redirect to avoid 500 errors from deleted dataset. by @nsthorat in #836
- Fix dataset uploading on Windows. by @nsthorat in #837
- OpenAI Azure connector by @dechantoine in #838
- Expose hdbscan in the docs by @dsmilkov in #840
- Add a query type to SemanticSimilaritySignal and SemanticSearch: 'question' | 'document' by @nsthorat in #841
- Fix missing token in hf upload by @brilee in #842
- Add debouncing to file watcher recompilation by @brilee in #843
- Fix bug where missing keys in the filter constraint would raise KeyError by @brilee in #849
- Pass the job_id to the dataset.map map_fn. by @nsthorat in #848
- Add unit tests for num_jobs=-1 by @nsthorat in #850
Full Changelog: v0.1.17...v0.1.18