What's Changed
- Fix Docker image and its dependencies
- Fix o1 concurrent generation output collection
- Update the code sanitization
Evaluated LLMs (157 models)
- o1-2024-12-17
- Gemini-2.0 series
Full Changelog: v0.2.1.post3...v0.2.1.post7
Full Changelog: v0.2.1.post3...v0.2.1.post7