Update branch #2

Open · wants to merge 161 commits into base: probabilities
Conversation

LouisHernandez17 (Collaborator)

No description provided.

willkurt and others added 30 commits July 1, 2024 18:00
…eration (#1012)

I've added an abridged version of this [post on the .txt
blog](https://blog.dottxt.co/coding-for-structured-generation.html) to
the cookbook that should provide a good overview of a basic workflow for
developing code when working with structured generation.
…ration) (#1039)

As [discussed in our Discord
server](https://discord.com/channels/1182316225284554793/1182317446225481788/1261998326077984802)

This PR adds support for custom regex parsers. This doesn't change the
default behavior of Outlines, but it allows us to write custom
`Guide` classes that use custom regex parsers, e.g. for multimodal
generation.

Also improves documentation
As requested by @rlouf this PR adds a question answering with citations
example to the Cookbook using llama-cpp-python.
Add a fallback tokenizer if tiktoken cannot get the encoding from the model name
Support LLM services that provide an OpenAI-compatible API, such as
Ollama
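
As a rough sketch of what this enables (the wrapper signature here is an assumption on my part):

```python
import outlines
from openai import AsyncOpenAI
from outlines.models.openai import OpenAIConfig

# Point the OpenAI client at an OpenAI-compatible server such as Ollama.
client = AsyncOpenAI(
    base_url="http://localhost:11434/v1",
    api_key="ollama",  # Ollama ignores the key, but the client requires one
)
model = outlines.models.openai(client, OpenAIConfig("llama3"))
```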
- Correct link for llama-cpp-python
- Add installation instructions for llama-cpp-python
- Correct first question-answer
Rendered Docs:
https://github.com/lapp0/outlines/blob/multimodal-models/docs/reference/models/transformers_vision.md

- Fixes #787
- Fixes #662

# Changes

- Introduce `models.transformers_vision`, which subclasses
`models.transformers` and overrides its behavior so that it uses
`AutoProcessor` instead of `AutoTokenizer` to handle both the text and
the `PIL.Image` media
- Introduce `VisionSequenceGeneratorAdapter`, which handles and validates
the `media` argument
- Update `outlines.generate` to dispatch `TransformersVision` models to
`VisionSequenceGeneratorAdapter`

# Tests

- `tests/generate/test_api.py`: Test `prompt` / `media` validation
- `tests/generate/test_generate.py`:
  - Add a `model_transformers_vision` fixture. **Tests pass locally, but
are disabled because a model small enough for CI isn't available.**
  - Test all `outlines.generate` generators to ensure dispatching to this
new sequence generator is handled correctly.
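
For context, a minimal usage sketch of the new model type (the model choice and prompt format are illustrative, adapted from the rendered docs linked above):

```python
import outlines
from PIL import Image
from transformers import LlavaNextForConditionalGeneration

# AutoProcessor (rather than AutoTokenizer) handles both text and images.
model = outlines.models.transformers_vision(
    "llava-hf/llava-v1.6-mistral-7b-hf",
    model_class=LlavaNextForConditionalGeneration,
)

generator = outlines.generate.text(model)
# The second argument is the new `media` list, validated by
# `VisionSequenceGeneratorAdapter`.
description = generator("<image> Describe this image.", [Image.open("photo.jpg")])
```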
The `memory=` parameter is deprecated in favor of `size=`.
See https://modal.com/docs/reference/changelog#062174-2024-05-17

Current doc example produces the following error:
```
/path/test_modal.py:56: DeprecationError: 2024-05-16: The `memory` parameter is deprecated. Use the `size='80GB'` parameter instead.
  @app.function(image=outlines_image, gpu=gpu.A100(memory=80))
```
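
A minimal sketch of the fix (the app and image definitions here are illustrative; only the `gpu=` argument changes):

```python
import modal

app = modal.App("outlines-app")
outlines_image = modal.Image.debian_slim().pip_install("outlines")

# Previously: gpu=modal.gpu.A100(memory=80). Per the deprecation message,
# GPU memory is now requested as a size string.
@app.function(image=outlines_image, gpu=modal.gpu.A100(size="80GB"))
def generate(prompt: str) -> str:
    ...
```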
It seems Modal deletes environment variables, which makes
`outlines.models.transformers("mistralai/Mistral-7B-Instruct-v0.2")`
fail even after login. This workaround instructs the user to manually add
a key before importing the model.

Fixes #1024
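
An illustrative sketch of the workaround (assuming the Hugging Face token is the key in question):

```python
import os

# Set the token explicitly, since the Modal container does not inherit it
# from a local `huggingface-cli login`.
os.environ["HF_TOKEN"] = "hf_..."  # your token, or better, a Modal secret

import outlines

model = outlines.models.transformers("mistralai/Mistral-7B-Instruct-v0.2")
```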
Add links to the two examples:
- Q&A with Citations
- Knowledge Graph Generation
EdAbati and others added 30 commits December 21, 2024 14:19
Hi, Thank you for this great library!

It seems that the docstrings are not rendered correctly in the docs. I
think we should explicitly set the `docstring_style`, because [it
defaults to
`"google"`](https://mkdocstrings.github.io/python/usage/configuration/docstrings/#docstring_style),
but outlines uses numpy-style docstrings.

Before:
![Screenshot 2024-12-16 at 23 00 26](https://github.com/user-attachments/assets/c752ee3d-519e-4098-b943-3aab43c8af25)
After:
![Screenshot 2024-12-16 at 23 00 41](https://github.com/user-attachments/assets/5b5f524b-6921-4dbe-994d-72c079e677bc)


There seem to be other issues in the docstrings:
- for example
[`Properties`](https://github.com/dottxt-ai/outlines/blob/main/outlines/models/openai.py#L23)
should be
[`Attributes`](https://numpydoc.readthedocs.io/en/latest/format.html#parameters)
- only openai and transformers models are present in the [api
reference](https://github.com/dottxt-ai/outlines/blob/main/docs/api/models.md)

I'm happy to make follow-up PRs for those.

Please let me know if I missed something, I couldn't find related
issues/PRs.
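
For reference, the relevant setting would look something like this in `mkdocs.yml` (a sketch, assuming the standard mkdocstrings layout):

```yaml
plugins:
  - mkdocstrings:
      handlers:
        python:
          options:
            docstring_style: numpy
```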
Use `is` and `is not` for type comparisons, or `isinstance()` for isinstance checks
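
A small illustration of the rule being applied:

```python
x = 5

if type(x) is int:      # exact-type comparison: use `is`, not `==`
    ...

if isinstance(x, int):  # preferred: also matches subclasses such as bool
    ...
```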
The old library structure has not been updated to reflect the present one.
Before this commit, running `pytest -k specific_test` spawned dozens of copies of the same skip-warning message on stdout... IMHO, that was not ideal ^^

Bug introduced in d32dfde
Allow giving custom filters to the prompt decorator

```
from outlines import prompt

def reverse(s: str) -> str:
    return s[::-1]

@prompt(filters={'reverse': reverse})
def reverse_prompt(text):
    '''{{ text | reverse }}'''

prompt_text = reverse_prompt("Hello")
print(prompt_text)  # olleH
```
There's an extra `outlines.generate` row in the feature matrix docs. This removes it.

I also modified the markdown syntax for one header to use `**` rather than `__`, consistent with the rest of the table.
We noticed the following error with a recent version of outlines
when used with MLX:
```
TypeError: argument 'token_id': 'float' object cannot be interpreted as an integer

At:
  /.../outlines_core/fsm/guide.py(294): get_next_state
  /.../outlines/processors/structured.py(101): process_logits
  /.../outlines/processors/base_logits_processor.py(90): __call__
```

The issue is that the MLX array of tokens, which holds integers, was being
force-converted to floats, even though outlines expects an integer
array. This happened because all MLX arrays were being converted to
`float32`, even when that isn't appropriate, as in this case. Looking
at the [commented
link](https://ml-explore.github.io/mlx/build/html/usage/numpy.html#pytorch),
the advice was to convert to `float32` only for `bfloat16`, because
numpy does not support `bfloat16`. Now the MLX `_to_torch`
implementation matches the other array libraries; none of the other
libraries force-cast to float.
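
A sketch of the fixed conversion (the body shown is illustrative, not the exact patch):

```python
import mlx.core as mx
import numpy as np
import torch

def _to_torch(arr: mx.array) -> torch.Tensor:
    # Up-cast only bfloat16, since numpy has no bfloat16 dtype; every other
    # dtype, including integer token ids, round-trips unchanged.
    if arr.dtype == mx.bfloat16:
        arr = arr.astype(mx.float32)
    return torch.from_numpy(np.array(arr))
```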
The existing README has underwhelming or incorrect results ("Example is
underwhelming" #1347) due to a lack of templating for instruct models.

This adds special tokens to each instruct model call and provides
comments on how to obtain/produce special tokens.
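
For context, the standard way to produce such instruct formatting from the model's own chat template (model and message are illustrative):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("microsoft/Phi-3-mini-4k-instruct")
# Wraps the message in the model's special tokens, e.g. <|user|> ... <|end|>.
prompt = tokenizer.apply_chat_template(
    [{"role": "user", "content": "What is the capital of France?"}],
    tokenize=False,
    add_generation_prompt=True,
)
```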

---------

Co-authored-by: Victoria Terenina <[email protected]>
This PR integrates support for the `genson` package (in
`generate.json`) to enable dynamic JSON schema generation, as
proposed in #1383.
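
A quick sketch of what `genson` provides (the sample object is illustrative):

```python
from genson import SchemaBuilder

builder = SchemaBuilder()
builder.add_object({"name": "Ada", "age": 36})
schema = builder.to_schema()
# schema is now a dict along the lines of:
# {"type": "object",
#  "properties": {"name": {"type": "string"}, "age": {"type": "integer"}},
#  "required": ["age", "name"], ...}
```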
Also add instructions about different outlines "flavors"!

Co-authored-by: Cameron Pfiffer <[email protected]>