
Adding Example Annotation with Showcase Notebook #550

Merged
merged 3 commits into instructlab:main on Feb 23, 2025

Conversation

eshwarprasadS
Contributor

@eshwarprasadS eshwarprasadS commented Feb 11, 2025

Fixes #527 #525
This PR:

  • Adds an example notebook showing how to leverage SDG to enable annotation use cases.

  • The example explicitly sets vLLM's guided_decoding_backend (xgrammar) to make sure that the annotation options (guided_choices) are respected during generation.

  • Adds an example notebook showcasing an end-to-end custom use case: using SDG to annotate a classification dataset with the composable components available in the SDG library (namely Pipeline).
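To illustrate the guided-decoding idea in the bullets above, here is a minimal sketch of building a constrained annotation request. This is an assumption-laden illustration, not code from the PR: the `guided_choice` field follows vLLM's OpenAI-compatible server extension, and the model name and label set are placeholders.

```python
# Sketch: an OpenAI-style chat request body for a vLLM server that
# constrains the generated annotation to a fixed label set.
# Assumptions: a vLLM server configured with the xgrammar
# guided-decoding backend; model name and labels are hypothetical.

LABELS = ["positive", "negative", "neutral"]  # hypothetical label set

def build_annotation_request(text: str) -> dict:
    """Build a request body whose completion is restricted to LABELS."""
    return {
        "model": "my-annotation-model",  # placeholder model name
        "messages": [
            {"role": "user", "content": f"Classify the sentiment of: {text}"}
        ],
        # vLLM-specific extension: decoding may only emit one of these.
        "guided_choice": LABELS,
    }

request = build_annotation_request("The release notes were very helpful.")
print(request["guided_choice"])
```

Because the output space is restricted server-side, the annotation labels in the dataset stay clean without any post-hoc parsing of free-form model text.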

@mergify mergify bot added the documentation Improvements or additions to documentation label Feb 11, 2025
@bbrowning
Contributor

This looks like a useful example to have in the repository. I rarely use Jupyter notebooks and am not really set up with a way to validate or run this to ensure it works as expected, but perhaps that's something @aakankshaduggal or @khaledsulayman can help with to give this an approval?

Also, we should think of how we keep this updated to ensure it doesn't get stale. The other examples all run as part of our unit test suite in test_examples.py. That may not be possible with a notebook in the same way, but something to think about so we know how we'll keep it working and up-to-date if we can't automatically test it.

@eshwarprasadS
Contributor Author

> Also, we should think of how we keep this updated to ensure it doesn't get stale. The other examples all run as part of our unit test suite in test_examples.py. That may not be possible with a notebook in the same way, but something to think about so we know how we'll keep it working and up-to-date if we can't automatically test it.

@bbrowning Thanks for the comment. You are right that this would be a new type of artifact needing its own testing suite. There are a few different ways to test notebooks in AI/ML libraries such as ours, but something I came across that might fit our pattern is nbmake. It plugs into pytest, can be part of our CI, and can be activated like so:

- name: Test Jupyter Notebooks
  run: pytest --nbmake path/to/notebooks/

I think it would make sense to open a new issue for this, if we have a strong desire to keep our example Jupyter notebooks tested and updated regularly.
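For completeness, a hedged sketch of what such a CI job might look like. The workflow name, action versions, Python version, and notebook path below are assumptions for illustration, not taken from this repository:

```yaml
# Hypothetical GitHub Actions job running notebooks via nbmake.
# Paths and versions are illustrative placeholders.
name: notebook-tests
on: [pull_request]
jobs:
  test-notebooks:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.11"
      - name: Install dependencies
        run: pip install pytest nbmake jupyter
      - name: Test Jupyter Notebooks
        run: pytest --nbmake path/to/notebooks/
```

One trade-off to weigh: notebooks that call a live model server would need that server mocked or skipped in CI, so the nbmake job may only cover the notebook's non-inference cells.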

@bbrowning
Contributor

@eshwarprasadS Yes, it's fine to make a new issue to track figuring out how or if we can keep the notebooks updated.

@eshwarprasadS
Contributor Author

> @eshwarprasadS Yes, it's fine to make a new issue to track figuring out how or if we can keep the notebooks updated.

created this for book-keeping #562

Member

@abhi1092 abhi1092 left a comment


We can add the prompt creation/prompt string to the YAML file at the start of the notebook, but it's not necessary; just a suggestion. Other than that, everything looks good.

Member


A quick suggestion to make the notebook more complete: we can add the creation of the annotation YAMLs/prompts at the start. Currently we assume the user has already done that before they start using the notebook.

Contributor Author


Hey Abhishek, thanks for the comment. The intent of the notebook was also to demonstrate, step by step, how to arrive at a good prompt through prompt engineering (the way to achieve this in SDG is to change the YAMLs). That was the primary reason I did not take a 'once at the top' approach.

Having said this, do you think there's any merit to the approach I have taken here?

@mergify mergify bot added the one-approval label Feb 19, 2025
Member

@shivchander shivchander left a comment


A few suggestions, but other than that, LGTM.

…s, change backend to xgrammar

Signed-off-by: eshwarprasadS <[email protected]>
@mergify mergify bot merged commit c91c439 into instructlab:main Feb 23, 2025
5 checks passed
@mergify mergify bot removed the one-approval label Feb 23, 2025
Successfully merging this pull request may close these issues.

Ensure SDG can take advantage of vLLM's guided decoding
4 participants