[WIP] Add support for labels when visualizing embeddings #112

ehofesmann · 2022-10-28T13:40:59Z

UMAP fitting supports labels for supervised dimensionality reduction, and it accepts an incomplete set of labels. The labels guide how the dimensionality reduction is performed, so the embedding space is structured around a particular tagged attribute.
Example workflow:

Take the COCO dataset
Tag 5 images of people and 5 with no people
Compute embeddings to generate this plot, which distributes all samples along an axis of "personness"
Select samples from the tails of this distribution and tag them as "person" and "not person"
Recompute embeddings to get an even more structured embeddings visualization
Iteratively add good examples of each class, recomputing the visualization each time to generate more accurate clusters

This was all done just using tags in the App:

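import fiftyone.brain as fob
from fiftyone import ViewField as F

def visualize_from_tags(dataset, embeddings):
    # Use each sample's first tag as its label; untagged samples get None
    labels = [t[0] if t else None for t in dataset.values("tags")]
    results = fob.compute_visualization(dataset, embeddings=embeddings, labels=labels)
    # Color the plot by first tag, showing "None" for untagged samples
    plot = results.visualize(labels=(F("tags").length() > 0).if_else(F("tags")[0], "None"))
    return plot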
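
To make the "incomplete set of labels" idea concrete, here is a minimal sketch of turning per-sample tags into integer targets. The helper name `encode_partial_labels` is hypothetical and is not part of this PR. It assumes the umap-learn convention of marking unlabeled points with `-1`, and the PR may handle `None` labels differently internally.

```python
def encode_partial_labels(tags_per_sample):
    """Map each sample's first tag to an integer code.

    Hypothetical helper: untagged samples (empty list or None) are
    encoded as -1, which umap-learn treats as "unlabeled" when a
    target array is passed to fit.
    """
    codes = {}    # tag -> integer code, assigned in order of first appearance
    encoded = []  # one integer per sample
    for tags in tags_per_sample:
        if not tags:
            encoded.append(-1)
            continue
        label = tags[0]
        if label not in codes:
            codes[label] = len(codes)
        encoded.append(codes[label])
    return encoded, codes


encoded, codes = encode_partial_labels(
    [["person"], [], ["not person"], ["person"], None]
)
```

With umap-learn installed, a target list like `encoded` could be passed as `umap.UMAP().fit_transform(embeddings, y=encoded)`; that call is shown only to illustrate the mechanism, not as a description of how this PR wires labels through.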

add support for labels in UMAP fit

0f54eb8

ehofesmann added the feature label Oct 28, 2022

ehofesmann self-assigned this Oct 28, 2022
