The Face Recognition Plugin for Dataiku DSS provides tools to automate the process of face recognition within your data workflows. This plugin includes two main recipes:
- Face Encoding Recipe: Converts a managed folder of images, or a dataset of URLs pointing to images, into face encodings. Each encoding is a vector of 128 floats, and one encoding is produced for each face present in the source pictures.
- Reference Dataset Builder Recipe: Builds a reference dataset that assigns one grouping GUID to each set of matching face encodings.
To install the Face Recognition Plugin, follow these steps:
- Make sure cmake is installed on your DSS instance. On a Mac, run
brew install cmake
in a terminal. More details on the necessary installation steps here.
- On Dataiku DSS, go to App > Plugins > Add Plugin > Fetch from Git repository, then set the repository URL to
[email protected]:alexbourret/dss-plugin-face-recognition.git
- Follow the installation prompts to complete the setup.
The Face Encoding Recipe transforms images into face encodings. This recipe can process:
- A managed folder containing images.
- A dataset containing URLs pointing to images.
- Managed Folder: A folder containing image files.
- Dataset with URLs: A dataset where each row contains a URL pointing to an image.
- Face Encodings Dataset: A dataset containing the face encodings extracted from the input images.
- URLs column: If the recipe's input is a dataset, specify the name of the column containing the URLs of the images whose faces should be encoded.
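The output shape described above (one row per detected face, each carrying a 128-float encoding) can be sketched in plain Python. This is only an illustration: the column names `image_url` and `encoding`, the JSON serialization, and the `fake_encoder` stand-in are assumptions, not the plugin's actual schema or face detector.

```python
import json
from typing import Callable, Dict, Iterable, List

ENCODING_SIZE = 128  # each face encoding is a vector of 128 floats

# An encoder takes an image location and returns one encoding per face found.
Encoder = Callable[[str], List[List[float]]]


def encode_rows(rows: Iterable[Dict[str, str]], url_column: str,
                encoder: Encoder) -> List[Dict[str, str]]:
    """Emit one output row per face found in each input image."""
    output = []
    for row in rows:
        source = row[url_column]
        for encoding in encoder(source):
            if len(encoding) != ENCODING_SIZE:
                raise ValueError(
                    f"expected {ENCODING_SIZE} floats, got {len(encoding)}")
            # Assumed storage format: the vector serialized as a JSON string.
            output.append({url_column: source,
                           "encoding": json.dumps(encoding)})
    return output


def fake_encoder(source: str) -> List[List[float]]:
    """Hypothetical stand-in for a real face encoder: pretends that
    'group.jpg' shows two faces and any other image shows one."""
    faces = 2 if source.endswith("group.jpg") else 1
    return [[0.01 * i] * ENCODING_SIZE for i in range(faces)]


rows = [{"image_url": "https://example.com/solo.jpg"},
        {"image_url": "https://example.com/group.jpg"}]
encoded = encode_rows(rows, "image_url", fake_encoder)
print(len(encoded))  # 3: one row for solo.jpg, two for group.jpg
```

An image with several faces therefore contributes several rows, all sharing the same source URL.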
The Reference Dataset Builder Recipe creates a reference dataset
that assigns one grouping GUID to each set of matching face encodings.
This recipe helps identify and group similar faces.
- Face Encodings Dataset: The output from the Face Encoding Recipe or any dataset containing face encodings.
- Reference Dataset: A dataset with a unique grouping GUID for each set of matching face encodings.
- Unknown encodings column: Select the column containing the encodings in the dataset of faces not yet referenced.
- Known encodings column: Select the column containing the encodings in the dataset of already referenced faces.
- Known references column: Select the column containing the reference GUID.
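One way to picture how unknown encodings could be matched against known ones is a nearest-neighbor check on Euclidean distance. The sketch below is not the plugin's implementation: the `TOLERANCE` value, the function names, and the rule of keeping the first encoding of a new face as its reference are all assumptions made for illustration.

```python
import math
import uuid
from typing import List, Optional, Sequence, Tuple

TOLERANCE = 0.6  # assumed distance threshold, not taken from the plugin

Known = List[Tuple[str, Sequence[float]]]  # (reference GUID, encoding) pairs


def find_reference(unknown: Sequence[float], known: Known,
                   tolerance: float = TOLERANCE) -> Optional[str]:
    """Return the GUID of the closest known encoding within tolerance,
    or None if no known encoding is close enough."""
    best_guid, best_distance = None, tolerance
    for guid, encoding in known:
        distance = math.dist(unknown, encoding)  # Euclidean distance
        if distance <= best_distance:
            best_guid, best_distance = guid, distance
    return best_guid


def build_references(unknown_encodings: Sequence[Sequence[float]],
                     known: Known) -> List[str]:
    """Assign a grouping GUID to every unknown encoding, reusing a known
    GUID on a match and minting a new one otherwise."""
    known = list(known)  # work on a copy; the caller's list is untouched
    assigned = []
    for encoding in unknown_encodings:
        guid = find_reference(encoding, known)
        if guid is None:
            guid = str(uuid.uuid4())
            # The first encoding of a new face becomes its reference.
            known.append((guid, encoding))
        assigned.append(guid)
    return assigned


face_a = [0.0] * 128
face_a_again = [0.01] * 128   # about 0.11 away from face_a: a match
face_b = [1.0] * 128          # about 11.3 away from face_a: a new face
guids = build_references([face_a, face_a_again, face_b], known=[])
print(guids[0] == guids[1], guids[0] == guids[2])  # True False
```

Passing a non-empty `known` list plays the role of the known encodings and known references columns: encodings that match an existing reference reuse its GUID instead of receiving a new one.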
- Create a Managed Folder or Dataset:
  - Upload images to a managed folder in Dataiku DSS.
  - Alternatively, create a dataset containing URLs pointing to images.
- Run the Face Encoding Recipe:
  - Create a new recipe and select "Face Encoding Recipe".
  - Configure the input (managed folder or dataset with URLs) and output (face encodings dataset).
  - Run the recipe to generate face encodings.
- Build the Reference Dataset:
  - Create a new recipe and select "Reference Dataset Builder Recipe".
  - Configure the input (face encodings dataset) and output (reference dataset).
  - Run the recipe to build the reference dataset with grouping GUIDs.
- Upload Images: Upload a set of images to a managed folder in Dataiku DSS.
- Encode Faces: Use the Face Encoding Recipe to convert these images into face encodings.
- Build Reference Dataset: Use the Reference Dataset Builder Recipe to create a reference dataset with grouping GUIDs for matching faces.
This plugin is distributed under the Apache License, Version 2.0.