- Python 3.11
- Docker
- Clone the project to your computer: `git clone [email protected]:GIBZ/students/infa3a2023/face-detection.git`
- Start the project with Docker: `docker compose up --build`
- Open the Swagger docs page.
The Python FastAPI service provides two endpoints:
| Request Type | Path | Body | Description |
| --- | --- | --- | --- |
| GET | /heartbeat | No parameters | Returns the status of the service |
| POST | /image/process | A .png image | Processes the image and sends it back |
The image processing endpoint accepts the following parameters:

| Parameter | Default | Type | Description |
| --- | --- | --- | --- |
| bounds | False | boolean | Allow visible image edges |
| side_spacing | 0.72 | number | Horizontal distance of the eyes to the image edge (0 = eyes at the edge, 0.9998 = eyes centered) |
| top_spacing | 0.4 | number | Vertical position of the eyes (0 = eyes at the top edge, 0.9998 = eyes at the bottom edge) |
| width | 512 | integer | Width of the final image |
| height | 640 | integer | Height of the final image |
| binary_method | multiclass | string | Method used to remove the background (multiclass = more accurate, slower; selfie = less accurate, faster) |
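For illustration, here is how the two endpoints could be called from Python. The base URL, port, and the multipart field name `file` are assumptions; adjust them to whatever your `docker compose` setup exposes.

```python
import requests

# Assumed base URL of the locally running FastAPI service (adjust host/port as needed).
BASE_URL = "http://localhost:8000"

# Health check: GET /heartbeat returns the status of the service.
heartbeat = requests.get(f"{BASE_URL}/heartbeat")
print(heartbeat.status_code, heartbeat.text)

# Process a portrait: POST /image/process with a .png image and the parameters above.
# The multipart field name "file" is an assumption, not taken from the project code.
with open("portrait.png", "rb") as f:
    response = requests.post(
        f"{BASE_URL}/image/process",
        params={
            "bounds": False,
            "side_spacing": 0.72,
            "top_spacing": 0.4,
            "width": 512,
            "height": 640,
            "binary_method": "multiclass",
        },
        files={"file": ("portrait.png", f, "image/png")},
    )

# The processed image is returned directly in the response body.
with open("processed.png", "wb") as out:
    out.write(response.content)
```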
The user sends the image from their smartphone to the Rust Service, which posts it to our FastAPI service. The FastAPI service analyzes and processes the image and returns it. The Rust Service then uploads the final image to the Google Bucket as usual.
In our project, the individual steps of image processing were split into different functions. This makes them easy to exchange or extend.
In the Rust Service, too, the URL could simply be pointed to another service, as long as the parameters and return format don't change.
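As a rough sketch of how such a function-per-step structure could look, each step can be a plain function with the same signature, so individual steps are swapped or extended without touching the rest. All names below are illustrative placeholders, not the actual functions in the repository.

```python
from typing import Callable

import numpy as np


def detect_face(image: np.ndarray) -> np.ndarray:
    # Placeholder: the real step would locate the face (e.g. with Mediapipe).
    return image


def align_and_crop(image: np.ndarray) -> np.ndarray:
    # Placeholder: rotate and crop so the eyes land at the configured position.
    return image


def remove_background(image: np.ndarray) -> np.ndarray:
    # Placeholder: segment the person and blank out the background.
    return image


# Because every step shares the same signature, steps can be exchanged,
# reordered, or extended by editing this list only.
PIPELINE: list[Callable[[np.ndarray], np.ndarray]] = [
    detect_face,
    align_and_crop,
    remove_background,
]


def process(image: np.ndarray) -> np.ndarray:
    for step in PIPELINE:
        image = step(image)
    return image
```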
We decided to use Mediapipe due to its high configurability and the possibility of local execution. It also allows us to expand our knowledge of Computer Vision while deepening our Python skills.
For implementation into the existing GIBZ solution, we considered three variants:
- Direct call via Mobile Client
- Call in Rust Service
- Call in Web Frontend
In the end, we chose the second variant for the following reasons:
- No updates are needed in the Mobile Client, which could potentially be more difficult to implement/deploy.
- Read/write operations on the Google Bucket are kept to a minimum, as we assume that a high number of requests would lead to higher costs.
- Our service is only added in the backend; the user doesn't need to perform any updates and won't notice anything.
- We don't use external services like Google Vision or AWS Rekognition, but execute everything locally. This ensures student data protection.
- The performance of our solution is already adequate but could easily be scaled by adjusting the cloud environment.
- By avoiding external providers, we save costs.
- We had to invest a lot of time in learning Computer Vision and the actual implementation. This would certainly have been easier using an external service.
The photo must meet the following requirements:

- Exactly one human face should be recognizable in the photo.
- The face in the photo must be completely visible.
- The face must be taken frontally. The head must not be tilted too much on any axis.
- The face must not be covered (e.g., by masks, pets, sunglasses, ...).
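As an illustration, the first two requirements could be checked with Mediapipe's face detection solution roughly as follows. This is a sketch under our own assumptions (model selection, confidence threshold), not the exact validation code used in the service.

```python
import cv2
import mediapipe as mp


def photo_is_acceptable(path: str) -> bool:
    """Return True if exactly one face is detected and it lies fully inside the frame."""
    image = cv2.imread(path)
    if image is None:
        return False
    rgb = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)

    with mp.solutions.face_detection.FaceDetection(
        model_selection=1, min_detection_confidence=0.5
    ) as detector:
        result = detector.process(rgb)

    detections = result.detections or []
    if len(detections) != 1:  # exactly one human face
        return False

    # The relative bounding box must lie completely inside the image,
    # otherwise the face is (partially) cut off at an edge.
    box = detections[0].location_data.relative_bounding_box
    return (
        box.xmin >= 0
        and box.ymin >= 0
        and box.xmin + box.width <= 1
        and box.ymin + box.height <= 1
    )
```

Checking the head pose and occlusions would additionally require the denser face-mesh landmarks and is omitted here.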
The integration into the overall system, as shown in the diagram, is not complex, and the required code changes were made in our fork.
*Demo video: recording.mp4*
We are particularly proud of the implementation of background removal, which uses integrated smoothing. Equally impressive is the automatic alignment of the photo depending on the angle of the face. It also takes into account how far the face is from the camera, resulting in a uniform final image.
We owe this outstanding solution to Raphael Andermatt (@raphmatt), who invested a lot of time and effort into it.
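To give an idea of the kind of segmentation with smoothing described above, here is a minimal sketch based on Mediapipe's selfie segmentation. The blur radius and the white replacement background are our own illustrative choices and not necessarily what the project uses.

```python
import cv2
import mediapipe as mp
import numpy as np


def remove_background(image_bgr: np.ndarray) -> np.ndarray:
    """Replace the background with white, using a smoothed segmentation mask."""
    rgb = cv2.cvtColor(image_bgr, cv2.COLOR_BGR2RGB)

    with mp.solutions.selfie_segmentation.SelfieSegmentation(model_selection=1) as seg:
        result = seg.process(rgb)

    # The raw mask is a float map in [0, 1]; blurring it yields soft,
    # smoothed edges instead of a hard, jagged cut-out.
    mask = cv2.GaussianBlur(result.segmentation_mask, (15, 15), 0)
    mask = np.clip(mask, 0.0, 1.0)[..., np.newaxis]

    background = np.full_like(image_bgr, 255)  # plain white background
    blended = image_bgr * mask + background * (1.0 - mask)
    return blended.astype(np.uint8)
```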