This is an AI-based audio analyzer app.
Within the scope of this technical specification, the task is to implement the audio-analysis logic: breaking audio down into prompts that can then be used to generate images in MidJourney or another image-generation service. You can use any APIs; I used AssemblyAI for converting audio to text and LangChain for generating prompts.
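As a rough illustration of the "audio → text → prompts" idea, here is a minimal, framework-free sketch. In the real app the transcript text comes from AssemblyAI and the prompts are refined by an LLM via LangChain; the chunking helper below is a plain-Python assumption, not the project's actual logic.

```python
# Hypothetical sketch: turn a transcript into short phrases that can serve as
# image-generation prompts. The real app gets `transcript` from AssemblyAI and
# post-processes prompts with LangChain; this helper is illustrative only.

def split_transcript_into_prompts(text: str, max_words: int = 12) -> list[str]:
    """Split a transcript into word-chunks usable as image prompts."""
    words = text.split()
    return [
        " ".join(words[i : i + max_words])
        for i in range(0, len(words), max_words)
    ]

transcript = "a calm ocean at sunset with seagulls circling above the waves"
prompts = split_transcript_into_prompts(transcript, max_words=6)
# Each chunk can now be sent to MidJourney or another image service.
```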
- First, fill in the `.env` file with your data. The `API_KEY_ASSEMBLYAI` can be obtained for free on the AssemblyAI website.
- Then fill in the `docker-compose.env` file (if you want to run it via Docker Compose); see the example data for the Docker Compose setup.
- Finally, run `docker-compose up --build` and wait for it to start.
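A minimal sketch of what the `.env` file might look like. Only `API_KEY_ASSEMBLYAI` is named in this README; any other variables your deployment needs (database credentials, secret keys, etc.) would be additions of your own.

```env
# Only API_KEY_ASSEMBLYAI is documented here; add any other variables
# your setup requires. The placeholder value is, of course, an assumption.
API_KEY_ASSEMBLYAI=your_assemblyai_key_here
```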
- Full JWT authentication with login, logout, and registration, with robust validation.
- Protected endpoints only for authorized users.
- Ability to create prompt tasks both with audio files and text.
- Optimized querysets to avoid the N+1 problem, and a reliable database schema.
- Pagination to avoid returning large querysets in a single response.
- Docker and Docker Compose files.
- CRUD operations for the analyzer task, including bulk update/destroy, optimized for large datasets.
- Swagger documentation for the endpoints, available via the Swagger Documentation link.
- I did not manage to implement bulk update of prompts through nested serialization, so prompts are currently read-only.
- I planned to move the audio-conversion and LLM prompt-creation logic into Celery tasks (you may notice its configuration in the project), but decided not to do it for now.
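The N+1 fix mentioned above can be illustrated without the Django ORM (where `prefetch_related` plays this role). The sketch below uses `sqlite3` with illustrative table and column names, which are assumptions and not the project's actual schema.

```python
# Framework-agnostic illustration of avoiding N+1 queries: fetch all tasks in
# one query and all related prompts in a second query, instead of issuing one
# prompt-query per task. Table/column names here are illustrative assumptions.
import sqlite3
from collections import defaultdict

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE task (id INTEGER PRIMARY KEY, text TEXT);
    CREATE TABLE prompt (id INTEGER PRIMARY KEY, task_id INTEGER, content TEXT);
    INSERT INTO task VALUES (1, 'ocean audio'), (2, 'forest audio');
    INSERT INTO prompt VALUES
        (1, 1, 'sunset waves'), (2, 1, 'seagulls'), (3, 2, 'pine trees');
""")

# Exactly two queries total, like Django's prefetch_related,
# instead of 1 (tasks) + N (one per task's prompts).
tasks = conn.execute("SELECT id, text FROM task").fetchall()
by_task = defaultdict(list)
for task_id, content in conn.execute("SELECT task_id, content FROM prompt"):
    by_task[task_id].append(content)

result = {text: by_task[task_id] for task_id, text in tasks}
```

In the Django ORM the same effect is what `prefetch_related("prompts")` achieves for a reverse foreign-key relation.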
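The JWT flow can also be sketched in isolation. This example uses the PyJWT library directly, while a Django project would more likely use a dedicated package; the secret, claims, and token lifetime here are illustrative assumptions.

```python
# Standalone sketch of issuing and verifying a JWT with PyJWT.
# Secret, claims, and lifetime are illustrative assumptions.
import datetime

import jwt  # pip install PyJWT

SECRET = "change-me"

def issue_token(user_id: int) -> str:
    payload = {
        "sub": str(user_id),  # subject claim; PyJWT expects a string here
        "exp": datetime.datetime.now(datetime.timezone.utc)
        + datetime.timedelta(hours=1),
    }
    return jwt.encode(payload, SECRET, algorithm="HS256")

def verify_token(token: str) -> str:
    # Raises jwt.ExpiredSignatureError / jwt.InvalidTokenError on bad tokens.
    return jwt.decode(token, SECRET, algorithms=["HS256"])["sub"]
```

Protected endpoints would then accept a request only when `verify_token` succeeds on the presented token.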