This project automates the process of capturing, transcribing, and summarizing live radio streams. It utilizes OpenAI's Whisper model for transcription and the GPT-4 model for generating summaries of the transcribed text. Additionally, it fetches radio station stream URLs using the Radio Browser API, allowing for detailed analysis and understanding of content from various radio stations. The project now includes a Flask backend and integrates the Mesop for python UI.
- Stream Capture: Captures live audio from predefined radio stations using URLs fetched from the Radio Browser API.
- Audio Transcription: Utilizes OpenAI's Whisper model and the MesloP model to transcribe audio content to text.
- Text Summarization: Leverages OpenAI's GPT-4 model for summarizing transcribed texts.
- Continuous Operation: Designed to run continuously until manually stopped, making it ideal for long-term data collection.
Before you run this project, ensure you have the following installed:
- Python 3.8 or higher
ffmpeg
for handling audio streams- Required Python libraries:
openai
,requests
,subprocess
,flask
,meslop
(or any other specific dependencies for the MesloP model)
Follow these steps to set up the project environment:
- Clone the repository:
git clone https://github.com/mklemos/radio-analysis-project.git
cd radio-analysis-project
- Install the necessary Python packages:
pip install -r requirements.txt
To start the project, run the following command in the project directory:
python main.py
You will be prompted to enter the name of the radio station. After entering a valid station name, the system will begin processing the stream.
Edit the stream_utils.py
file to add or modify the list of radio stations and update the Flask configuration if necessary.
Contributions to the project are welcome. Please follow these steps:
- Fork the repository.
- Create a new branch:
git checkout -b my-new-feature
- Commit your changes:
git commit -am 'Add some feature'
- Push to the branch:
git push origin my-new-feature
- Submit a pull request.
This project is licensed under the MIT License - see the LICENSE.md file for details.
- Thanks to OpenAI for providing the API for transcription and summarization.
- Special thanks to anyone who contributes to the project.