Automatic creation of an image-slideshow video with voiceover from an input prompt, by integrating GPT-3.5, Stable Diffusion, ElevenLabs and FFmpeg
Open-source, strictly intended for research purposes only
Demo screencast: Screencast-from-17-04-2023-01050.1.mp4
Download videoAutoGen.py, install the dependencies (see below), create a constants.py holding your OpenAI and ElevenLabs API keys, and run the script from the command line. The program pauses after each step so the user can terminate it from the command line; in the future this is intended to be extended to take user feedback and refine the output of each step.
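constants.py is not part of the repository; the exact variable names it must define are whatever videoAutoGen.py imports, but a minimal version along these lines (names assumed here for illustration) is expected:

```python
# constants.py -- personal API keys; never commit or share this file.
# Variable names are illustrative; use whatever names videoAutoGen.py imports.
OPENAI_API_KEY = "sk-..."       # from https://platform.openai.com
ELEVENLABS_API_KEY = "..."      # from https://elevenlabs.io
```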
- GPT-3.5 via the OpenAI API (key required)
- Stable Diffusion via Hugging Face Diffusers
- ElevenLabs API (key required)
- ffmpeg-python (Python wrapper for FFmpeg; the FFmpeg binary must be installed separately)
- Other Python modules: torch, plus the standard-library modules os, re and glob (an import sketch follows this list)
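As a rough guide, an environment satisfying this list needs to provide the imports below. The exact packages and versions are defined by videoAutoGen.py; in particular, calling the ElevenLabs API through plain requests is an assumption for illustration, not a pinned dependency of the repo.

```python
# Imports implied by the dependency list above (illustrative, not a lock file).
import os, re, glob                              # standard library
import torch                                     # PyTorch; a GPU is strongly recommended for Stable Diffusion
import openai                                    # OpenAI client, used for the GPT-3.5 calls
from diffusers import StableDiffusionPipeline    # Hugging Face Diffusers
import requests                                  # plain HTTP client for the ElevenLabs API
import ffmpeg                                    # provided by the ffmpeg-python package (FFmpeg binary required)
```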
- The user enters an input prompt on the command line; the program calls GPT-3.5 via the OpenAI API to generate a short video script
- Given the video script, GPT-3.5 is called again via the OpenAI API to generate an image description for each scene
- The image descriptions are fed into Hugging Face's Diffusers library, using Stable Diffusion v2.1 to generate a matching image per scene
- The script is parsed for the voiceover text, which is passed to ElevenLabs to create voiceovers in Obama's voice
- FFmpeg pairs each image with its voiceover to make one video per scene, then joins all the scene videos into the final video
- The final video (combined.mp4) is produced, along with the component script, image, audio and per-scene video files; minimal sketches of the generation and assembly steps follow this list
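First, the generation steps (script, image descriptions, images, voiceover) in a minimal, illustrative form rather than the exact code in videoAutoGen.py. The sketch assumes the pre-1.0 openai Python interface (openai.ChatCompletion), the stabilityai/stable-diffusion-2-1 checkpoint, the documented ElevenLabs REST text-to-speech endpoint, and placeholder prompt wording, paths and voice ID:

```python
import os

import requests
import torch
import openai
from diffusers import StableDiffusionPipeline
from constants import OPENAI_API_KEY, ELEVENLABS_API_KEY

openai.api_key = OPENAI_API_KEY


def ask_gpt(prompt):
    """Single GPT-3.5 chat call, used for both the script and the image descriptions."""
    response = openai.ChatCompletion.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": prompt}],
    )
    return response["choices"][0]["message"]["content"]


# 1. Video script, then one image description per scene (prompt wording is illustrative).
topic = input("Enter a prompt for the video: ")
script = ask_gpt("Write a short, scene-by-scene video script about: " + topic)
descriptions = ask_gpt(
    "For each scene in the following script, write a one-line image description:\n" + script
)
scene_descriptions = [line.strip() for line in descriptions.splitlines() if line.strip()]

# 2. Stable Diffusion v2.1 via Diffusers (fp16 on GPU if available).
pipe = StableDiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-2-1",
    torch_dtype=torch.float16 if torch.cuda.is_available() else torch.float32,
).to("cuda" if torch.cuda.is_available() else "cpu")

os.makedirs("videoAutoGen/images", exist_ok=True)
for i, description in enumerate(scene_descriptions):
    pipe(description).images[0].save(f"videoAutoGen/images/{i}.png")


# 3. ElevenLabs text-to-speech. videoAutoGen.py parses the script into per-scene narration
#    and writes one audio file per scene; the voice ID below is a placeholder, and ElevenLabs
#    returns MP3 audio by default, so any conversion to .wav is up to the caller.
def synthesize(text, out_path, voice_id="VOICE_ID"):
    response = requests.post(
        f"https://api.elevenlabs.io/v1/text-to-speech/{voice_id}",
        headers={"xi-api-key": ELEVENLABS_API_KEY},
        json={"text": text},
    )
    response.raise_for_status()
    with open(out_path, "wb") as f:
        f.write(response.content)
```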
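Second, the FFmpeg assembly step, sketched with ffmpeg-python under the assumption of one image (.png) and one audio file (.wav) per scene with matching indices; the exact codecs and options used in videoAutoGen.py may differ:

```python
import glob
import os

import ffmpeg


def scene_index(path):
    # Files are numbered from 0, e.g. 0.png, 1.png, ...
    return int(os.path.splitext(os.path.basename(path))[0])


os.makedirs("videoAutoGen/video", exist_ok=True)
image_files = sorted(glob.glob("videoAutoGen/images/*.png"), key=scene_index)
audio_files = sorted(glob.glob("videoAutoGen/audio/*.wav"), key=scene_index)

# Pair image i with audio i and render one video per scene.
scene_videos = []
for i, (image_path, audio_path) in enumerate(zip(image_files, audio_files)):
    out_path = f"videoAutoGen/video/{i}.mp4"
    image_in = ffmpeg.input(image_path, loop=1)  # loop the still image as a video stream
    audio_in = ffmpeg.input(audio_path)
    (
        ffmpeg
        .output(
            image_in, audio_in, out_path,
            vcodec="libx264", acodec="aac",
            pix_fmt="yuv420p",  # broad player compatibility
            shortest=None,      # stop when the (shorter) audio stream ends
        )
        .run(overwrite_output=True)
    )
    scene_videos.append(out_path)

# Concatenate the per-scene videos (video + audio streams) into the final output.
inputs = [ffmpeg.input(p) for p in scene_videos]
streams = []
for clip in inputs:
    streams.extend([clip.video, clip.audio])
joined = ffmpeg.concat(*streams, v=1, a=1).node
(
    ffmpeg
    .output(joined[0], joined[1], "videoAutoGen/combined.mp4")
    .run(overwrite_output=True)
)
```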
.
├── videoAutoGen.py
├── constants.py              # User's personal file to store API keys (note: do not upload or share!)
└── videoAutoGen              # Directory created to store output
    ├── audio                 # Individual audio files (.wav), numbered from 0
    ├── images                # Individual image files (.png), numbered from 0
    ├── script                # Contains script.txt
    ├── video                 # Individual per-scene video files (.mp4), numbered from 0
    └── combined.mp4          # Final video output