A comprehensive AI-driven system for automatically generating professional-quality YouTube videos from just a topic or keyword. This software leverages cutting-edge AI technologies to handle the entire video production workflow: script writing, voiceover generation, media selection, video editing, and final rendering.
- Script Generation: Creates engaging, structured scripts using OpenAI GPT-4 or Claude
- AI Voiceovers: Converts scripts to natural-sounding speech using ElevenLabs, Google TTS, or Amazon Polly
- Media Selection: Automatically sources relevant visuals from Pexels, Pixabay, and Unsplash
- AI Image Generation: Creates custom visuals using DALL-E or Stable Diffusion when needed
- Automated Video Editing: Handles scene transitions, Ken Burns effects, and timing
- Audio Processing: Adds background music, enhances voice quality, and normalizes audio
- Subtitle Generation: Creates synchronized captions for better accessibility
- End-to-end Automation: From script to final YouTube-ready video with minimal human input
The system is built with a modular architecture that separates concerns and allows for easy extension:
ai_video_generator/
├── core/ # Core pipeline components
│ ├── pipeline.py # Main orchestration
│ ├── script_generator.py
│ ├── voice_generator.py
│ ├── media_selector.py
│ ├── video_editor.py
│ ├── subtitle_generator.py
│ └── audio_processor.py
│
├── models/ # AI model interfaces
│ ├── llm_wrapper.py # Language models (GPT-4, Claude)
│ ├── tts_wrapper.py # Text-to-speech models
│ └── image_generator.py # Image generation (DALL-E, SD)
│
├── data_providers/ # External API integrations
│ ├── stock_video_api.py # Pexels, Pixabay
│ ├── stock_image_api.py # Unsplash, etc.
│ └── music_library_api.py
│
├── utils/ # Utility functions
├── ui/ # User interfaces
│ ├── cli/ # Command-line interface
│ └── web/ # Web interface
│
└── config/ # Configuration files
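The modules under data_providers/ are meant to be swappable, so adding a new stock source should only require one new file. The snippet below is a minimal sketch of what such an extension point could look like; the MediaAsset and StockMediaProvider names are illustrative, not the repo's actual interfaces.

```python
# Illustrative extension sketch for data_providers/; the real base interface
# in this repo may use different names and fields.
from dataclasses import dataclass
from typing import List, Protocol


@dataclass
class MediaAsset:
    url: str          # direct download link for the clip or image
    duration: float   # length in seconds (0.0 for still images)
    source: str       # provider name, e.g. "my_provider"


class StockMediaProvider(Protocol):
    def search(self, query: str, limit: int = 5) -> List[MediaAsset]:
        """Return up to `limit` assets matching the query."""
        ...


class MyStockProvider:
    """Skeleton for a new provider module dropped into data_providers/."""

    def __init__(self, api_key: str):
        self.api_key = api_key

    def search(self, query: str, limit: int = 5) -> List[MediaAsset]:
        # Call the provider's HTTP API here and map each result to a MediaAsset.
        return []
```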
- Python 3.8+
- FFmpeg installed on your system
- API keys for services (setup instructions below)
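Before installing, it can help to confirm that FFmpeg is actually discoverable on your PATH; the standalone check below is not part of the repository, just a convenience.

```python
# Standalone sanity check that FFmpeg is installed and on PATH.
import shutil
import subprocess

ffmpeg = shutil.which("ffmpeg")
if ffmpeg is None:
    raise SystemExit("FFmpeg not found on PATH - install it before running the pipeline.")
# Print the first line of `ffmpeg -version` as confirmation.
print(subprocess.run([ffmpeg, "-version"], capture_output=True, text=True).stdout.splitlines()[0])
```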
- Clone this repository:
  git clone https://github.com/connorodea/ai-video-generator.git
  cd ai-video-generator
- Create and activate a virtual environment:
  python -m venv venv
  source venv/bin/activate  # On Windows: venv\Scripts\activate
- Install dependencies:
  pip install -r requirements.txt
- Set up configuration:
  # Edit config/api_keys.json with your API keys
  # Edit config/default_settings.json for customization (optional)
The system requires API keys for various services. You can set these up in the config/api_keys.json file or through the web UI settings page:
- OpenAI API: For script generation (GPT-4) and image generation (DALL-E)
- ElevenLabs API: For realistic voiceovers
- Pexels API: For stock videos and images
- Pixabay API: For additional stock media
- Unsplash API: For high-quality stock images
All APIs offer free tiers that are suitable for testing the system.
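The exact schema of config/api_keys.json depends on how you fill it in; as a rough illustration (the key names below are assumptions, not the repo's canonical schema), you could load the file and flag anything missing like this:

```python
# Illustrative sketch: load config/api_keys.json and warn about unset keys.
# The key names are assumptions, not necessarily the repo's exact schema.
import json
from pathlib import Path

EXPECTED_KEYS = ["openai", "elevenlabs", "pexels", "pixabay", "unsplash"]

keys = json.loads(Path("config/api_keys.json").read_text())
missing = [name for name in EXPECTED_KEYS if not keys.get(name)]
if missing:
    print(f"Warning: no API key configured for: {', '.join(missing)}")
```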
Create a video with basic parameters:
python main.py create "The Future of Artificial Intelligence" --type educational --duration 5 --voice adam
List all your projects:
python main.py list
Get details for a specific project:
python main.py details <project_id>
Resume a failed or incomplete project:
python main.py resume <project_path> --start-from media_selection
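Because everything runs through main.py, batch generation is easy to script. A small sketch using only the flags shown above:

```python
# Sketch: queue several videos by shelling out to the CLI commands above.
import subprocess

topics = [
    "The Future of Artificial Intelligence",
    "How Solar Panels Work",
]
for topic in topics:
    subprocess.run(
        ["python", "main.py", "create", topic, "--type", "educational", "--duration", "5"],
        check=True,
    )
```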
Start the web server:
cd ui/web
python app.py
Then open your browser and navigate to http://localhost:5000
The web interface provides:
- Project creation with advanced options
- Real-time progress tracking
- Project management dashboard
- Settings configuration
- Video preview and download
Here's what you can expect from the generated videos:
- Educational Content: Clear, structured explanations with relevant visuals, optimized for learning
- Entertainment Videos: Engaging, dynamic content with smooth transitions and pacing
- Marketing Material: Professional promotional videos with consistent branding
The system is highly configurable through the config/default_settings.json file:
- Video quality: Resolution, frame rate, and bitrate
- Ken Burns settings: Animation speed and zoom levels
- Audio processing: Music volume, voice enhancement
- Visual styles: Color grading, overlays, transitions
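If you prefer to adjust settings programmatically rather than by hand, something like the sketch below works; the key names here are illustrative, so check the shipped config/default_settings.json for the real schema.

```python
# Sketch: tweak config/default_settings.json before a run.
# Key names are illustrative; verify them against the shipped file.
import json
from pathlib import Path

settings_path = Path("config/default_settings.json")
settings = json.loads(settings_path.read_text())

settings["video_quality"] = {"resolution": "1920x1080", "fps": 30}  # assumed keys
settings["audio"] = {"music_volume": 0.15}                          # assumed keys

settings_path.write_text(json.dumps(settings, indent=2))
```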
1. The system starts by generating a script based on your topic
2. It converts the script to audio using the selected voice
3. For each segment, it finds relevant media from stock providers
4. It generates custom images for concepts that lack stock media
5. The video editor assembles everything with transitions and effects
6. Audio processing enhances sound quality and adds background music
7. The final video is rendered in a YouTube-ready format
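As a toy illustration of how these stages chain together (the real classes in core/ are far richer; every name here is a stand-in):

```python
# Toy sketch of the stage chaining; names are stand-ins, not the actual core/ API.
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    text: str
    media_path: str = ""


def generate_script(topic: str) -> List[Segment]:
    # In the real system an LLM writes the script; here we stub two segments.
    return [Segment(f"Intro to {topic}"), Segment(f"Key ideas behind {topic}")]


def pick_media(segment: Segment) -> Segment:
    # The real selector queries stock providers or falls back to image generation.
    segment.media_path = f"stock/{segment.text[:20].replace(' ', '_')}.mp4"
    return segment


def assemble(segments: List[Segment], narration: str) -> str:
    # The real editor hands scenes and audio to FFmpeg; we just report the plan.
    return f"{len(segments)} scenes + narration '{narration}' -> output.mp4"


topic = "The Future of Artificial Intelligence"
segments = [pick_media(s) for s in generate_script(topic)]
print(assemble(segments, narration=f"voiceover for {topic}"))
```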
Contributions are welcome! Please check out our contribution guidelines for details.
This project is licensed under the MIT License - see the LICENSE file for details.
- OpenAI, Anthropic, and ElevenLabs for their powerful AI APIs
- Pexels, Pixabay, and Unsplash for their stock media APIs
- The FFmpeg team for their incredible video processing library
For questions or feedback, please open an issue or contact the maintainers directly.
For consulting or contracting, please contact us and we can discuss the details of your project.
Note: This software is intended for creating legitimate content. Please respect copyright and terms of service for all integrated APIs.