Quick Start Guide
Get OpenTranscribe up and running in less than 5 minutes with our one-line installer.
One-Line Installation
Run this single command on any platform (Linux, macOS, Windows WSL2):
curl -fsSL https://raw.githubusercontent.com/davidamacey/OpenTranscribe/master/setup-opentranscribe.sh | bash
The installer will: ✅ Detect your hardware automatically (NVIDIA GPU, Apple Silicon, CPU) ✅ Configure optimal settings for your platform ✅ Download Docker images from Docker Hub ✅ Generate secure configuration ✅ Set up management scripts
Prerequisites
Before running the installer, ensure you have:
- Docker & Docker Compose installed and running
- Internet connection for downloading images and models
- 8GB+ RAM (16GB+ recommended)
For speaker diarization to work, you'll need a free HuggingFace token. The installer will prompt you for it. See HuggingFace Setup for details.
Installation Steps
Step 1: Run the Installer
curl -fsSL https://raw.githubusercontent.com/davidamacey/OpenTranscribe/master/setup-opentranscribe.sh | bash
Step 2: Follow the Prompts
The installer will ask for:
-
HuggingFace Token (optional but recommended)
- Used for speaker diarization models
- Get a free token at huggingface.co/settings/tokens
-
Whisper Model Size (default: auto-detected based on hardware)
large-v2- Best accuracy (NVIDIA GPU recommended)medium- Good balance (8GB+ GPU or Apple Silicon)base- Fast (CPU-only systems)
Step 3: Start OpenTranscribe
cd opentranscribe
./opentranscribe.sh start
Step 4: Access the Application
Open your browser and navigate to:
- Web Interface: http://localhost:5173
- API Documentation: http://localhost:8080/docs
- Task Monitor (Flower): http://localhost:5555/flower
First Transcription
1. Create an Account
When you first access OpenTranscribe, you'll see the registration page:
- Enter your email and password
- Click "Sign Up"
- You're automatically logged in
2. Upload a Media File
- Click the "Upload Files" button in the navbar
- Drag and drop a file or click to browse
- Supported formats: MP3, WAV, MP4, MOV, etc.
- Files up to 4GB are supported
3. Monitor Processing
Watch the progress in real-time:
- Upload progress shows in the floating upload manager
- Processing stages display 13 detailed steps
- Notifications appear when transcription completes
4. View Your Transcript
Once processing completes:
- Click on the file in your library
- View the interactive transcript with speaker labels
- Click on any word to jump to that moment in the audio
- Use the waveform visualization for precise navigation
What's Next?
Now that you have OpenTranscribe running, explore these features:
Configure AI Features
Set up LLM integration for AI-powered summarization:
- Go to User Settings → LLM Configuration
- Choose a provider (OpenAI, Claude, vLLM, Ollama)
- Enter your API key
- Test the connection
See LLM Integration for details.
Manage Speakers
OpenTranscribe automatically detects speakers, but you can improve accuracy:
- Edit speaker names in any transcript
- Create global speaker profiles
- Let AI suggest speaker identities across videos
See Speaker Management for more.
Organize with Collections
Group related media files:
- Create collections for projects, topics, or events
- Add files to multiple collections
- Filter your library by collection
See Collections for details.
Advanced Search
Find content across all your transcriptions:
- Keyword search - Find exact words or phrases
- Semantic search - Find similar concepts
- Filters - By speaker, date, duration, tags
- Speaker analytics - See who speaks most across your library
See Search & Filters for tips.
Common Commands
Management Commands
# Start OpenTranscribe
./opentranscribe.sh start
# Stop all services
./opentranscribe.sh stop
# View logs
./opentranscribe.sh logs
# Check status
./opentranscribe.sh status
# Restart services
./opentranscribe.sh restart
# Access database
./opentranscribe.sh shell postgres
Updating OpenTranscribe
# Pull latest images
docker compose pull
# Restart services
./opentranscribe.sh restart
Troubleshooting
Services Won't Start
# Check Docker is running
docker ps
# Check logs for errors
./opentranscribe.sh logs backend
GPU Not Detected
# Check GPU availability
nvidia-smi
# Test GPU in Docker
docker run --rm --gpus all nvidia/cuda:11.8-base-ubuntu22.04 nvidia-smi
Permission Errors
# Fix model cache permissions
./scripts/fix-model-permissions.sh
Slow Transcription
- Ensure GPU acceleration is enabled (
USE_GPU=truein.env) - Reduce model size if running out of memory
- Check GPU memory usage with
nvidia-smi
See Troubleshooting Guide for more solutions.
Getting Help
If you encounter issues:
- Check the FAQ
- Search GitHub Issues
- Ask in GitHub Discussions
- Read the Installation Guide for detailed setup
Next Steps
- User Guide: Learn about all features
- Configuration: Customize environment variables
- Development: Learn to contribute
Welcome to OpenTranscribe! 🎉