Docker Compose Installation

This guide covers installing OpenTranscribe using Docker Compose - the recommended method for all deployment scenarios.

Quick Installation

The fastest way to install OpenTranscribe is using our one-line installer:

curl -fsSL https://raw.githubusercontent.com/davidamacey/OpenTranscribe/master/setup-opentranscribe.sh | bash

Skip to Manual Installation if you prefer more control.
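
If you prefer to review the installer before it runs, the same script can be downloaded, inspected, and executed in separate steps:

# Download, inspect, then run the installer
curl -fsSL https://raw.githubusercontent.com/davidamacey/OpenTranscribe/master/setup-opentranscribe.sh -o setup-opentranscribe.sh
less setup-opentranscribe.sh
bash setup-opentranscribe.sh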

Prerequisites

Required Software

Docker Engine (v20.10+)

# Check Docker version
docker --version

# Expected output
Docker version 20.10.x or higher

Docker Compose (v2.0+)

# Check Docker Compose version
docker compose version

# Expected output
Docker Compose version v2.x.x or higher

Important

Use docker compose (with space), not docker-compose (with hyphen). OpenTranscribe requires Docker Compose V2.

Installing Docker: if Docker is not installed yet, follow the official instructions at https://docs.docker.com/engine/install/ for your platform, or use the convenience script below.
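
On Linux, Docker's official convenience script is a quick option (distribution package managers or Docker Desktop work just as well):

# Install Docker Engine via the convenience script (Linux)
curl -fsSL https://get.docker.com -o get-docker.sh
sudo sh get-docker.sh

# Optional: run docker without sudo (log out and back in afterwards)
sudo usermod -aG docker $USER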

System Requirements

See Hardware Requirements for detailed specs.

Minimum:

  • 8GB RAM
  • 4 CPU cores
  • 50GB disk space
  • Internet connection (for initial setup)

Recommended:

  • 16GB+ RAM
  • 8+ CPU cores
  • 100GB+ SSD storage
  • NVIDIA GPU with 8GB+ VRAM
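
To compare your host against these figures on Linux, a few standard commands are enough:

# Quick resource check (Linux)
nproc      # CPU cores
free -h    # Total RAM
df -h .    # Free disk space on the current filesystem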

Optional: NVIDIA GPU

For GPU acceleration (recommended):

  1. NVIDIA GPU with CUDA support (GTX 1060 or better)
  2. NVIDIA Driver (version 470.x or higher)
  3. NVIDIA Container Toolkit installed

See GPU Setup Guide for installation instructions.
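
Before starting, you can confirm the driver and the NVIDIA Container Toolkit are in place; the second check assumes a default toolkit configuration that registers the "nvidia" runtime:

# Driver check - should list your GPU and driver version
nvidia-smi

# Container Toolkit check - "nvidia" should appear among the registered runtimes
docker info | grep -i runtimes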

Manual Installation

If you prefer manual installation or need more control:

Step 1: Clone Repository (Development)

For development from source:

# Clone the repository
git clone https://github.com/davidamacey/OpenTranscribe.git
cd OpenTranscribe

# Make utility script executable
chmod +x opentr.sh

Step 2: Download Production Files (Deployment)

For production deployment with Docker Hub images:

# Create directory
mkdir opentranscribe && cd opentranscribe

# Download production docker-compose file
curl -O https://raw.githubusercontent.com/davidamacey/OpenTranscribe/master/docker-compose.prod.yml

# Rename for convenience
mv docker-compose.prod.yml docker-compose.yml

# Download management script
curl -O https://raw.githubusercontent.com/davidamacey/OpenTranscribe/master/opentranscribe.sh
chmod +x opentranscribe.sh

# Download environment template
curl -O https://raw.githubusercontent.com/davidamacey/OpenTranscribe/master/.env.example

Step 3: Configure Environment

# Copy environment template
cp .env.example .env

# Edit configuration
nano .env # or use your preferred editor

Key variables to configure:

# Security (REQUIRED - generate strong secrets)
SECRET_KEY=your-super-secret-key-here
JWT_SECRET_KEY=your-jwt-secret-key

# Generate secrets with
openssl rand -hex 32

# HuggingFace Token (REQUIRED for speaker diarization)
HUGGINGFACE_TOKEN=hf_xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

# Database (use strong passwords in production)
POSTGRES_PASSWORD=strong-password-here

# Storage
MINIO_ROOT_USER=minioadmin
MINIO_ROOT_PASSWORD=strong-minio-password

# GPU Settings (auto-detected by installer)
USE_GPU=true # false for CPU-only
CUDA_VISIBLE_DEVICES=0 # GPU device ID
WHISPER_MODEL=large-v2 # large-v2, medium, base
COMPUTE_TYPE=float16 # float16 for GPU, int8 for CPU
BATCH_SIZE=16 # 16 for GPU, 1-4 for CPU

# Model Cache (where AI models are stored)
MODEL_CACHE_DIR=./models

See Environment Variables for all options.
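
As a sketch (assuming GNU sed and the variable names shown above), both secrets can be generated and written into .env in one pass:

# Generate random secrets and write them into .env (back up .env first if unsure)
SECRET_KEY_VALUE=$(openssl rand -hex 32)
JWT_SECRET_VALUE=$(openssl rand -hex 32)
sed -i "s|^SECRET_KEY=.*|SECRET_KEY=${SECRET_KEY_VALUE}|" .env
sed -i "s|^JWT_SECRET_KEY=.*|JWT_SECRET_KEY=${JWT_SECRET_VALUE}|" .env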

Step 4: Set Up HuggingFace

This step is CRITICAL for speaker diarization.

  1. Get a token at huggingface.co/settings/tokens
  2. Accept the user agreements for the gated speaker-diarization models (the exact models are listed in the HuggingFace Setup guide)
  3. Add the token to .env: HUGGINGFACE_TOKEN=hf_xxxxxxxxxxxx

See HuggingFace Setup for detailed instructions.
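
To confirm the token is valid before starting the stack, you can query the public HuggingFace whoami endpoint with the value from .env:

# A valid token returns your account details as JSON
HF_TOKEN=$(grep '^HUGGINGFACE_TOKEN=' .env | cut -d= -f2)
curl -s -H "Authorization: Bearer ${HF_TOKEN}" https://huggingface.co/api/whoami-v2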

Step 5: Pre-download Models (Optional)

Pre-download models to avoid first-use delays:

# Using Docker to download models
docker run --rm \
  -v ./models:/models \
  -e HUGGINGFACE_TOKEN=your_token_here \
  davidamacey/opentranscribe-backend:latest \
  python -c "from app.tasks.transcription import download_models; download_models()"

This downloads ~2.5GB of AI models.

Step 6: Start OpenTranscribe

For Development (source code):

./opentr.sh start dev

For Production (Docker Hub images):

./opentranscribe.sh start
# or
docker compose up -d

Step 7: Verify Installation

Check that all services are running:

# Check service status
docker compose ps

# Expected output - all services should show "Up"
NAME                        STATUS
opentranscribe-backend      Up (healthy)
opentranscribe-celery       Up (healthy)
opentranscribe-frontend     Up
opentranscribe-postgres     Up (healthy)
opentranscribe-redis        Up (healthy)
opentranscribe-minio        Up (healthy)
opentranscribe-opensearch   Up (healthy)
opentranscribe-flower       Up

Check logs for any errors:

# View all logs
docker compose logs

# View specific service logs
docker compose logs backend
docker compose logs celery-worker

Step 8: Access the Application

Open your browser and navigate to the web UI at http://localhost:5173. The backend API is served on http://localhost:8080 and the Flower task dashboard on http://localhost:5555 (ports as listed under Service Architecture below).
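
A quick reachability check from the command line (exact status codes depend on the route):

# The frontend should answer on its port
curl -I http://localhost:5173

# The backend API should answer on its port
curl -I http://localhost:8080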

Docker Compose Structure

OpenTranscribe uses a base + override pattern:

File Structure

docker-compose.yml           # Base configuration (all environments)
docker-compose.override.yml  # Development overrides (auto-loaded)
docker-compose.prod.yml      # Production overrides
docker-compose.offline.yml   # Offline/airgapped overrides
docker-compose.gpu-scale.yml # Multi-GPU scaling overrides

Usage

Development (with hot reload):

# Automatically loads docker-compose.yml + docker-compose.override.yml
docker compose up

Production (with Docker Hub images):

docker compose -f docker-compose.yml -f docker-compose.prod.yml up -d

Offline (airgapped environment):

docker compose -f docker-compose.yml -f docker-compose.offline.yml up -d

Multi-GPU (parallel processing):

docker compose -f docker-compose.yml -f docker-compose.gpu-scale.yml up -d
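
To avoid repeating the -f flags, Docker Compose also reads the COMPOSE_FILE environment variable (a colon-separated list on Linux/macOS):

# Pin the production file combination for the current shell session
export COMPOSE_FILE=docker-compose.yml:docker-compose.prod.yml

# Subsequent commands pick up both files automatically
docker compose up -d
docker compose ps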

Service Architecture

OpenTranscribe consists of 8 Docker services:

Core Services

backend (FastAPI)

  • REST API server
  • WebSocket support
  • User authentication
  • File management
  • Port: 8080

frontend (Svelte + Vite)

  • Svelte-based UI
  • Progressive Web App
  • Port: 5173

postgres (PostgreSQL 15)

  • Main database
  • User data, transcriptions, speakers
  • Port: 5432

redis (Redis 7)

  • Celery message broker
  • Task queue coordination
  • Port: 6379

Storage Services

minio (MinIO)

  • S3-compatible object storage
  • Media files, audio, waveforms
  • Port: 9000 (API), 9091 (Console)

opensearch (OpenSearch 3.3.1)

  • Full-text search
  • Vector search for semantic search
  • Port: 9200

Worker Services

celery-worker (Celery)

  • Background AI processing
  • Transcription, diarization, summarization
  • Multi-queue architecture (GPU, CPU, NLP, Download)

flower (Celery monitoring)

  • Task monitoring dashboard
  • Worker status
  • Port: 5555
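
Once the stack is up, individual services can be spot-checked against the ports listed above (service names as shown by docker compose ps):

# OpenSearch cluster info (add credentials/HTTPS if the security plugin is enabled)
curl -s http://localhost:9200

# Redis should answer PONG
docker compose exec redis redis-cli ping

# PostgreSQL should report that it is accepting connections
docker compose exec postgres pg_isready -U postgres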

Management Commands

Starting and Stopping

# Start all services
docker compose up -d

# Stop all services
docker compose down

# Stop and remove volumes (deletes data!)
docker compose down -v

# Restart specific service
docker compose restart backend

Viewing Logs

# View all logs
docker compose logs

# Follow logs in real-time
docker compose logs -f

# View specific service logs
docker compose logs backend
docker compose logs celery-worker

# View last 100 lines
docker compose logs --tail=100

Accessing Containers

# Execute command in container
docker compose exec backend bash

# Access database
docker compose exec postgres psql -U postgres -d opentranscribe

# Check worker status
docker compose exec celery-worker celery -A app.tasks.celery_app inspect active

Updating

# Pull latest images (for production deployment)
docker compose pull

# Rebuild images (for development)
docker compose build

# Restart with new images
docker compose up -d

Advanced Configuration

Custom Ports

Edit docker-compose.yml or use environment variables:

services:
  frontend:
    ports:
      - "${FRONTEND_PORT:-5173}:5173"
  backend:
    ports:
      - "${BACKEND_PORT:-8080}:8080"

Resource Limits

services:
  celery-worker:
    deploy:
      resources:
        limits:
          cpus: '4'
          memory: 16G
        reservations:
          devices:
            - driver: nvidia
              count: 1
              capabilities: [gpu]
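
After editing the limits, recreate the worker and confirm they were applied; the container name below is taken from the docker compose ps output shown earlier:

# Recreate the worker so the new limits take effect
docker compose up -d --force-recreate celery-worker

# Memory limit (bytes) and CPU quota (NanoCpus) as seen by the Docker engine
docker inspect --format '{{.HostConfig.Memory}} {{.HostConfig.NanoCpus}}' opentranscribe-celery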

Persistent Data

Data is persisted in Docker volumes:

# List volumes
docker volume ls | grep opentranscribe

# Backup volume
docker run --rm -v opentranscribe_postgres-data:/data -v $(pwd):/backup \
  ubuntu tar czf /backup/postgres-backup.tar.gz /data

# Restore volume
docker run --rm -v opentranscribe_postgres-data:/data -v $(pwd):/backup \
  ubuntu tar xzf /backup/postgres-backup.tar.gz -C /
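
For a consistent database backup, it is safer to stop the containers first; a sketch of the full sequence using the same volume name:

# Stop the stack, back up the database volume, then start again
docker compose stop
docker run --rm -v opentranscribe_postgres-data:/data -v $(pwd):/backup \
  ubuntu tar czf /backup/postgres-backup.tar.gz /data
docker compose start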

Troubleshooting

Services Won't Start

# Check Docker is running
systemctl status docker

# Check Docker Compose version
docker compose version

# Check for port conflicts
netstat -tulpn | grep -E '5173|8080|5432|6379|9000|9200'

# Check logs
docker compose logs

Out of Memory

# Check Docker resources
docker stats

# Reduce model size in .env
WHISPER_MODEL=medium # or base
BATCH_SIZE=8 # or lower

# Increase Docker memory limit (Docker Desktop)
# Docker Desktop → Settings → Resources → Memory

GPU Not Working

# Check NVIDIA driver
nvidia-smi

# Check NVIDIA Container Toolkit
docker run --rm --gpus all nvidia/cuda:11.8.0-base-ubuntu22.04 nvidia-smi

# Check GPU is visible to container
docker compose exec celery-worker nvidia-smi

Permission Errors

# Fix model cache permissions
./scripts/fix-model-permissions.sh

# Or manually
sudo chown -R 1000:1000 ./models

Next Steps

Getting Help