Quick Start Guide

Get OpenTranscribe up and running in less than 5 minutes with our one-line installer.

One-Line Installation

Run this single command on any platform (Linux, macOS, Windows WSL2):

curl -fsSL https://raw.githubusercontent.com/davidamacey/OpenTranscribe/master/setup-opentranscribe.sh | bash

The installer will: ✅ Detect your hardware automatically (NVIDIA GPU, Apple Silicon, CPU) ✅ Configure optimal settings for your platform ✅ Download Docker images from Docker Hub ✅ Generate secure configuration ✅ Set up management scripts

Prerequisites

Before running the installer, ensure you have:

Docker & Docker Compose installed and running
Internet connection for downloading images and models
8GB+ RAM (16GB+ recommended)

HuggingFace Token Required

For speaker diarization to work, you'll need a free HuggingFace token. The installer will prompt you for it. See HuggingFace Setup for details.

Installation Steps

Step 1: Run the Installer

curl -fsSL https://raw.githubusercontent.com/davidamacey/OpenTranscribe/master/setup-opentranscribe.sh | bash

Step 2: Follow the Prompts

The installer will ask for:

HuggingFace Token (optional but recommended)
- Used for speaker diarization models
- Get a free token at huggingface.co/settings/tokens
Whisper Model Size (default: auto-detected based on hardware)
- large-v2 - Best accuracy (NVIDIA GPU recommended)
- medium - Good balance (8GB+ GPU or Apple Silicon)
- base - Fast (CPU-only systems)

Step 3: Start OpenTranscribe

cd opentranscribe
./opentranscribe.sh start

Step 4: Access the Application

Open your browser and navigate to:

Web Interface: http://localhost:5173
API Documentation: http://localhost:8080/docs
Task Monitor (Flower): http://localhost:5555/flower

First Transcription

1. Create an Account

When you first access OpenTranscribe, you'll see the registration page:

Enter your email and password
Click "Sign Up"
You're automatically logged in

2. Upload a Media File

Click the "Upload Files" button in the navbar
Drag and drop a file or click to browse
Supported formats: MP3, WAV, MP4, MOV, etc.
Files up to 4GB are supported

3. Monitor Processing

Watch the progress in real-time:

Upload progress shows in the floating upload manager
Processing stages display 13 detailed steps
Notifications appear when transcription completes

4. View Your Transcript

Once processing completes:

Click on the file in your library
View the interactive transcript with speaker labels
Click on any word to jump to that moment in the audio
Use the waveform visualization for precise navigation

What's Next?

Now that you have OpenTranscribe running, explore these features:

Configure AI Features

Set up LLM integration for AI-powered summarization:

Go to User Settings → LLM Configuration
Choose a provider (OpenAI, Claude, vLLM, Ollama)
Enter your API key
Test the connection

See LLM Integration for details.

Enterprise Authentication (Optional)

For enterprise deployments, OpenTranscribe supports:

LDAP/Active Directory - Integrate with existing AD
Keycloak/OIDC - Single Sign-On
PKI/X.509 - Certificate-based authentication
MFA - TOTP-based multi-factor authentication

See Authentication Overview for setup guides.

Manage Speakers

OpenTranscribe automatically detects speakers, but you can improve accuracy:

Edit speaker names in any transcript
Create global speaker profiles
Let AI suggest speaker identities across videos

See Speaker Management for more.

Organize with Collections

Group related media files:

Create collections for projects, topics, or events
Add files to multiple collections
Filter your library by collection

See Collections for details.

Advanced Search

Find content across all your transcriptions:

Keyword search - Find exact words or phrases
Semantic search - Find similar concepts
Filters - By speaker, date, duration, tags
Speaker analytics - See who speaks most across your library

See Search & Filters for tips.

Common Commands

Management Commands

# Start OpenTranscribe
./opentranscribe.sh start

# Stop all services
./opentranscribe.sh stop

# View logs
./opentranscribe.sh logs

# Check status
./opentranscribe.sh status

# Restart services
./opentranscribe.sh restart

# Access database
./opentranscribe.sh shell postgres

Updating OpenTranscribe

# Check current version and available updates
./opentranscribe.sh version

# Update containers only (quick)
./opentranscribe.sh update

# Full update including config files (recommended)
./opentranscribe.sh update-full

Troubleshooting

Services Won't Start

# Check Docker is running
docker ps

# Check logs for errors
./opentranscribe.sh logs backend

GPU Not Detected

# Check GPU availability
nvidia-smi

# Test GPU in Docker
docker run --rm --gpus all nvidia/cuda:11.8-base-ubuntu22.04 nvidia-smi

Permission Errors

# Fix model cache permissions
./scripts/fix-model-permissions.sh

Slow Transcription

Ensure GPU acceleration is enabled (USE_GPU=true in .env)
Reduce model size if running out of memory
Check GPU memory usage with nvidia-smi

See Troubleshooting Guide for more solutions.

Getting Help

If you encounter issues:

Check the FAQ
Search GitHub Issues
Ask in GitHub Discussions
Read the Installation Guide for detailed setup

Next Steps

User Guide: Learn about all features
Configuration: Customize environment variables
Development: Learn to contribute

Welcome to OpenTranscribe! 🎉

One-Line Installation​

Prerequisites​

Installation Steps​

Step 1: Run the Installer​

Step 2: Follow the Prompts​

Step 3: Start OpenTranscribe​

Step 4: Access the Application​

First Transcription​

1. Create an Account​

2. Upload a Media File​

3. Monitor Processing​

4. View Your Transcript​

What's Next?​

Configure AI Features​

Enterprise Authentication (Optional)​

Manage Speakers​

Organize with Collections​

Advanced Search​

Common Commands​

Management Commands​

Updating OpenTranscribe​

Troubleshooting​

Services Won't Start​

GPU Not Detected​

Permission Errors​

Slow Transcription​

Getting Help​

Next Steps​