Troubleshooting

This guide covers common issues and their solutions when installing and running OpenTranscribe.

Installation Issues

Permission Errors (Model Cache)

Symptoms:

Permission denied: '/home/appuser/.cache/huggingface/hub'
Permission denied: '/home/appuser/.cache/yt-dlp'
YouTube downloads fail with permission errors
Models fail to download or save

Cause: Docker creates model cache directories with root ownership, but containers run as non-root user (UID 1000) for security.

Solution:

# Option 1: Automated fix (recommended)
cd opentranscribe
./scripts/fix-model-permissions.sh

# Option 2: Manual fix using Docker
docker run --rm -v ./models:/models busybox chown -R 1000:1000 /models

# Option 3: Manual fix using sudo
sudo chown -R 1000:1000 ./models
sudo chmod -R 755 ./models

Verification:

# Check directory ownership (should show UID 1000)
ls -la models/

# Test write permissions
touch models/huggingface/test.txt && rm models/huggingface/test.txt

Prevention

The latest setup script automatically creates directories with correct permissions. For new installations, use the one-line installer:

curl -fsSL https://raw.githubusercontent.com/davidamacey/OpenTranscribe/master/setup-opentranscribe.sh | bash

Docker Not Found

Symptoms:

docker: command not found
Installation script fails

Solution:

# Ubuntu/Debian
curl -fsSL https://get.docker.com | sh
sudo usermod -aG docker $USER
newgrp docker

# Verify installation
docker --version
docker compose version

Port Already in Use

Symptoms:

Error: bind: address already in use
Services fail to start

Solution:

# Find process using port 5173 (example)
sudo lsof -i :5173

# Kill the process
sudo kill -9 <PID>

# Or change port in .env
FRONTEND_PORT=5274  # Different port

GPU Issues

GPU Not Detected

Symptoms:

Transcription uses CPU (very slow)
nvidia-smi fails

Diagnosis:

# 1. Check NVIDIA driver
nvidia-smi

# 2. Check Docker GPU access
docker run --rm --gpus all nvidia/cuda:12.2.0-base-ubuntu22.04 nvidia-smi

# 3. Check OpenTranscribe logs
./opentr.sh logs celery-worker | grep -i cuda

Solutions:

If nvidia-smi fails:

# Install/reinstall NVIDIA drivers
sudo ubuntu-drivers autoinstall
sudo reboot

If Docker GPU test fails:

# Install NVIDIA Container Toolkit
sudo apt update
sudo apt install -y nvidia-container-toolkit
sudo nvidia-ctk runtime configure --runtime=docker
sudo systemctl restart docker

If logs show CPU mode:

# Enable GPU in .env
USE_GPU=true
TORCH_DEVICE=cuda
GPU_DEVICE_ID=0

# Restart services
./opentr.sh restart

See GPU Setup for complete installation guide.

CUDA Out of Memory

Symptoms:

RuntimeError: CUDA out of memory
Transcription fails partway through
Container crashes during processing

Solutions:

1. Reduce batch size:

# Edit .env
BATCH_SIZE=8  # or 4, 2, 1

2. Use smaller model:

# Edit .env
WHISPER_MODEL=medium  # instead of large-v2

3. Reduce concurrent workers:

# For multi-GPU scaling
GPU_SCALE_WORKERS=2  # instead of 4

4. Close other GPU applications:

# Check GPU memory usage
nvidia-smi

# Close unnecessary applications using GPU

Slow GPU Performance

Symptoms:

GPU utilization less than 50%
Transcription slower than expected
Not achieving 70x realtime speed

Diagnosis:

# Monitor GPU during transcription
watch -n 1 nvidia-smi

Solutions:

1. Check compute type:

# Edit .env
COMPUTE_TYPE=float16  # not float32

2. Increase batch size (if VRAM available):

# Edit .env
BATCH_SIZE=16  # or 32

3. Verify CUDA version:

nvidia-smi | grep "CUDA Version"
# Should be 11.8+

HuggingFace / Speaker Diarization Issues

Gated Repository Error

Symptoms:

Cannot access gated repository
'NoneType' object has no attribute 'eval'
Speaker diarization fails

Cause: Model agreements not accepted or token invalid.

Solution:

Accept BOTH model agreements:
- pyannote/segmentation-3.0
- pyannote/speaker-diarization-3.1

Verify token in .env:

grep HUGGINGFACE_TOKEN .env
# Should show: HUGGINGFACE_TOKEN=hf_xxxxx

Regenerate token if needed:
- Visit https://huggingface.co/settings/tokens
- Create new read token
- Update .env
- Restart: ./opentr.sh restart

See HuggingFace Setup for detailed guide.

Models Download on Every Restart

Symptoms:

Models re-download after restart
Long startup times
Wasted bandwidth

Cause: Model cache not persisting.

Solution:

# 1. Check MODEL_CACHE_DIR in .env
grep MODEL_CACHE_DIR .env

# 2. Verify directory exists and has correct permissions
ls -la models/
sudo chown -R 1000:1000 models/

# 3. Restart services
./opentr.sh restart

Transcription Issues

Transcription Stuck/Hanging

Symptoms:

Transcription shows "Processing..." for hours
Progress stops at specific percentage
No errors in logs

Solutions:

1. Check container status:

./opentr.sh status

2. Check worker logs:

./opentr.sh logs celery-worker

3. Restart worker:

docker restart celery-worker

4. Reset failed jobs:

# Access Flower dashboard
# http://localhost:5175/flower
# Find and revoke stuck tasks

5. Full restart:

./opentr.sh restart

Incorrect Transcription Results

Symptoms:

Poor accuracy
Wrong language detected
Gibberish output

Solutions:

1. Use larger model:

# Edit .env
WHISPER_MODEL=large-v2  # instead of small/medium

2. Check audio quality:

Ensure audio is clear
Remove background noise
Check audio format (WAV/FLAC best, MP3 okay)

3. Force language (if auto-detection fails):

# In UI: Select specific language instead of "Auto-detect"

4. Check model downloaded correctly:

ls -lh models/huggingface/hub/models--Systran--faster-whisper-large-v2/

Speaker Diarization Not Working

Symptoms:

All speakers shown as "Speaker 1"
No speaker separation
Diarization skipped

Checklist:

HuggingFace token configured
Both model agreements accepted
Speaker diarization enabled in settings
Audio has multiple speakers
MIN_SPEAKERS and MAX_SPEAKERS set correctly

Solutions:

# 1. Verify token
grep HUGGINGFACE_TOKEN .env

# 2. Check model download
ls -lh models/torch/pyannote/

# 3. Adjust speaker range for large meetings
MIN_SPEAKERS=1
MAX_SPEAKERS=30  # or 50+ for conferences

# 4. Check logs
./opentr.sh logs celery-worker | grep -i diarization

Database Issues

Database Connection Failed

Symptoms:

could not connect to server
Backend fails to start
500 errors in web UI

Solutions:

1. Check database status:

./opentr.sh status postgres

2. Check database logs:

./opentr.sh logs postgres

3. Verify credentials in .env:

grep POSTGRES_ .env

4. Test connection:

./opentr.sh shell postgres
psql -U postgres -l

5. Reset database (⚠️ deletes all data):

./opentr.sh reset dev

Database Schema Errors

Symptoms:

relation does not exist
column does not exist
Migration errors

Solution:

# For development
./opentr.sh reset dev

# For production (preserves data)
./opentr.sh shell backend
alembic upgrade head

Network/Upload Issues

YouTube Download Fails

Symptoms:

YouTube downloads timeout
"Video unavailable" errors
Stuck at "Downloading..."

Solutions:

1. Check network connectivity:

curl -I https://youtube.com

2. Update yt-dlp (auto-updated, but verify):

./opentr.sh logs celery-worker | grep yt-dlp

3. Check YouTube URL is accessible:

Try URL in browser
Check if video is public/unlisted (not private)
Verify no geographic restrictions

4. Check firewall/proxy settings

Upload Fails

Symptoms:

Files don't upload
Upload progress stuck at 0%
Network errors

Solutions:

1. Check file size (max 4GB):

ls -lh your-file.mp4

2. Check MinIO storage:

./opentr.sh logs minio
docker exec -it minio df -h

3. Check network:

# Test backend connectivity
curl http://localhost:5174/api/health

4. Clear browser cache and retry

Performance Issues

Slow Transcription (CPU Mode)

Expected: 0.5-1x realtime (slower than playback)

Solutions:

Enable GPU: See GPU Setup
Use smaller model: WHISPER_MODEL=small
Reduce batch size: BATCH_SIZE=1
Close other applications

High Memory Usage

Symptoms:

System becomes unresponsive
Out of memory errors
Containers killed

Solutions:

1. Reduce resource usage:

# Edit .env
WHISPER_MODEL=medium  # instead of large-v2
BATCH_SIZE=8  # instead of 16
COMPUTE_TYPE=int8  # instead of float16

2. Increase system swap:

# Check current swap
free -h

# Add swap file (8GB example)
sudo fallocate -l 8G /swapfile
sudo chmod 600 /swapfile
sudo mkswap /swapfile
sudo swapon /swapfile

3. Limit Docker memory:

# Add to docker-compose.yml
services:
  celery-worker:
    mem_limit: 8g

Service-Specific Issues

OpenSearch Won't Start

Symptoms:

max virtual memory areas vm.max_map_count [65530] is too low

Solution:

# Temporary fix
sudo sysctl -w vm.max_map_count=262144

# Permanent fix
echo "vm.max_map_count=262144" | sudo tee -a /etc/sysctl.conf
sudo sysctl -p

Celery Worker Not Processing Tasks

Symptoms:

Tasks queued but not processed
Flower shows no active workers

Solutions:

1. Check worker status:

./opentr.sh logs celery-worker

2. Restart worker:

docker restart celery-worker

3. Check Redis connection:

docker exec -it redis redis-cli ping
# Should return: PONG

Frontend Not Loading

Symptoms:

Blank page
404 errors
Connection refused

Solutions:

1. Check frontend container:

./opentr.sh status frontend
./opentr.sh logs frontend

2. Verify port:

curl http://localhost:5173

3. Clear browser cache

4. Restart frontend:

docker restart frontend

Diagnostic Commands

Health Check

# Check all services
./opentr.sh status

# Check specific service logs
./opentr.sh logs [backend|frontend|celery-worker|postgres|redis|minio|opensearch]

# Test backend API
curl http://localhost:5174/api/health

# Test database connection
./opentr.sh shell postgres psql -U postgres -c "SELECT 1"

# Test GPU
nvidia-smi
docker run --rm --gpus all nvidia/cuda:12.2.0-base-ubuntu22.04 nvidia-smi

Resource Monitoring

# Docker container stats
docker stats

# GPU monitoring
watch -n 1 nvidia-smi

# Disk usage
df -h

# Memory usage
free -h

# Check model cache size
du -sh models/

Getting Help

Still having issues?

📚 Documentation: Review all docs in this site
🐛 GitHub Issues: Report bugs
💬 Discussions: Ask questions
📊 Flower Dashboard: http://localhost:5175/flower for task debugging
📝 Logs: Include relevant logs when asking for help

Collecting Debug Information

When reporting issues, include:

# System information
uname -a
docker --version
docker compose version
nvidia-smi

# OpenTranscribe version
git log -1 --format="%H %s"

# Container status
./opentr.sh status

# Relevant logs (last 100 lines)
./opentr.sh logs celery-worker | tail -100

# Environment (redact secrets!)
cat .env | grep -v PASSWORD | grep -v SECRET | grep -v TOKEN

Next Steps

Hardware Requirements - Optimize your setup
GPU Setup - Configure GPU acceleration
Environment Variables - Fine-tune settings
FAQ - Frequently asked questions

Installation Issues​

Permission Errors (Model Cache)​

Docker Not Found​

Port Already in Use​

GPU Issues​

GPU Not Detected​

CUDA Out of Memory​

Slow GPU Performance​

HuggingFace / Speaker Diarization Issues​

Gated Repository Error​

Models Download on Every Restart​

Transcription Issues​

Transcription Stuck/Hanging​

Incorrect Transcription Results​

Speaker Diarization Not Working​

Database Issues​

Database Connection Failed​

Database Schema Errors​

Network/Upload Issues​

YouTube Download Fails​

Upload Fails​

Performance Issues​

Slow Transcription (CPU Mode)​

High Memory Usage​

Service-Specific Issues​

OpenSearch Won't Start​

Celery Worker Not Processing Tasks​

Frontend Not Loading​

Diagnostic Commands​

Health Check​

Resource Monitoring​

Getting Help​

Collecting Debug Information​

Next Steps​

Installation Issues

Permission Errors (Model Cache)

Docker Not Found

Port Already in Use

GPU Issues

GPU Not Detected

CUDA Out of Memory

Slow GPU Performance

HuggingFace / Speaker Diarization Issues

Gated Repository Error

Models Download on Every Restart

Transcription Issues

Transcription Stuck/Hanging

Incorrect Transcription Results

Speaker Diarization Not Working

Database Issues

Database Connection Failed

Database Schema Errors

Network/Upload Issues

YouTube Download Fails

Upload Fails

Performance Issues

Slow Transcription (CPU Mode)

High Memory Usage

Service-Specific Issues

OpenSearch Won't Start

Celery Worker Not Processing Tasks

Frontend Not Loading

Diagnostic Commands

Health Check

Resource Monitoring

Getting Help

Collecting Debug Information

Next Steps