Welcome to OpenTranscribe
OpenTranscribe is a powerful, self-hosted AI-powered transcription and media analysis platform that turns your audio and video files into searchable, analyzable text with advanced features like speaker identification, AI summarization, and cross-media intelligence.
What is OpenTranscribe?β
OpenTranscribe combines state-of-the-art AI models with a modern web interface to provide:
- High-accuracy transcription using WhisperX with word-level timestamps
- Automatic speaker identification with voice fingerprinting across videos
- AI-powered summarization with customizable prompts and LLM integration
- Full-text and semantic search powered by OpenSearch
- Privacy-first processing - everything runs locally on your infrastructure
Key Featuresβ
π§ Advanced Transcriptionβ
- WhisperX integration with faster-whisper backend
- Word-level timestamps with WAV2VEC2 alignment
- Multi-language support with automatic English translation
- 70x realtime speed on GPU (large-v2 model)
- Support for audio (MP3, WAV, FLAC, M4A) and video (MP4, MOV, AVI, MKV)
π₯ Smart Speaker Managementβ
- Automatic speaker diarization using PyAnnote.audio
- Cross-video speaker recognition with voice fingerprinting
- LLM-enhanced speaker identification
- Global speaker profiles that persist across transcriptions
- Confidence scoring and manual verification workflow
π€ AI-Powered Featuresβ
- LLM summarization with BLUF (Bottom Line Up Front) format
- Support for multiple LLM providers (OpenAI, Claude, vLLM, Ollama, OpenRouter)
- Custom AI prompts for different content types
- Intelligent section-by-section processing for unlimited transcript lengths
- Speaker analytics and interaction patterns
π Search & Discoveryβ
- Hybrid search combining keyword and semantic search
- 9.5x faster vector search with OpenSearch 3.3.1
- Advanced filtering by speaker, date, tags, duration
- Collections for organizing related media
- Interactive waveform visualization with click-to-seek
π Analytics & Insightsβ
- Speaker analytics (talk time, interruptions, pace)
- Meeting efficiency metrics
- Action item extraction
- Cross-video speaker tracking
Why OpenTranscribe?β
Open Source & Self-Hostedβ
- Full control over your data - nothing leaves your infrastructure
- AGPL-3.0 License - open source with network copyleft protection
- No subscription fees - one-time setup, unlimited use
- Privacy-first - ideal for sensitive content (legal, medical, business)
Production-Readyβ
- Docker-based deployment - runs anywhere
- GPU acceleration - NVIDIA GPUs supported
- Multi-worker architecture - process multiple files in parallel
- Offline capable - works in airgapped environments
Modern Stackβ
- React + TypeScript frontend - responsive, PWA-enabled
- FastAPI backend - high-performance Python
- PostgreSQL + OpenSearch - reliable, scalable storage
- Celery workers - distributed background processing
Use Casesβ
OpenTranscribe is perfect for:
- π Meeting transcriptions - Record and analyze team meetings with speaker identification
- ποΈ Podcast production - Generate transcripts and show notes automatically
- π Academic research - Transcribe interviews and lectures for analysis
- βοΈ Legal & compliance - Accurate transcripts with speaker identification for depositions
- π Customer service - Analyze support calls for quality and training
- π¬ Content creation - Generate subtitles and content from videos
Quick Lookβ
# Install with one command
curl -fsSL https://raw.githubusercontent.com/davidamacey/OpenTranscribe/master/setup-opentranscribe.sh | bash
# Start the application
cd opentranscribe
./opentranscribe.sh start
# Access at http://localhost:5173
Architecture Overviewβ
βββββββββββββββ ββββββββββββββββ βββββββββββββββ
β Frontend ββββββΆβ Backend ββββββΆβ Workers β
β (Svelte) β β (FastAPI) β β (Celery) β
βββββββββββββββ ββββββββββββββββ βββββββββββββββ
β β
βΌ βΌ
ββββββββββββββββ βββββββββββββββ
β PostgreSQL β β WhisperX β
β MinIO β β PyAnnote β
β OpenSearch β β LLM β
ββββββββββββββββ βββββββββββββββ
System Requirementsβ
Minimum:
- 8GB RAM
- 4 CPU cores
- 50GB disk space
- Docker & Docker Compose
Recommended:
- 16GB+ RAM
- 8+ CPU cores
- 100GB+ SSD
- NVIDIA GPU with 8GB+ VRAM (RTX 3070 or better)
Next Stepsβ
Ready to get started? Follow our Quick Start Guide to install OpenTranscribe in minutes.
Or explore:
- Installation Guide - Detailed installation instructions
- Hardware Requirements - Hardware recommendations
- Configuration - Customize your setup