What can you do with it?
The /transcription-deepgram command lets you transcribe audio and video files using Deepgram’s AI transcription service. It is suited to converting meetings, interviews, podcasts, and video content into text, with features such as speaker identification, sentiment analysis, topic extraction, and automatic formatting.
How to use it?
Basic Command Structure
Parameters
Required:
- fileUrls - The audio or video file to transcribe (URL or uploaded file). Supports both audio and video formats.
Optional:
- model - Deepgram model to use: nova-3 (default, recommended), nova-2, enhanced, or base
- languageCode - Language code (e.g., “en”, “es”, “fr”). Default: auto-detect
- enableAutomaticPunctuation - Add punctuation automatically (default: true)
- enableDiarization - Identify different speakers (default: false)
- diarizationSpeakerCount - Number of expected speakers (required if diarization is enabled)
- enableUtterances - Segment transcription by speaker turns (default: false)
- enableParagraphs - Format output into paragraphs (default: false)
- enableSummary - Generate AI summary of content (default: false)
- enableTopics - Extract key topics discussed (default: false)
- enableIntents - Identify user intents (default: false)
- enableSentiment - Analyze sentiment (positive/negative/neutral) (default: false)
- customTopics - Custom topics to detect (array of strings)
- customIntents - Custom intents to detect (array of strings)
- collectionId - File store resource ID for storing results (use the /filestore slash command to select; default: multimedia artifact collection)
- triggerUrls - Webhook URLs to notify when processing completes
- outputFileNames - Output file names for async mode (can be .json or .txt format)
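The conditional rules in the parameter list (a required fileUrls, a fixed set of model names, a speaker count that becomes mandatory once diarization is on, and .json or .txt output names) can be sketched as a small Python helper. This is only an illustration: build_params and ALLOWED_MODELS are hypothetical names, treating fileUrls as a list is an assumption, and how the resulting values are passed to the command is not covered here.

```python
# Hypothetical helper; only the parameter names come from the list above.
ALLOWED_MODELS = {"nova-3", "nova-2", "enhanced", "base"}


def build_params(file_urls, model="nova-3", **options):
    """Assemble a parameter dict and check the documented constraints."""
    if not file_urls:
        raise ValueError("fileUrls is required")
    if model not in ALLOWED_MODELS:
        raise ValueError(f"unknown model: {model}")
    # diarizationSpeakerCount is documented as required once diarization is on.
    if options.get("enableDiarization") and "diarizationSpeakerCount" not in options:
        raise ValueError(
            "diarizationSpeakerCount is required when enableDiarization is true"
        )
    # Output file names are documented as .json or .txt.
    for name in options.get("outputFileNames", []):
        if not name.endswith((".json", ".txt")):
            raise ValueError(f"unsupported output file extension: {name}")
    return {"fileUrls": file_urls, "model": model, **options}


params = build_params(
    ["https://example.com/meeting.mp3"],  # placeholder URL
    enableDiarization=True,
    diarizationSpeakerCount=3,
)
```

Checking the constraints before submitting a request surfaces a missing speaker count immediately instead of after the file has been uploaded.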
Response Format
Async Response (for large files or when background processing is needed):
Examples
Basic Transcription
Meeting Transcription with Speaker Identification
Video Transcription with Analysis
Customer Service Analysis
Async Processing for Large Files
Quick Sync Transcription
When to Use Async vs Sync
- Use ASYNC for:
  - Large files (more than 5 minutes)
  - Multiple files
  - When you need to continue working while transcription processes
  - When you want to specify custom output file names
- Use SYNC for:
  - Quick transcriptions
  - Short audio clips (less than 5 minutes)
  - When immediate results are needed
  - Simple, single-file transcriptions
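The async/sync guidance above can be expressed as a short decision function. The sketch below is hypothetical: choose_mode and its arguments are illustrative names, and treating a file of exactly 5 minutes as sync is an assumption, since the guidance only covers "more than" and "less than" 5 minutes.

```python
SYNC_LIMIT_SECONDS = 5 * 60  # the 5-minute threshold from the guidance above


def choose_mode(duration_seconds, file_count=1,
                custom_output_names=False, run_in_background=False):
    """Return "async" or "sync" following the guidance above."""
    # Multiple files, custom output names, or background work all call for async.
    if file_count > 1 or custom_output_names or run_in_background:
        return "async"
    # Files longer than 5 minutes are treated as large.
    if duration_seconds > SYNC_LIMIT_SECONDS:
        return "async"
    # Assumption: everything else, including exactly 5 minutes, runs sync.
    return "sync"


mode = choose_mode(duration_seconds=42 * 60)  # a 42-minute recording
```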
Usage Recommendations
For Meeting Transcriptions
Set enableDiarization=true, enableUtterances=true, and enableParagraphs=true to get speaker-labeled, well-formatted transcripts.
For Content Analysis
Set enableSummary=true, enableTopics=true, and enableSentiment=true to extract insights from audio/video content.
For Customer Service
Set enableIntents=true, enableSentiment=true, and enableDiarization=true to analyze customer interactions.
For Quick Transcripts
Use the default settings with enableAutomaticPunctuation=true for fast, accurate transcription without additional features.
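The four recommendations above amount to reusable flag sets. One way to keep them together is a preset table, sketched here in Python. PRESETS, params_for, and the preset keys are hypothetical names. The speaker_count argument reflects the parameter list's note that diarizationSpeakerCount is required whenever diarization is enabled, which applies to the meeting and customer service presets.

```python
# Hypothetical preset table; the flag names and values come from the
# recommendations above.
PRESETS = {
    "meeting": {
        "enableDiarization": True,
        "enableUtterances": True,
        "enableParagraphs": True,
    },
    "content_analysis": {
        "enableSummary": True,
        "enableTopics": True,
        "enableSentiment": True,
    },
    "customer_service": {
        "enableIntents": True,
        "enableSentiment": True,
        "enableDiarization": True,
    },
    "quick": {"enableAutomaticPunctuation": True},
}


def params_for(preset, file_urls, speaker_count=None):
    """Return the parameters for one of the recommended scenarios."""
    flags = dict(PRESETS[preset])  # copy so the shared preset is not mutated
    if flags.get("enableDiarization"):
        if speaker_count is None:
            raise ValueError(f"preset {preset!r} needs a speaker count")
        flags["diarizationSpeakerCount"] = speaker_count
    return {"fileUrls": file_urls, **flags}


call_params = params_for(
    "customer_service",
    ["https://example.com/support-call.wav"],  # placeholder URL
    speaker_count=2,
)
```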
Supported Models
- nova-3 (default, recommended) - Latest and most accurate model
- nova-2 - Previous generation, high accuracy
- enhanced - High accuracy for challenging audio conditions
- base - Faster, cost-effective option for clear audio
Common Use Cases
- “Transcribe this meeting with speaker identification” → Use enableDiarization=true and enableUtterances=true
- “Get a summary of this audio” → Use enableSummary=true
- “What topics are discussed?” → Use enableTopics=true
- “Analyze the sentiment” → Use enableSentiment=true
- “Format it nicely with paragraphs” → Use enableParagraphs=true
Notes
Supported Formats:
- Audio: MP3, WAV, M4A, FLAC, OGG, AAC, AIFF
- Video: MP4, MOV, AVI, and other common video formats
Language Support:
- Automatic language detection
- Over 50 languages supported
- Specify a language code for better accuracy
Async Processing:
- Async mode creates a placeholder file that updates when processing completes
- Use the signedUrl from the response to monitor processing status
- Results are saved to the specified collection (default: multimedia artifact collection)
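Monitoring an async job through its signedUrl can be sketched as a polling loop. Everything beyond the signedUrl itself is an assumption here: wait_for_result is a hypothetical name, and because the placeholder file's format is not described in these notes, both the download function (fetch) and the placeholder check (is_placeholder) are supplied by the caller.

```python
import time


def wait_for_result(signed_url, fetch, is_placeholder,
                    interval=5.0, timeout=600.0, sleep=time.sleep):
    """Poll signed_url until the placeholder is replaced or the timeout passes.

    fetch(url) returns the current file content; is_placeholder(content)
    returns True while processing is still running. Both are assumptions
    provided by the caller.
    """
    deadline = time.monotonic() + timeout
    while True:
        content = fetch(signed_url)
        if not is_placeholder(content):
            return content
        if time.monotonic() >= deadline:
            raise TimeoutError(f"no result after {timeout} seconds")
        sleep(interval)


# Usage with stand-in callables; a real fetch would download the signed URL.
_responses = iter(["PENDING", "PENDING", "final transcript text"])
result = wait_for_result(
    "https://example.com/signed-url",  # placeholder URL
    fetch=lambda url: next(_responses),
    is_placeholder=lambda content: content == "PENDING",
    interval=0,
    sleep=lambda seconds: None,
)
```

When triggerUrls is set, the webhook notification may make polling unnecessary.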

