ViddyScribe supports two audio description types, each with different output formats:Documentation Index
Fetch the complete documentation index at: https://docs.viddyscribe.com/llms.txt
Use this file to discover all available pages before exploring further.
| AD Type | Outputs |
|---|---|
| Standard AD (no pauses) descriptions placed in dialogue gaps | Text, Audio, Video |
| Extended AD (with pauses) descriptions added with pauses | Text, Video |
- Standard AD
- Extended AD
Input Video
Output AD Video
Output AD Audio
Output AD Text (VTT)
How It Works
Configure & Generate
Choose configuration options such as audio description (AD) type, language, format, and video category.
Quick Links
Quickstart
Get started in 5 minutes
API Reference
Explore all endpoints
Authentication
Secure your requests
Understand the parameters
AD Type (ad_type)
AD Type (ad_type)
Set
ad_type to choose how descriptions are timed:standard_ad— Fits descriptions into existing no-speech gaps. Video runtime stays the same; some elements may go undescribed if there’s no room.extended_ad— Briefly pauses the video to make room for fuller descriptions. Runtime increases.
Language (language)
Language (language)
Set
language (BCP-47 code, default en-US) on any generation request to control narration locale. ViddyScribe supports 53 languages including English variants, Spanish, French, German, Hindi, Mandarin, Arabic, and more.See the full list in Languages and Voices.Voice (voice)
Voice (voice)
Set
voice on audio and video generation requests to pick the narrator (default Achernar). 31 voices are available (15 female, 16 male). Most cover all 53 languages; one extra Robotic voice is English-only.See the full list in Languages and Voices.Custom Instructions (custom_instructions)
Custom Instructions (custom_instructions)
Set
custom_instructions to guide the AI for specific terminology, style preferences, or focus areas.Output Format (format)
Output Format (format)
Every generation endpoint returns the text descriptions in the response. Set
format (default vtt) to choose how that text is serialized:json— structured array with timestamps and description textvtt— WebVTT subtitlessrt— SubRip subtitlesedl— Edit Decision List for video editors
API Endpoints
Upload Endpoints
| Endpoint | Method | Description |
|---|---|---|
/enterprise/api/upload_media | POST | Upload video via File or URL (Input) |
Generation Endpoints
| Endpoint | Method | Description |
|---|---|---|
/enterprise/api/generate_ad_text | POST | Generate text-only descriptions (Input: media_id, url, file) |
/enterprise/api/generate_ad_audio | POST | Generate audio-only track with text (Input: media_id, url, file) |
/enterprise/api/generate_ad_video | POST | Generate video with descriptions, and text (Input: media_id, url, file) |
Results Endpoint
| Endpoint | Method | Description |
|---|---|---|
/enterprise/api/get_results | GET | Get results (Input: media_id, url, file) |
Need Help?
Support
Get in touch with our support team
API Reference
Explore all available endpoints

