Standard AD vs Extended AD

ViddyScribe supports two audio description modes through the ad_type field:

| Mode | API value | Best for | Timing behavior |
| --- | --- | --- | --- |
| Standard AD | standard_ad | Concise descriptions that fit around existing speech | Places descriptions in natural pauses and no-speech zones. |
| Extended AD | extended_ad | Fuller descriptions when the video needs more visual context | Can extend or adjust timing to make room for longer descriptions. |

Standard AD

Use standard_ad when you want concise descriptions that avoid interrupting dialogue or important audio. This is usually the right choice for videos with enough natural pauses, simple scenes, or workflows that need a separate audio description track.
{
  "generation_config": {
    "ad_type": "standard_ad",
    "auto_fit": true
  }
}

Extended AD

Use extended_ad when the video has dense visual information, important on-screen context, or scenes that need more detail than natural pauses allow. Extended AD is useful for educational material, tutorials, government meetings, news, and other content where missing visual details can make the video hard to follow.
{
  "generation_config": {
    "ad_type": "extended_ad",
    "read_all_onscreen_text": true
  }
}

| Option | Applies to | Description |
| --- | --- | --- |
| auto_fit | Standard AD and Extended AD | Fits descriptions into detected no-speech zones when possible. |
| allow_descriptions_over_music | Standard AD and Extended AD | Allows descriptions over detected music when no better timing is available. |
| read_all_onscreen_text | Extended AD | Forces the model to read on-screen text verbatim for supported content types. |
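
The "Applies to" column in the options table can be checked on the client before a request is sent. The following is a minimal Python sketch, not part of ViddyScribe: the helper name build_generation_config and the OPTION_MODES mapping are hypothetical, and whether the service itself rejects a mismatched option is not stated on this page. Only the field names and mode values come from the tables above.

```python
import json

# Option names and the modes each one applies to, copied from the options table.
OPTION_MODES = {
    "auto_fit": {"standard_ad", "extended_ad"},
    "allow_descriptions_over_music": {"standard_ad", "extended_ad"},
    "read_all_onscreen_text": {"extended_ad"},
}

VALID_AD_TYPES = {"standard_ad", "extended_ad"}


def build_generation_config(ad_type, **options):
    """Return a {"generation_config": {...}} dict for the given mode.

    Hypothetical client-side helper: raises ValueError for an unknown mode,
    an unknown option, or an option that the table does not list for the mode.
    """
    if ad_type not in VALID_AD_TYPES:
        raise ValueError(f"unknown ad_type: {ad_type!r}")

    config = {"ad_type": ad_type}
    for name, value in options.items():
        modes = OPTION_MODES.get(name)
        if modes is None:
            raise ValueError(f"unknown option: {name!r}")
        if ad_type not in modes:
            raise ValueError(f"{name} does not apply to {ad_type}")
        config[name] = value
    return {"generation_config": config}


if __name__ == "__main__":
    # Reproduces the Extended AD fragment shown earlier on this page.
    body = build_generation_config("extended_ad", read_all_onscreen_text=True)
    print(json.dumps(body, indent=2))
```

Calling build_generation_config("standard_ad", read_all_onscreen_text=True) raises ValueError in this sketch, because the table lists read_all_onscreen_text for Extended AD only.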