WCAG 1.2.5: Audio Description (Prerecorded)

WCAG 1.2.5, part of the Web Content Accessibility Guidelines, addresses the need for audio description for prerecorded video content within synchronized media. This criterion is crucial for making video content accessible to individuals who are blind or have low vision, ensuring they receive all critical visual information necessary to understand the full context of the media.

Understanding WCAG 1.2.5: Audio Description (Prerecorded)

At its core, WCAG 1.2.5 requires that a separate narration track is provided to describe important visual details in prerecorded synchronized media. This narration, known as audio description, is typically inserted into pauses in the media’s original soundtrack. It describes actions, body language, scene changes, on-screen text, and other visual information that is not conveyed through the main audio alone.

Why is Audio Description Important?

Audio description plays a vital role in digital accessibility for several reasons:

  • Empowers Blind and Low-Vision Users: Without audio description, individuals who cannot see the visual track of a video miss critical information. This includes visual cues like facial expressions, body language, scene settings, costume details, on-screen text, and visual actions that drive the narrative or convey important data. Audio description bridges this gap.
  • Enhances Comprehension: By providing a verbal account of visual elements, audio description ensures that users can fully comprehend the content, context, and storyline of any video. This is essential for educational, informational, and entertainment media alike.
  • Promotes Inclusivity: Implementing audio description ensures that your content is usable and enjoyable by a wider audience, demonstrating a commitment to inclusive design and equal access to information.
  • Legal and Ethical Compliance: Adhering to WCAG 1.2.5 helps organizations meet accessibility regulations and uphold ethical responsibilities towards people with disabilities.

WCAG 1.2.5 Success Criterion Details

The formal wording for Success Criterion 1.2.5 at Level AA is:

Audio Description (Prerecorded): Audio description is provided for all prerecorded video content in synchronized media.

Key terms to understand:

  • Audio Description: Narration added to the soundtrack of a video to describe important visual details that cannot be understood from the main soundtrack alone.
  • Prerecorded: Content that is not live. This criterion does not apply to live broadcasts (which are covered by WCAG 1.2.7).
  • Synchronized Media: Media that combines audio and/or video with another format for presenting information and/or with time-based interactive components. Examples include movies, TV shows, and video clips with soundtracks.

Practical Guidelines for Compliance

Achieving compliance with WCAG 1.2.5 involves careful planning and implementation:

How to Provide Audio Description

  • Standard Audio Description (AD): This is the most common form, where narration is inserted into natural pauses in the video’s original soundtrack without stopping the main content. The describer must be concise and prioritize the most crucial visual information.

  • Extended Audio Description (EAD): For content with very dense visual information and few natural pauses, extended audio description may be necessary. This involves temporarily pausing the video’s main content to allow for a more thorough description of the scene. The user should typically have control over whether to use standard or extended AD.

  • Separate Audio Track: The most robust method is to provide an entirely separate audio track that includes the original audio mixed with the audio description. This allows users to select the track with AD from their media player controls.

  • Description Track (WebVTT): For simpler implementations, especially on the web, a WebVTT file with description cues can be linked to the <video> element using the <track> element with kind="descriptions". While technically possible, this is less common for full audio description as browsers don’t natively synthesize audio from these tracks, requiring custom JavaScript solutions or relying on screen readers to interpret them (which is not ideal). It is more commonly used for visual descriptions in specific contexts or by specific platforms that process these tracks.

Key Considerations for Creating Audio Descriptions

  • Accuracy and Objectivity: Descriptions should accurately reflect the visual content without interpretation or adding unnecessary subjective commentary.
  • Clarity and Conciseness: Use clear, straightforward language. Every word counts, especially in standard AD, where space is limited.
  • Timing and Synchronization: The description must be perfectly synchronized with the visual events it describes. Incorrect timing can be more confusing than no description.
  • Contextual Relevance: Prioritize describing visual information that is essential for understanding the plot, character development, setting, or main message of the content.
  • Voice Quality: The audio describer should have a clear, pleasant, and easy-to-understand voice that contrasts sufficiently with the original narration or dialogue.
  • Technical Quality: Ensure the audio description is recorded and mixed at appropriate volume levels, so it’s clearly audible without overpowering the original content.

Examples

Correct Implementation: Video with a Dedicated Audio Description Track

The most common and recommended approach for web content is to provide multiple audio tracks, one of which includes the audio description. Users can then select their preferred track via the video player’s controls.

Scenario: An educational video about a scientific experiment, where visual steps are crucial. The video player offers an option to switch audio tracks.

HTML Structure for Video with Multiple Audio Tracks:

<video controls poster="thumbnail.jpg" aria-labelledby="video-title">
  <source src="video_original_audio.mp4" type="video/mp4" data-track-label="Original Audio" />
  <source src="video_audio_description.mp4" type="video/mp4" data-track-label="Audio Description" />
  <!-- Fallback content for browsers that don't support video -->
  <p>Your browser does not support HTML5 video. Here is a <a href="video_original_audio.mp4">link to the video (original audio)</a> and a <a href="video_audio_description.mp4">link to the video (audio described)</a>.</p>
</video>
<h3 id="video-title">The Amazing World of Quantum Physics</h3>

Note: In this setup, video_audio_description.mp4 is a version of the video file where the audio track has been pre-mixed to include the original audio along with the descriptive narration. A JavaScript-enhanced video player would typically manage the switching of these <source> elements or, more robustly, offer an audio track selection UI using a single video source with multiple audio tracks embedded within (e.g., using an HLS or DASH stream with multiple audio renditions).

Alternatively, if a WebVTT ‘description’ track is used, it often requires a custom player to read the cues and play synthesized speech, which is generally not what WCAG implies for ‘audio description’. However, for screen reader users, such a track could provide text alternatives that the screen reader might read out.

Example of a WebVTT Description Track (less common for full AD, but conceptually relevant for text-based descriptions):

WEBVTT
Kind: descriptions
Language: en

00:00:05.000 --> 00:00:08.000
A scientist in a white lab coat meticulously adjusts a microscope.

00:00:12.500 --> 00:00:16.000
Close-up of a bubbling beaker, with a blue liquid turning green.

00:00:20.000 --> 00:00:23.000
Text appears on screen: "Experiment Results: Phase 1 Complete."

For a true audio description, these text descriptions would be voiced by a narrator and embedded directly into an audio stream, not just provided as text.

Incorrect Implementation: Video without Audio Description

Scenario: A cooking show video where visual steps (e.g., how to chop vegetables, specific cooking techniques) are shown but not verbally described.

HTML Structure for Video Lacking Audio Description:

<video controls poster="cooking_show_thumbnail.jpg">
  <source src="cooking_show.mp4" type="video/mp4" />
  <p>Your browser does not support HTML5 video. Here is a <a href="cooking_show.mp4">link to the video</a>.</p>
</video>
<h3>Quick and Easy Dinner Recipes</h3>

Reason for Incorrectness: This video provides only the main audio track and visual content. Any user who cannot perceive the visual information will miss crucial cooking steps and techniques demonstrated on screen, making the content inaccessible and incomplete for them.

Best Practices and Common Pitfalls

Best Practices

  • Plan Early: Integrate audio description into the video production workflow from the start, rather than as an afterthought. This makes it easier to allocate time and resources.
  • Use Professional Describers: Engage experienced audio describers who understand the nuances of storytelling, timing, and objectivity.
  • Offer Clear Controls: Ensure your media player provides an obvious and easy-to-use control for users to turn audio description on or off, or to select an audio track with description.
  • Consider Extended AD: For highly visual and information-dense content, offer extended audio description as an option.
  • Test with Users: Conduct user testing with individuals who rely on audio description to gather feedback and refine your descriptions.
  • Label Clearly: Label the audio description option or track clearly in your player UI (e.g., “Audio Description,” “AD”).

Common Pitfalls

  • No Audio Description Provided: The most common failure is simply not offering any audio description for prerecorded video content.
  • Poor Quality Description: Descriptions that are too fast, too slow, unclear, incomplete, or subjective reduce accessibility instead of enhancing it.
  • Incorrect Timing: Descriptions that are not synchronized with the corresponding visual events can be highly disorienting and frustrating.
  • Insufficient Description: Only describing some visual elements while omitting others that are crucial for understanding the content.
  • Difficulty in Accessing AD: If the audio description is available but hidden behind complex menus or non-standard controls, users may struggle to activate it.
  • Overlapping Audio: Descriptions that talk over important dialogue or sound effects in the original track. Professional mixing is crucial.

Conclusion

WCAG 1.2.5 is a fundamental criterion for ensuring that all users, regardless of visual ability, can fully engage with and understand video content. By providing high-quality, well-timed, and easily accessible audio descriptions, you not only comply with accessibility standards but also create a more inclusive and equitable digital experience for everyone.

Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.