WCAG 1.2.3: Audio Description or Media Alternative (Prerecorded)
The WCAG 1.2.3 success criterion, titled “Audio Description or Media Alternative (Prerecorded)”, is a foundational Level A requirement aimed at making prerecorded time-based media accessible to a broader audience. This criterion ensures that users who cannot perceive visual or auditory information in video or audio content still receive equivalent access to the information conveyed.
Understanding WCAG 1.2.3: Audio Description or Media Alternative (Prerecorded)
This criterion addresses different types of prerecorded media content:
- For prerecorded video-only content: It requires an alternative for time-based media (such as a text transcript) that presents all equivalent information conveyed visually.
- For prerecorded audio-only content: It requires an alternative for time-based media (such as a text transcript) that presents all equivalent information conveyed audibly.
- For prerecorded video content that includes audio: It requires an audio description of the video content.
Essentially, this criterion mandates a baseline level of accessibility for prerecorded multimedia, ensuring that critical information is not locked away in a single sensory modality.
Why This Criterion Matters for Accessibility
Accessibility Impact
Time-based media like videos, animations, and podcasts are rich sources of information and entertainment. However, without appropriate alternatives, they can become significant barriers for many users.
- For users who are blind or have low vision: Visual information in videos (e.g., text on screen, actions, non-verbal cues, data visualizations) is completely inaccessible without an audio description or a detailed text alternative.
- For users who are deaf or hard of hearing: While not the primary focus of 1.2.3 (that’s 1.2.2 for captions), media alternatives like transcripts can still be beneficial as they provide a complete textual representation of all content, including visual descriptions, which complements captions.
- For users with cognitive or learning disabilities: A well-structured text alternative can allow users to process information at their own pace, skip or reread sections, or use assistive technologies that may not interact seamlessly with video players.
By providing these alternatives, organizations ensure that the core message and all relevant details of their media content are perceivable and understandable by everyone, fostering true inclusion.
WCAG 1.2.3 Success Criterion Details
The formal requirements for WCAG 1.2.3 are:
-
For prerecorded video-only content: An alternative for time-based media is provided that presents equivalent information.
- Example: An animation demonstrating a process should be accompanied by a text transcript that thoroughly describes each visual step and outcome.
-
For prerecorded audio-only content: An alternative for time-based media is provided that presents equivalent information.
- Example: A podcast episode should have a full text transcript of all spoken words and significant non-speech audio (e.g., “(laughter)”, “(doorbell rings)”).
-
For prerecorded video content (that includes audio): Audio description of the prerecorded video content is provided.
- Example: A training video showing a presenter and on-screen graphics should include an audio description track that narrates the visual information not already conveyed by the main audio track.
A “media alternative” typically refers to a full text transcript that includes all relevant visual and auditory information, while “audio description” specifically adds spoken narration of important visual details to the video’s soundtrack.
Practical Guidelines for Compliance
To meet WCAG 1.2.3, consider the specific type of media content you are providing:
Providing Audio Description (for video with audio)
- Determine what needs description: Identify all visual information that is essential for understanding but is not conveyed through the primary audio track. This includes on-screen text, charts, graphs, character actions, scene changes, and relevant facial expressions.
-
Choose an audio description method:
- Integrated Audio Description: Narration is added during natural pauses in the video’s existing audio. This is often preferred as it’s seamless.
- Extended Audio Description: If there aren’t enough natural pauses, the video might be paused briefly to allow for longer descriptions. This requires careful editing of the video itself.
- Separate Audio Track: Provide an alternative audio track that includes the original audio along with the audio description. Users can switch to this track via player controls. This is typically implemented using the
<track kind="descriptions">
element in HTML5 video.
- Ensure clarity and conciseness: Audio descriptions should be clear, professionally narrated, and concise, adding value without overwhelming the listener or interfering with the primary content.
Providing a Media Alternative (for video-only or audio-only)
-
Create a comprehensive text transcript: This transcript must contain all relevant information, whether it was originally visual or auditory.
- For video-only: Describe all visual actions, on-screen text, graphics, and changes in scene. For instance, if a video shows a data chart, the transcript should describe what the chart depicts and any key insights.
- For audio-only: Transcribe all spoken dialogue and identify speakers. Also, describe any significant non-speech sounds (e.g., music, sound effects, audience reactions) that convey meaning.
- Link clearly: Provide a prominent link near the media player to access the full text alternative. This is often an HTML page but could also be a PDF if structured accessibly.
- Synchronize if beneficial: While not strictly required by 1.2.3, for complex media, a text alternative that is synchronized with the media (like an interactive transcript) can be highly beneficial for some users.
Examples of Implementation
Correct Implementations
Example 1: Video with Audio Description (Separate Track) and Full Transcript
This example demonstrates how to include an audio description track using the HTML <track>
element for video content that includes audio, alongside a link to a full text transcript.
<video controls poster="video-poster.jpg" width="640" height="360">
<source src="my-informative-video.mp4" type="video/mp4">
<!-- Audio Description Track -->
<track kind="descriptions" srclang="en" label="English Audio Description" src="my-informative-video-ad.vtt">
<!-- Captions for deaf/hard of hearing users -->
<track kind="captions" srclang="en" label="English Captions" src="my-informative-video-captions.vtt">
<p>Your browser does not support the video tag.</p>
</video>
<p><a href="my-informative-video-transcript.html">View Full Transcript (includes visual descriptions)</a></p>
Notes:
- The
my-informative-video-ad.vtt
file would contain the WebVTT cues for the audio description, synchronized with the video. - The
my-informative-video-transcript.html
would be a separate page providing a complete text alternative, including both the spoken content and descriptions of key visual elements.
Example 2: Video-only Content with a Media Alternative
For a video that contains only visual information (e.g., a silent animation or a graphical display), a detailed text alternative is required.
<video controls poster="animation-poster.jpg" width="640" height="360">
<source src="my-data-animation.mp4" type="video/mp4">
<p>Your browser does not support the video tag.</p>
</video>
<p>This animation illustrates key data trends from 2020-2023. <a href="my-data-animation-transcript.html">Read the full description and data analysis.</a></p>
Notes: The my-data-animation-transcript.html
page must contain a comprehensive description of all visual changes, data points, and insights presented in the animation, making the visual content accessible to screen reader users.
Example 3: Audio-only Content with a Media Alternative
For audio-only content like a podcast or an interview, a complete text transcript of all spoken words and significant sounds is essential.
<audio controls>
<source src="my-podcast-episode.mp3" type="audio/mpeg">
<p>Your browser does not support the audio tag.</p>
</audio>
<p>An episode discussing web accessibility best practices. <a href="my-podcast-episode-transcript.html">Read the full transcript.</a></p>
Notes: The my-podcast-episode-transcript.html
file must contain a word-for-word transcript, including speaker identification and descriptions of relevant non-speech audio events (e.g., “[Upbeat music plays]”, “[Audience applauds]”).
Incorrect Implementations
Example 1: Video with Visual Information but No Audio Description or Media Alternative
If a video contains crucial visual information (like on-screen text or actions) that is not conveyed by the main audio track and no alternative is provided, it fails WCAG 1.2.3.
<video controls src="product-demo.mp4">
<p>Your browser does not support the video tag.</p>
</video>
<!-- If 'product-demo.mp4' shows steps on a screen without verbal explanation -->
Reason: Users who are blind or have low vision will miss critical instructions or demonstrations presented visually, as there is no audio description or a text alternative to convey this information.
Example 2: Audio-only Content with an Incomplete Alternative
Providing only a summary or brief description for audio-only content is not an equivalent media alternative.
<audio controls src="expert-interview.mp3">
<p>Your browser does not support the audio tag.</p>
</audio>
<p>An interview with Dr. Jane Smith about current research topics.</p>
Reason: The provided text is merely a summary, not a full transcript of the interview. Users who cannot hear the audio will not have access to the full content of the conversation.
Best Practices and Common Pitfalls
Best Practices
- Plan for Accessibility from the Start: Integrate audio description and media alternative creation into your media production workflow. It’s much harder to add them after content is finalized.
- Provide Both When Possible: While 1.2.3 allows for an ‘or’ condition in some cases, providing both audio description and a full text transcript for all prerecorded video content offers the most comprehensive accessibility.
- Quality Matters: Ensure audio descriptions are clear, well-timed, and professionally narrated. Transcripts should be accurate, complete, and easy to read.
- Educate Content Creators: Train those creating videos and audio to understand the importance of descriptive visuals and clear audio, which can reduce the effort needed for separate audio descriptions.
- User Testing: Test your audio descriptions and media alternatives with users who rely on them to ensure they are effective and truly convey equivalent information.
- Accessible Player Controls: If using a custom media player, ensure that controls for toggling audio descriptions or accessing alternatives are clearly labeled, keyboard accessible, and compatible with screen readers.
Common Pitfalls
- Assuming Captions Suffice for Audio Description: Captions (WCAG 1.2.2) transcribe spoken dialogue and sound effects, but they do not describe visual information crucial for understanding. They serve different purposes.
- Incomplete Transcripts: Providing only a summary or partial transcript for audio-only or video-only content. A full, equivalent text alternative is required.
- Missing Visual Descriptions in Transcripts: For video-only content, a transcript that only includes audio (if any) or a basic summary of the topic, without describing the actual visual content, is insufficient.
- Poorly Timed or Overlapping Audio Descriptions: Audio descriptions that interfere with the main audio track, are too fast, or don’t allow enough time for processing can be more confusing than helpful.
- Difficult to Find/Activate Alternatives: Links to transcripts or controls for audio description that are hidden, unlabeled, or difficult to interact with for keyboard or screen reader users.
- Forgetting About Auto-Generated Content: Relying solely on auto-generated captions or transcripts without review and correction, as these are often inaccurate and lack proper description.
Related WCAG Guidelines
- WCAG 1.2.1: Audio-only and Video-only (Prerecorded) (Level A): Requires a media alternative for audio-only and video-only prerecorded content. This criterion partially overlaps with 1.2.3 by requiring the alternative but 1.2.3 also adds the audio description requirement for video with audio.
- WCAG 1.2.2: Captions (Prerecorded) (Level A): Requires captions for all prerecorded audio content in synchronized media. This addresses a different need (hearing impairment) than 1.2.3.
- WCAG 1.2.5: Audio Description (Prerecorded) (Level AA): Requires audio description for all prerecorded video content in synchronized media. This is a higher level of requirement for audio description than 1.2.3, which is Level A.
- WCAG 1.2.8: Media Alternative (Prerecorded) (Level AAA): Requires a media alternative for all prerecorded time-based media. This is a higher level of requirement for a media alternative than 1.2.3, which is Level A.