The starting point of accessible multimedia, regardless of the media type, is having a transcript. As defined by the World Wide Web Consortium (W3C) Web Content Accessibility Guidelines 1.0 (WCAG):
(Source: W3C
Core Techniques of the WCAG
1.0 , 13-16: Audio and Video
).
There are various methods and considerations for getting the transcript which include:
When creating transcript, it is important to transcribe spoken
words as well as other things that convey information. For example, a phone
ringing in the video should be indicated in a transcript such as [phone ringing].
Transcribing things like this helps to clarify the situation, because it tells
the viewer the phone has rung rather than the person is making a call. Pick-up
cues or off screen things that convey context should also be transcribed; to
illustrate, consider a horror movie. Seeing Jamie Lee Curtis turn around in
shock obviously shows she is afraid of something, but it helps to know if she
heard a scream, a glass break somewhere in the house, or a creepy voice saying,
"Jay Jay Jay Jay, Son Son Son Son"
.
The size of transcribed sentences can vary in length, and depends directly on the amount of room available for the captioning as well as the size of the font. A general rule is to keep things in proportion to one another and usually display only one or two sentences at a time. If you want larger fonts, create more space for the captioning. However, consider that larger fonts with a small captioning space mean less text on the screen; the larger the font size the more rapidly the text on the screen changes. Alternately, smaller fonts with a large captioning space mean more text on the screen; the smaller the font size, the more difficult it is to read and the greater the likelihood of getting lost in the transcript, not knowing which text currently relates to what is occurring in the video.
The next page will discuss the general aspects of captioning video.