Introduction to Audiovisual Translation (AVT)
From the kkkk curriculum
TOPIC: Introduction to Audiovisual Translation (AVT)
Introduction to Audiovisual Translation (AVT)
TL;DR
Audiovisual Translation (AVT) involves translating spoken or written content in multimedia for a new audience. It's a specialized field that ensures viewers can understand films, TV shows, and games, regardless of their original language. AVT primarily uses subtitling and dubbing, each with its own pros and cons for different situations.
1. The Mental Model
Think of AVT as making a story universally understandable. You're bridging a language gap while keeping the original emotion and timing, often within tight technical limits. It's not just about words; it's about making a viewing experience accessible globally.
2. The Core Material
AVT is a fascinating field that combines linguistic skill with technical constraints and cultural awareness. It's more than just translating text; you're transforming a complete audiovisual experience. The main goal is to make content like movies, TV series, documentaries, and video games accessible to audiences who don't speak the original language.
There are two primary forms of AVT:
Subtitling
Subtitling involves displaying synchronized text on screen, usually at the bottom, that translates the dialogue and sometimes other on-screen text or important sound effects.
Types of Subtitling:
- Interlingual Subtitling: The most common type, translating from one language to another (e.g., English dialogue to Spanish subtitles).
- Intralingual Subtitling (Closed Captions): Subtitles in the same language as the audio, often used for the hard of hearing, deaf, or for language learning. These often include descriptions of sounds (e.g.,
[door creaks]).
Key Constraints in Subtitling:
- Reading Speed: Viewers need enough time to read the text without missing important visual information. This means limiting characters per line and lines per screen (often 2 lines).
- Synchronization (Spotting): Subtitles must appear and disappear at the precise moment the corresponding dialogue is spoken.
- Space: Limited screen real estate means captions must be concise.
- Cultural Adaptation: Sometimes phrases need to be adapted to make sense in the target culture, even if it's not a literal translation.
Dubbing
Dubbing replaces the original vocal track with recorded translated speech, spoken by voice actors. The goal is to make it sound as if the actors on screen are speaking the new language.
Key Constraints in Dubbing:
- Lip-sync: The translated dialogue must closely match the mouth movements of the original actors. This is a very challenging and time-consuming aspect.
- Isopronunciation: Attempting to match the number of syllables spoken in the original.
- Speech Rate: The translated speech needs to maintain a similar pace to the original.
- Voice Matching: Selecting voice actors whose voices and acting style fit the characters.
- Mix Sync: Ensuring the dubbed audio blends seamlessly with the original sound effects and music.
Other Forms of AVT

Photo by Nataliya Vaitkevich on Pexels
While less common as primary translation methods, these also fall under AVT:
- Voice-over: narration is laid over the original audio, which is usually lowered in volume. Often used for documentaries or news reports and doesn't require lip-sync.
- Audio Description: A narration track for visually impaired audiences, describing important visual information (actions, settings, costumes) during pauses in dialogue.
- Surtitling: Live subtitles displayed on a screen above a stage performance (like opera or theatre).
Here's a look at the typical workflow for a subtitling project:
graph TD
A["Receive Video & Script"] --> B{"Analyze Content & Identify Challenges"};
B --> C["Spotting (Timing In/Out)"];
C --> D["Translation (Draft)"];
D --> E["Adaptation & Condensing (Readability/Sync)"];
E --> F["Review & Quality Check"];
F --> G["Formatting & Export"];
G --> H["Client Approval & Delivery"];
3. Worked Example
Let's consider a short scene from a hypothetical show where a character says: "Oh my goodness, that's incredibly astonishing!"
Original English Dialogue (Audio & Script): "Oh my goodness, that's incredibly astonishing!" (spoken in 3 seconds)
Subtitling Process:
- Spotting: Determine it's spoken between 00:01:05 and 00:01:08.
- Initial Translation (to Spanish): "¡Oh, Dios mío, eso es increíblemente asombroso!"
- Readability/Conciseness Check: This might be too long to read comfortably in 3 seconds. It's 42 characters including spaces, which is often over limits for 2 lines.
- Adaptation: You might condense it to: "¡Dios mío, qué asombroso!" (22 characters) or slightly longer, "¡Increíblemente asombroso!" (25 characters). The choice depends on the specific character limit and desired tone.
- Final Subtitle:
1
00:01:05,000 --> 00:01:08,000
¡Dios mío, qué asombroso!
Dubbing Process:
- Initial Translation (to Spanish): "¡Oh, Dios mío, eso es increíblemente asombroso!"
- Lip-sync/Syllable Matching: The original has 9 syllables (Oh-my-good-ness-that's-in-cred-i-bly-as-ton-ish-ing). The literal translation has 13 syllables (¡Oh-Dios-mí-o-e-so-es-in-cre-í-ble-men-te-as-om-bro-so!). This doesn't match the mouth movements well.
- Adaptation for Lip-sync & Natural Flow: You'd look for a phrase that conveys the same meaning, fits the mouth shapes for key sounds (like 'm' in 'my goodness', 'a' in 'astonishing'), and has a similar rhythm and syllable count. A voice actor might try variations like:
- "¡Dios, qué impresionante!" (Fits better, shorter)
- "¡Cielos, qué maravilla!" (Different nuance but good sync)
- The aim is to find something that doesn't look obviously out of sync. This often involves significant creative rephrasing beyond literal translation.
4. Key Takeaways
- AVT focuses on translating multimedia content, primarily via subtitling or dubbing.
- Subtitling involves displaying text on screen, prioritizing brevity and reading speed within strict timing.
- Dubbing replaces original dialogue with translated voice-overs, demanding precise lip-sync and rhythm matching.
- Cultural adaptation is crucial in both methods to ensure the translated content resonates with the new audience.
-
Technical constraints like character limits, timing, and synchronization are as important as linguistic accuracy.
-
Common Mistakes:
- Translating too literally, making subtitles too long to read.
- Ignoring lip-sync in dubbing, leading to unnatural viewing experiences.
- Neglecting cultural nuances, resulting in awkward or misunderstood dialogue.
- Poor synchronization of subtitles, causing them to appear too early or too late.
5. Now Try It
Choose a short, 1-minute clip from a foreign language show or movie you've watched (or search for a trailer online). Try to subtitle 2-3 spoken lines into your native language. Focus on keeping the translated lines concise (aim for 42 characters per line max, two lines at most) and ensuring they appear at a natural reading pace. What were the biggest challenges in condensing the meaning?
Frequently asked about Introduction to Audiovisual Translation (AVT)
Get the full kkkk curriculum
Clone the complete plan to your dashboard for unlimited AI-generated notes, practice quizzes, and a personalised revision schedule.
Create Free Account