T5 Model Architecture

The “Transformer: T5” lecture video in C4W3 has a slide that shows an encoder/decoder, a language model, and a prefix LM architecture. The video ends by saying that I now know what the T5 architecture looks like. How do the three architectures relate to one another? Are they all part of the T5 architecture? I found this very confusing, and it wasn’t clear how the three diagrams related to the next slide.


Hi @esav

I agree that this video is far from crystal clear. What I can offer is that:

  • the left side (“encoder/decoder”) illustrates the traditional encoder-decoder architecture (non-causal attention on the input, causal attention on the output, as in the “Attention Is All You Need” paper for translation)
  • the middle (somewhat ambiguously named “Language model”; a clearer name would have been “Causal LM”) illustrates the decoder-only architecture (like GPT-style models)
  • the right side (“Prefix LM”) illustrates the T5 architecture - here, a prefix boundary splits the sequence into two sections. Within the prefix section, any token can attend to any other token (non-causal, like in the traditional encoder). Within the other section, each position can attend only to itself and previous tokens, including the whole prefix section (causal, like in the “Causal LM”).
    From the paper: [attached figure illustrating the three attention-mask patterns]

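The three attention patterns above can be sketched as boolean masks, where entry (i, j) says whether position i may attend to position j. This is a minimal illustration (not from the course materials), assuming a length-6 sequence whose first 3 tokens form the prefix:

```python
import numpy as np

n, prefix_len = 6, 3  # assumed example sizes

# Fully visible (encoder side of encoder/decoder):
# every token attends to every other token.
fully_visible = np.ones((n, n), dtype=bool)

# Causal LM (decoder-only, GPT-style):
# token i attends only to positions 0..i (lower triangle).
causal = np.tril(np.ones((n, n), dtype=bool))

# Prefix LM (T5): causal everywhere, except the prefix block,
# which is fully visible to itself.
prefix_lm = causal.copy()
prefix_lm[:prefix_len, :prefix_len] = True

print(prefix_lm.astype(int))
```

Printing the masks makes the relationship visible: the prefix LM mask is the causal mask with its top-left prefix-by-prefix block filled in, which is exactly the sense in which T5’s “causal with prefix” sits between the other two patterns.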
Let me know if that helps. Cheers

I see! So “causal with prefix” is the T5 architecture. Thanks!