Looking at the videos on sequence-to-sequence models in C5W3, I wonder if ChatGPT is a combination of a sequence-to-sequence model (to kick off and wrap up the responses) and some sort of one-to-many model (probably with LSTM cells) in the middle, plus some regulator for response length (?)
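For context, here's roughly the picture I have in mind: a minimal LSTM encoder-decoder sketch in Keras, in the spirit of C5W3. All names and dimensions below are toy values I made up, not anything from ChatGPT:

```python
# Toy LSTM encoder-decoder (seq2seq); all sizes are invented toy values.
import tensorflow as tf
from tensorflow.keras import layers

vocab_size, embed_dim, units = 10_000, 128, 256  # hypothetical dimensions

# Encoder: reads the input sequence and summarizes it into its final states.
enc_in = layers.Input(shape=(None,), name="encoder_tokens")
enc_emb = layers.Embedding(vocab_size, embed_dim)(enc_in)
_, state_h, state_c = layers.LSTM(units, return_state=True)(enc_emb)

# Decoder: the "one-to-many" part, seeded with the encoder's states and
# producing next-token scores at every step.
dec_in = layers.Input(shape=(None,), name="decoder_tokens")
dec_emb = layers.Embedding(vocab_size, embed_dim)(dec_in)
dec_seq = layers.LSTM(units, return_sequences=True)(
    dec_emb, initial_state=[state_h, state_c])
logits = layers.Dense(vocab_size)(dec_seq)

model = tf.keras.Model([enc_in, dec_in], logits)
```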
It’s a very complicated example of a Transformer, which you’ll see in C5W4.
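As a rough preview of W4 (a toy sketch only, with invented dimensions, not ChatGPT's actual architecture), the key difference is that a Transformer replaces recurrent cells with self-attention. A causal mask lets one stack of layers both read the prompt and generate the reply token by token, so no separate seq2seq/one-to-many split is needed:

```python
# Toy decoder-style Transformer block in Keras; all sizes are made up.
# (use_causal_mask needs a reasonably recent TensorFlow, >= 2.10.)
import tensorflow as tf
from tensorflow.keras import layers

d_model, num_heads, ff_dim, vocab_size = 128, 4, 512, 10_000  # toy sizes

tokens = layers.Input(shape=(None,), name="tokens")
x = layers.Embedding(vocab_size, d_model)(tokens)

# Causal self-attention: each position attends only to earlier positions.
attn = layers.MultiHeadAttention(num_heads, key_dim=d_model // num_heads)
x = layers.LayerNormalization()(x + attn(x, x, use_causal_mask=True))

# Position-wise feed-forward network with a residual connection.
ffn = tf.keras.Sequential([layers.Dense(ff_dim, activation="relu"),
                           layers.Dense(d_model)])
x = layers.LayerNormalization()(x + ffn(x))

logits = layers.Dense(vocab_size)(x)  # next-token scores at each position
model = tf.keras.Model(tokens, logits)
```

Roughly speaking, generation is then a loop: feed the tokens so far, sample the next token from the last position's logits, append, repeat until a learned end-of-sequence token is produced, so no explicit length regulator is required.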
… I’ll hold my horses until next week