Need more clarity on Constitutional AI

I have a couple of queries.

  1. With respect to the Generative AI life cycle, where exactly does Constitutional AI fit in? I am sure it is part of the Adapt and Align stage, but I want to know more specifically where it fits. A process flow diagram right from pretraining, through prompt tuning or the different flavours of fine-tuning, to application integration, incorporating elements like Constitutional AI, PPO, etc., would be very useful.

  2. How do we incorporate constitutional policies - as prompt engineering or few-shot examples?

Request clarification.

As far as my above query is concerned, after the Week 3 lab the PPO part is clear, but the Constitutional AI portion is still not very clear. A comprehensive diagram showing the lifecycle, along with a process flow diagram, would help.

You can use Constitutional AI in two stages:

  • Fine-tuning: use red teaming and the revised constitutional responses to generate data for fine-tuning your LLM.
  • RLAIF: similar to RLHF, but with AI feedback in place of human feedback.

To fine-tune your model you should:

  • First, you prompt your model in ways that will generate harmful content. This process is called red teaming.
  • Create a new prompt with the Constitutional AI rules and the previous step’s harmful content, and ask the LLM to reflect on whether it follows the rules.
  • The LLM will return a new completion pointing out why it failed the Constitutional AI rules.
  • Finally, you take the original prompt, the explanation of why it fails the rules, and ask the LLM to s

Thanks for the info. The response seems to be truncated; I would like to see the missing portion.

I will rephrase my response.

You can use Constitutional AI to solve two problems related to refining a model:

1. Creating Training Data for Fine-Tuning

Training data is not always easy to get. You can synthetically create training data for fine-tuning using red teaming and the revised constitutional responses.
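To make this concrete, here is a minimal Python sketch of that critique-and-revision loop, assuming a placeholder `generate()` function standing in for whatever LLM call you use; the constitution text and red-team prompt are illustrative, not taken from the course:

```python
# Sketch: generating supervised fine-tuning data with Constitutional AI.
# `generate(prompt)` is a placeholder for any LLM completion call.

CONSTITUTION = (
    "Please choose the response that is the most helpful, honest, and harmless."
)

red_team_prompts = [
    "How can I hack into my neighbor's wifi?",  # prompt designed to elicit harmful output
]

def generate(prompt: str) -> str:
    """Placeholder for a call to your LLM (API or local model)."""
    raise NotImplementedError

training_pairs = []
for prompt in red_team_prompts:
    # 1. Red teaming: get the model's (potentially harmful) first answer.
    initial_response = generate(prompt)

    # 2. Critique: ask the model whether its answer follows the constitution.
    critique = generate(
        f"Constitution: {CONSTITUTION}\n"
        f"Prompt: {prompt}\nResponse: {initial_response}\n"
        "Identify any ways the response violates the constitution."
    )

    # 3. Revision: ask the model to rewrite its answer using the critique.
    revised_response = generate(
        f"Constitution: {CONSTITUTION}\n"
        f"Prompt: {prompt}\nResponse: {initial_response}\nCritique: {critique}\n"
        "Rewrite the response so that it fully complies with the constitution."
    )

    # 4. The (prompt, revised response) pair becomes fine-tuning data.
    training_pairs.append({"prompt": prompt, "completion": revised_response})
```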

2. Training the Reward Model for RLAIF

RLAIF is similar in concept to RLHF (Reinforcement Learning from Human Feedback).

Training the reward model needed for RLHF may require massive amounts of human feedback. To solve this scaling problem, you replace the human feedback with Constitutional AI feedback: the constitution is used to select the best answers to red-teaming prompts, and those preferences train your reward model.
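Here is a minimal sketch of that preference-labelling step, again assuming a placeholder `generate()` LLM call and an illustrative constitution; the prompt wording and the `collect_preference` helper are hypothetical, not the course's exact implementation:

```python
# Sketch: collecting AI (rather than human) preference labels for the reward model.

CONSTITUTION = (
    "Please choose the response that is the most helpful, honest, and harmless."
)

def generate(prompt: str) -> str:
    """Placeholder for a call to your LLM."""
    raise NotImplementedError

def collect_preference(prompt: str) -> dict:
    # Sample two candidate completions for the same red-team prompt.
    response_a = generate(prompt)
    response_b = generate(prompt)

    # Ask a feedback model which completion better follows the constitution.
    verdict = generate(
        f"Constitution: {CONSTITUTION}\n"
        f"Prompt: {prompt}\n(A) {response_a}\n(B) {response_b}\n"
        "Which response better follows the constitution? Answer A or B."
    ).strip().upper()

    if verdict.startswith("A"):
        chosen, rejected = response_a, response_b
    else:
        chosen, rejected = response_b, response_a

    # These (chosen, rejected) pairs replace human rankings when training
    # the reward model used in the RL (e.g. PPO) step.
    return {"prompt": prompt, "chosen": chosen, "rejected": rejected}
```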
