M5_UGL_1_R - How is code as a plan less brittle than tool soup?

Phil01 · October 23, 2025, 10:23am

The agent is very sensitive to user input. If the user writes “I would like to Return 2 Aviators” with a capital “R”, the whole process fails. The generated code may or may not convert everything to lowercase, which seems brittle to me.
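To make the failure mode concrete, here is a minimal sketch of what I assume is happening. The function names are hypothetical and are not taken from the course notebook; the point is only that a case-sensitive match misses “Return”, while normalizing the input first does not.

```python
def brittle_is_return(request: str) -> bool:
    # Case-sensitive substring check: misses "Return" with a capital R.
    return "return" in request


def robust_is_return(request: str) -> bool:
    # casefold() normalizes case before matching, so "Return" is found.
    return "return" in request.casefold()


request = "I would like to Return 2 Aviators"
print(brittle_is_return(request))  # False
print(robust_is_return(request))   # True
```

Whether the generated code looks like the first or the second function depends on what the LLM happens to write on a given run.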

Am I missing something?

reinoudbosch · November 3, 2025, 10:47am

Hi Phil01,

Great question.

My understanding of Andrew’s statement that the tool-soup approach is brittle is that it refers to the possibility that a required tool is missing from the tool soup, which may lead either to a break in execution or to the invocation of an incorrect tool.

As Andrew states in the first video of this module, allowing an LLM to plan can be an experimental approach and can sometimes make the system a little hard to control. So this approach can also be brittle, but in a different way. Because code generation and execution can make the required tools available on demand, it may be more versatile (and in that sense less ‘brittle’) than a tool soup.
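A rough sketch of the contrast, under my own assumptions: the tool registry, the tool names, and the "generated" plan string below are all hypothetical and are not from the course material. A fixed tool set fails when the planner asks for a tool that was never registered, whereas a code plan can compose the primitives that do exist.

```python
# Hypothetical fixed tool set ("tool soup"); names and prices are illustrative.
TOOLS = {
    "get_price": lambda item: {"aviators": 80.0}[item],
}


def call_tool(name, *args):
    # Fails if the planner asks for a tool nobody registered.
    if name not in TOOLS:
        raise LookupError(f"no such tool: {name}")
    return TOOLS[name](*args)


# With code as the plan, a missing "refund_total" tool can be expressed by
# composing what exists. This string stands in for LLM-generated code.
generated_plan = "result = call_tool('get_price', 'aviators') * 2"


def run_plan(code: str):
    scope = {"call_tool": call_tool}
    exec(code, scope)  # Unsandboxed exec is for illustration only.
    return scope["result"]


print(run_plan(generated_plan))
```

The flip side is the one Phil01 raised: the generated code itself can be wrong or inconsistent between runs, so the brittleness moves rather than disappears.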

Just my two cents.

Phil01 · November 9, 2025, 10:10am

Thanks Reinoud
