L7 Evaluation: Utils.py - A bit of prompt refinement needed II

egutierrez · March 4, 2024, 10:45am

Hi, I'm running the code from the video (there are two lines I had to guess, because in the video Dr. Isa didn't scroll far enough to the right). I ran it three separate times and got a different result each time:

1 - First execution:

Step2 response

[
    {'category': 'Televisions and Home Theater Systems'}
]

2 - Second Execution:

Step2 response

[
    {'category': 'Smartphones and Accessories'},
    {'category': 'Cameras and Camcorders', 'products': ['FotoSnap DSLR Camera']}
]

3 - Third execution:

Step2 response

[
    {'category': 'Smartphones and Accessories'},
    {'category': 'Cameras and Camcorders'},
    {'category': 'Televisions and Home Theater Systems'}
]

So it is surprising that in such a controlled environment, with a tiny list of products and categories (no subcategories or anything complicated), the results can vary so much, especially with the temperature parameter set to zero:

def get_completion_from_messages(messages, model="gpt-3.5-turbo", temperature=0, max_tokens=500):
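
One way to put a number on that variation is to call the same prompt several times and tally the distinct outputs. This is only a minimal sketch: `count_distinct_responses` and `fake_completion` are hypothetical names (not part of utils.py), and the canned strings are placeholders so the snippet runs without an API key.

```python
from collections import Counter
import itertools

def count_distinct_responses(complete, messages, runs=3):
    """Call complete(messages) `runs` times and tally identical outputs."""
    return Counter(complete(messages).strip() for _ in range(runs))

# Stand-in for the real API call (assumption: placeholder data only).
_canned = itertools.cycle([
    "[{'category': 'A'}]",
    "[{'category': 'B'}]",
    "[{'category': 'A'}]",
])

def fake_completion(messages):
    return next(_canned)

tally = count_distinct_responses(fake_completion, messages=[], runs=3)
print(len(tally), "distinct responses in 3 runs")
```

With the real helper, the first argument would be something like `lambda m: get_completion_from_messages(m, temperature=0)`; more than one key in the tally means the runs disagreed.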

The expected result should be (as I understand it):

[
   {'products': 'SmartX ProPhone'},
   {'products': 'FotoSnap DSLR Camera'},
   {'category': 'Televisions and Home Theater Systems'}
]
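
To make the mismatch concrete, here is a rough sketch of how each Step 2 response could be compared with the expected result above. It assumes the response arrives as a Python-literal string (single quotes, hence `ast.literal_eval` rather than `json.loads`). `parse_step2`, `normalize` and `diff_against_expected` are hypothetical helpers for illustration, not functions from utils.py.

```python
import ast

def parse_step2(raw):
    """Parse a Step 2 response string into a list of dicts."""
    return ast.literal_eval(raw)

def normalize(items):
    """Flatten a list of dicts into a set of (key, value) pairs.

    List values such as 'products': [...] are expanded element by element.
    """
    pairs = set()
    for item in items:
        for key, value in item.items():
            values = value if isinstance(value, list) else [value]
            for v in values:
                pairs.add((key, v))
    return pairs

def diff_against_expected(raw, expected):
    """Report which expected pairs are missing and which extras appeared."""
    got = normalize(parse_step2(raw))
    want = normalize(expected)
    return {"missing": sorted(want - got), "unexpected": sorted(got - want)}

EXPECTED = [
    {'products': 'SmartX ProPhone'},
    {'products': 'FotoSnap DSLR Camera'},
    {'category': 'Televisions and Home Theater Systems'},
]

# The second execution from this post, as a raw string.
RUN_2 = """[
    {'category': 'Smartphones and Accessories'},
    {'category': 'Cameras and Camcorders', 'products': ['FotoSnap DSLR Camera']}
]"""

report = diff_against_expected(RUN_2, EXPECTED)
print(report["missing"])
print(report["unexpected"])
```

For the second execution this flags the phone and the TV category as missing, and the two broader categories as unexpected extras.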
