First of all, thanks for a great course!
I’m currently trying to improve the hotdog classifier code. So I changed this line to use:
hotdog_or_not = outlines.generate.choice(
vmodel,
['hotdog', 'not a hotdog'],
sampler=greedy(),
)
So it is working but the last image is classified as “hotdog”. So I tried to change the model to “HuggingFaceTB/SmolVLM-500M-Instruct” but the performance got worse: everything is classified as “not a hotdog”.
Can someone please explain why is that the case? Are there other models that can provide better performance than “SmolVLM-256M-Instruct”? Note that I tried these 2 but getting some errors:
- HuggingFaceTB/SmolVLM2-2.2B-Instruct
- openbmb/MiniCPM-Llama3-V-2_5