Course completion and future multimodal exploration request

lilyzhng · July 27, 2025, 10:20pm

It was great fun to pick up and complete the RAG course, which gives enough high level architecture design concepts but also production related software design details.

I am hoping we could have course of similar depth and production focus for multimodal RAG. I took the Building Multimodal Search and RAG (https://www.coursera.org/projects/building-multimodal-search-and-rag), it is a good begineer intro, but only scratches the surface.

Can any deep learning.ai staff help share if there are any intiatives in multimodal in-depth courses on qwen vl, gemma, nvidia cosmos eta?

ribarola · August 6, 2025, 10:18pm

Hi Lilyzhng.
I’m sharing a link from NVIDIA where you can find the topic of multimodality, and there’s even a training program for Multimodal Gen AI. I hope you find it useful.
The link is:

Topic		Replies	Views
What is the best course/short course for RAG which is multimodal. Text, images and Tables. Interested in Audio and Video late lat Generative AI with Large Language Models week-module-1 , ai-discussions , dl-ai-learning-platform	2	109	February 4, 2026
Building Multimodal Search and RAG - DeepLearning.AI Building Multimodal Search and RAGs	0	40	March 8, 2026
🌟 New Course! Enroll in Building Multimodal Search and RAG News and Announcements short-course , dl-ai-learning-platform	2	385	May 15, 2024
🌟 New Course! Enroll in Multimodal RAG: Chat with Videos News and Announcements short-course , rag , dl-ai-learning-platform	9	460	October 27, 2025
Please see this announcement Building Multimodal Search and RAGs	2	34	February 28, 2026

Course completion and future multimodal exploration request

Related topics