Making RAG work with a locally running LLM

I am trying to implement the course exercises locally using my own JupyterLab setup and an LLM running on a local server. The course has the following endpoints configured:

  1. Coursera - `url = os.path.join('https://proxy.dlai.link/coursera_proxy/together', 'v1/chat/completions')`
  2. Together - `client = Together(api_key=together_api_key)`

The model being used is: `model: str = "meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo"`

Has anyone tried this with a local LLM installation using Ollama or similar? I would appreciate it if they could provide sample code.

I’m not a mentor for this course. Does this help?

Thank you. I tinkered with this further and am now rewriting the LLM functions (generate_with_single_input, etc.) to use Ollama instead of the Coursera proxy or Together. I'll be happy to share them with others if needed.
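
For anyone attempting the same thing, here is a minimal sketch of what such a rewrite could look like. It assumes Ollama is running locally with its OpenAI-compatible API at the default `http://localhost:11434/v1`, that a model has been pulled (e.g. `ollama pull llama3.1:8b`), and it only loosely mirrors the course's `generate_with_single_input` helper; the original's exact signature and return format may differ, so treat the parameter names and return dict here as assumptions.

```python
# Minimal sketch: point an OpenAI-compatible client at a local Ollama server
# instead of the Coursera proxy / Together endpoint. Not the official course code.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:11434/v1",  # Ollama's OpenAI-compatible endpoint (default port)
    api_key="ollama",                      # the client requires a key; Ollama ignores its value
)

def generate_with_single_input(
    prompt: str,
    role: str = "user",
    model: str = "llama3.1:8b",   # local model tag, replaces the Together model string
    temperature: float = 0.7,
    max_tokens: int = 512,
):
    """Send a single message to the local model and return the assistant reply."""
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": role, "content": prompt}],
        temperature=temperature,
        max_tokens=max_tokens,
    )
    # Assumed return shape; adjust to whatever the rest of the course notebook expects.
    return {"role": "assistant", "content": response.choices[0].message.content}

# Example usage
if __name__ == "__main__":
    print(generate_with_single_input("What is retrieval-augmented generation?")["content"])
```

Using the OpenAI-compatible route keeps the code close to the Together-style chat-completions calls the course already uses, so the rest of the notebook should need minimal changes; the `ollama` Python package would work just as well if you prefer its native `ollama.chat(...)` interface.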