I need help. I am currently studying the course Multimodal RAG: Chat with Videos.
The course uses the bridgetower-large-itm-mlm-itc model through PredictionGuard. I am trying to reproduce the examples locally on my laptop and am currently working on chapter L4, "Multimodal Retrieval from Vector Stores". The problem is that the helper calls PredictionGuard, and I do not have a PredictionGuard API key. I searched on Hugging Face and found the model card BridgeTower/bridgetower-large-itm-mlm-itc, but now I am stuck on how to write a function that uses it instead. This is the course helper:
# Helper function to compute the joint embedding of a prompt and a
# base64-encoded image through PredictionGuard.
def bt_embedding_from_prediction_guard(prompt, base64_image):
    # get PredictionGuard client
    client = _getPredictionGuardClient()
    message = {"text": prompt}
    if base64_image is not None and base64_image != "":
        if not isBase64(base64_image):
            raise TypeError("image input must be in base64 encoding!")
        message["image"] = base64_image
    response = client.embeddings.create(
        model="bridgetower-large-itm-mlm-itc",
        input=[message],
    )
    return response["data"][0]["embedding"]
Can you suggest how I should modify the function to successfully use bridgetower-large-itm-mlm-itc locally?