My question is about the RAG process and the models used first to embedding and later to QA.
The initial embedding is only for retrieval tasks and later the QA model receives the prompt (query + retrieval) in text format, to again with its embedding, vectorize and predict?
Then, the embedding in retrieval tasks and the embedding used by the QA model can be different?
Thanks!