✨ New course! Enroll in Retrieval Optimization: From Tokenization to Vector Quantization

Community-Team · October 2, 2024, 3:00pm

What you’ll learn in this course

In Retrieval Optimization: From Tokenization to Vector Quantization, taught by Kacper Łukawski, Developer Relations Lead of Qdrant, you’ll learn all about tokenization and also how to optimize vector search in your large-scale customer-facing RAG applications. You’ll explore the technical details of how vector search works and how to optimize it for better performance.

This course focuses on optimizing retrieval, the first step in your RAG pipeline, and the quality of your search results. You’ll see how different tokenization techniques like Byte-Pair Encoding, WordPiece, and Unigram work and how they affect search relevancy. You’ll also learn how to address common challenges such as terminology mismatches and truncated chunks in embedding models.
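To give a feel for what Byte-Pair Encoding does, here is a minimal toy sketch in plain Python. It is not taken from the course materials: the function names (`merge_pair`, `learn_bpe_merges`, `tokenize`) and the tiny training list are made up for illustration, and real tokenizers add details such as byte-level fallbacks and word-boundary markers that are skipped here.

```python
from collections import Counter


def merge_pair(symbols, pair):
    """Replace every adjacent occurrence of `pair` in `symbols` with one merged symbol."""
    out, i = [], 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            out.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            out.append(symbols[i])
            i += 1
    return tuple(out)


def learn_bpe_merges(words, num_merges):
    """Learn up to `num_merges` merge rules by repeatedly merging the most frequent pair."""
    # Each word starts as a tuple of single characters.
    corpus = Counter(tuple(w) for w in words)
    merges = []
    for _ in range(num_merges):
        pair_counts = Counter()
        for symbols, freq in corpus.items():
            for pair in zip(symbols, symbols[1:]):
                pair_counts[pair] += freq
        if not pair_counts:
            break  # every word is already a single symbol
        best = max(pair_counts, key=pair_counts.get)
        merges.append(best)
        merged = Counter()
        for symbols, freq in corpus.items():
            merged[merge_pair(symbols, best)] += freq
        corpus = merged
    return merges


def tokenize(word, merges):
    """Split a word into subword tokens by applying the learned merges in order."""
    symbols = tuple(word)
    for pair in merges:
        symbols = merge_pair(symbols, pair)
    return list(symbols)


# Hypothetical toy training data, chosen only for illustration.
training_words = ["search", "searching", "searched", "research", "vector", "vectors"]
merges = learn_bpe_merges(training_words, num_merges=10)

print(tokenize("searching", merges))     # a word seen during training
print(tokenize("quantization", merges))  # an unseen word
```

Words that resemble the training data tend to come out as a few long tokens, while unfamiliar terms break into many short pieces. That is one way a terminology mismatch between your documents and a model's vocabulary can show up.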

To optimize your search, you need to be able to measure its quality. You will learn several quality metrics for this purpose. Most vector databases use Hierarchical Navigable Small World (HNSW) graphs for approximate nearest-neighbor search. You’ll see how to balance the HNSW parameters for higher speed and maximum relevance. Finally, you’ll use different vector quantization techniques to reduce memory usage and improve search speed.
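As a rough illustration of the trade-off between memory and quality, the sketch below applies a simple form of scalar quantization (each float mapped to one of 256 levels) and then checks how much of the exact top-10 survives, using recall@k as the quality metric. This is a toy in plain Python, not the course's code or any database's implementation: the helper names, the random vectors, and the fixed `[-1, 1]` value range are all assumptions made for the example.

```python
import random


def quantize(vec, lo, hi):
    """Map each float (clamped to [lo, hi]) to an integer code in 0..255."""
    scale = (hi - lo) / 255
    return [round((min(max(x, lo), hi) - lo) / scale) for x in vec]


def dequantize(codes, lo, hi):
    """Map integer codes back to approximate float values."""
    scale = (hi - lo) / 255
    return [lo + c * scale for c in codes]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def top_k(query, vectors, k):
    """Return the indices of the k vectors with the highest dot product (brute force)."""
    scores = [(dot(query, v), i) for i, v in enumerate(vectors)]
    return [i for _, i in sorted(scores, reverse=True)[:k]]


def recall_at_k(exact_ids, approx_ids):
    """Fraction of the exact top-k results that the approximate search also returned."""
    return len(set(exact_ids) & set(approx_ids)) / len(exact_ids)


# Hypothetical random data, assumed to lie in [-1, 1].
random.seed(0)
LO, HI = -1.0, 1.0
vectors = [[random.uniform(LO, HI) for _ in range(32)] for _ in range(200)]
query = [random.uniform(LO, HI) for _ in range(32)]

codes = [quantize(v, LO, HI) for v in vectors]           # one small integer per dimension
approx_vectors = [dequantize(c, LO, HI) for c in codes]  # lossy reconstruction

exact = top_k(query, vectors, k=10)
approx = top_k(query, approx_vectors, k=10)
print(f"recall@10 after quantization: {recall_at_k(exact, approx):.2f}")
```

If the original values were stored as 4-byte floats and the codes as single bytes, the vectors would take about a quarter of the memory. The recall figure shows how much ranking quality that saving costs on this particular data.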

