Cobus GreylingSpeculative RAG By Google ResearchThis study shows how to enhance Retrieval Augmented Generation (RAG) through DraftingJul 12, 20241Jul 12, 20241
Bijit GhoshDesigning high-performing RAG systemsDesigning high-performing Retrieval Augmented Generation (RAG) systems, structured across the 5 main pillars :Mar 31, 20241Mar 31, 20241
Cobus GreylingRAT — Retrieval Augmented ThoughtsSynergising RAG With Sophisticated Long-Horizon ReasoningMar 13, 20242Mar 13, 20242
Cobus GreylingPlease Stop Saying Long Context Windows Will Replace RAGAnd I’m curious to know if anyone has innovative approaches to using long context windows efficiently?Mar 18, 20243Mar 18, 20243
InTDS ArchivebyMichał OleszakDesigning RAGsA guide to Retrieval-Augmented Generation design choices.Mar 14, 202410Mar 14, 202410
Cobus GreylingAgentic RAG: Context-Augmented OpenAI AgentsLlamaIndex has coined the phrase Agentic RAG…Agentic RAG can best be described as adding autonomous agent features to a RAG implementation.Mar 14, 2024Mar 14, 2024
MingWhy RAG is bigLLMs know a lot, but nothing about you. You may fine-tune, but that’s costly. Alternative? RAG. Plus, “ReAct” democratizes it for everyone.Jan 2, 2024Jan 2, 2024
InLevel Up CodingbyGao Dalie (高達烈)LangGraph + Corrective RAG + Local LLM = Powerful Rag ChatbotOne of the concerns with modern AI chatbots is their hallucinations This means they might give answers that are wrong or made-up.Feb 15, 20243Feb 15, 20243
DataStaxBuilding an Image Search App with a Vector Database and CLIP ModelsBy Aaron PloetzFeb 6, 2024Feb 6, 2024
InAI AdvancesbyKennedy Selvadurai, PhDVisualizing FAISS Vector Space to Understand its Influence on RAG PerformanceVisualizing embeddings using renumics-spotlight reveals useful insights into RAG generation behavior.Feb 26, 20244Feb 26, 20244
Cobus GreylingLanguage Model Quantization ExplainedSmall Language Models (SLMs) are very capable for NLG (Natural Language Generation, logic & common-sense reasoning, language understanding…Feb 27, 20241Feb 27, 20241
Bijit GhoshActiveRAG — Active LearningThe advent of large language models (LLMs) has ushered in a new era of conversational AI. These models can generate remarkably human-like…Feb 26, 20241Feb 26, 20241
Cobus GreylingAgentic RAG With LlamaIndexThe topic of Agentic RAG explores how agents can be incorporated into existing RAG pipelines for enhanced, conversational search and…Jan 30, 20243Jan 30, 20243
Cobus GreylingFine-Tuning or RAG?Comparing different LLM knowledge injection methods…Feb 21, 20241Feb 21, 20241
InWhyHow.AIbyChia Jeng YangWhy Gemini 1.5 (and other large context models) are bullish for RAGOptimization via RAG: How to overcome Accuracy, Cost, Latency and other performance limitations of large context models.Feb 18, 20248Feb 18, 20248
InLevel Up CodingbyFareed Khan100x Faster — Scaling Your RAG App for Billions of EmbeddingsComputing Cosine Similarity in parallelFeb 15, 20241Feb 15, 20241
InTDS ArchivebyDr. Varshita SherUsing LangChain ReAct Agents for Answering Multi-hop Questions in RAG SystemsUseful when answering complex queries on internal documents in a step-by-step manner with ReAct and Open AI Tools agents.Feb 15, 20248Feb 15, 20248
Cobus GreylingT-RAG = RAG + Fine-Tuning + Entity DetectionThe T-RAG approach is premised on combining RAG architecture with an open-source fine-tuned LLM and an entities tree vector database. The…Feb 15, 202411Feb 15, 202411
Neum AIRetrieval Augmented Generation at scale — Building a distributed system for synchronizing and…Technical and architectural details of how we synced and embedded 1 billion vectors for a RAG workflowSep 28, 20232Sep 28, 20232