DEV Community

# rag

Retrieval-augmented generation (RAG) is an architectural approach that improves the answers of large language model (LLM) applications by retrieving relevant custom data and supplying it to the model as context.

Posts

AIGoat - AI Security Playground to Attack and Defend LLMs. All Running Locally

2
Comments 1
3 min read
How I Built a Hallucination Detector for RAG Pipelines in Python

Comments 1
3 min read
The architecture of persistent AI memory: Beyond simple vector search

Comments
2 min read
Retrieval Finds Candidates. Reranking Finds the Right One.

2
Comments
4 min read
Compound AI Systems: How I Connect Multiple Models in a Single Production Product

Comments
2 min read
Why Your LLM Ignores Detailed Instructions (It's Not a Bug)

Comments
2 min read
Most GenAI chatbot tutorials stop at “call an LLM get an answer.”

Comments
1 min read
🚀 Beyond RAG: Simulating the Future with MiroFish

2
Comments
2 min read
Perfect Retrieval Recall on LongMemEval — Running Fully Local

Comments 1
4 min read
I Ran 500 More Agent Memory Experiments. The Real Problem Wasn’t Recall. It Was Binding.

Rigor beyond happy-path testing

56
Comments 29
14 min read
Beyond Vector Search: Building a Clause Forest (FoC) Architecture for Financial RAG

Comments
7 min read
🧠 Streaming LLM APIs Can Quietly Give Free Tokens

Comments
1 min read
How I caught a silent NaN bug in production RAG, by asking the system to debug itself

Comments
6 min read
Measuring RAG vs. Fine-tuning ROI for Agent Knowledge

Comments
9 min read
Neo4j graph database for GraphRAG, install, Cypher, vectors, ops

Comments
6 min read