DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Fine-Tuning Large Language Models Using AWS AI Services

Fine-Tuning Large Language Models Using AWS AI Services

Comments
5 min read
Your AI Agent Gets Dumber the More You Teach It. Skill Graphs Are the Fix.

Your AI Agent Gets Dumber the More You Teach It. Skill Graphs Are the Fix.

Comments
3 min read
Best GPU for Local AI & LLMs in 2026

Best GPU for Local AI & LLMs in 2026

Comments
4 min read
I built TrustLayer — an open-source trust layer for every AI tool you use

I built TrustLayer — an open-source trust layer for every AI tool you use

Comments
2 min read
Five Hard Problems in the MCP Ecosystem

Five Hard Problems in the MCP Ecosystem

3
Comments 2
9 min read
I got tired of my agents repeating the same mistakes, so I built a feedback loop for them — here's How it is worked

I got tired of my agents repeating the same mistakes, so I built a feedback loop for them — here's How it is worked

Comments
2 min read
Ollama, LM Studio, and GPT4All Are All Just llama.cpp — Here's Why Performance Still Differs

Ollama, LM Studio, and GPT4All Are All Just llama.cpp — Here's Why Performance Still Differs

Comments
6 min read
Prompt Injection Doesn't Come from Your Users

Prompt Injection Doesn't Come from Your Users

Comments
10 min read
Per-customer cost attribution without a proxy

Per-customer cost attribution without a proxy

Comments
3 min read
Anthropic Just Did Something Unprecedented: They Kept a Model Because It Was Too Good at Hacking

Anthropic Just Did Something Unprecedented: They Kept a Model Because It Was Too Good at Hacking

Comments
3 min read
GLM-5.1: The 754B Open Model That Writes Animated SVG

GLM-5.1: The 754B Open Model That Writes Animated SVG

Comments
1 min read
The Chinese Open-Source Model That Draws Pelicans Better Than GPT-4o

The Chinese Open-Source Model That Draws Pelicans Better Than GPT-4o

Comments
2 min read
LLM-as-Judge: using Claude to review a Gemini agent

LLM-as-Judge: using Claude to review a Gemini agent

Comments
7 min read
TurboQuant: How a Simple Spin Saves Gigabytes of GPU Memory

TurboQuant: How a Simple Spin Saves Gigabytes of GPU Memory

Comments
6 min read
99.8% of LLM Inference Power Isn't Spent on Computation

99.8% of LLM Inference Power Isn't Spent on Computation

Comments
7 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.