DEV Community

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Why does paying more make your LLM reply faster?

Why does paying more make your LLM reply faster?

1
Comments 1
3 min read
Vertical Cognitive Depth and Structured Reasoning: A Practical Hypothesis for Robust Behavior Beyond Training Data

Vertical Cognitive Depth and Structured Reasoning: A Practical Hypothesis for Robust Behavior Beyond Training Data

Comments
6 min read
The Softmax Bottleneck: Why Making LLMs Bigger Doesn't Always Make Them Smarter

The Softmax Bottleneck: Why Making LLMs Bigger Doesn't Always Make Them Smarter

1
Comments
4 min read
The Exact Prompt Engineering That Makes Our Voice AI Sound Human (Full Prompts Included)

The Exact Prompt Engineering That Makes Our Voice AI Sound Human (Full Prompts Included)

Comments
6 min read
How Large Language Models Work — From Transformers to Conversational AI

How Large Language Models Work — From Transformers to Conversational AI

Comments
4 min read
Lost in the Middle: Why LLMs Quietly Ignore the Centre of Their Own Context Window

Lost in the Middle: Why LLMs Quietly Ignore the Centre of Their Own Context Window

Comments
3 min read
Stop Guessing Which Weights Your Neural Network Actually Learned: Deterministic Initialization That Tracks Every Change

Stop Guessing Which Weights Your Neural Network Actually Learned: Deterministic Initialization That Tracks Every Change

Comments
6 min read
VXN-RAMNet (VisionX Routine Adaptive Memory Network)

VXN-RAMNet (VisionX Routine Adaptive Memory Network)

Comments
1 min read
Generation 1 — Standalone Models (2018–2022)

Generation 1 — Standalone Models (2018–2022)

Comments
5 min read
The Paper That Taught Neural Networks to Learn Backwards

The Paper That Taught Neural Networks to Learn Backwards

Comments
13 min read
How Deep Learning Architectures Evolved — From DNNs to Transformers

How Deep Learning Architectures Evolved — From DNNs to Transformers

Comments
3 min read
# What LoRA Actually Adapts and Why Higher Rank Doesn't Always Buy What It Looks Like It Should Explainer by: Eyoel Nebiyu

# What LoRA Actually Adapts and Why Higher Rank Doesn't Always Buy What It Looks Like It Should Explainer by: Eyoel Nebiyu

Comments
5 min read
Did My LoRA Learn Tenacious Style—or Just Memorize Augmented Patterns?

Did My LoRA Learn Tenacious Style—or Just Memorize Augmented Patterns?

Comments
3 min read
LLM Study Diary #3: PyTorch

LLM Study Diary #3: PyTorch

Comments
2 min read
🐍 The "Production-Ready" Miniconda Cheatsheet: From Homebrew to JupyterLab

🐍 The "Production-Ready" Miniconda Cheatsheet: From Homebrew to JupyterLab

Comments
3 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.