DEV Community

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Fundamentals of Large Language Models: Understanding LLM Architectures

Fundamentals of Large Language Models: Understanding LLM Architectures

Comments
5 min read
Zero-Degradation Training: 92% ImageNet-100 Accuracy with 61% Energy Savings

Zero-Degradation Training: 92% ImageNet-100 Accuracy with 61% Energy Savings

Comments
4 min read
My Model Cheated: How Grad-CAM Exposed a 95% Accuracy Lie

My Model Cheated: How Grad-CAM Exposed a 95% Accuracy Lie

Comments
3 min read
🧑‍🚀 LLM Engine Telemetry: How to Profile Models and See Where Performance is Lost

🧑‍🚀 LLM Engine Telemetry: How to Profile Models and See Where Performance is Lost

Comments
5 min read
Developing a Variational Autoencoder in JAX using Antigravity

Developing a Variational Autoencoder in JAX using Antigravity

Comments
6 min read
Majestic Labs vs. the Memory Wall

Majestic Labs vs. the Memory Wall

6
Comments
5 min read
Tokenization in NLP: The Foundational Step That Turns Language Into Data

Tokenization in NLP: The Foundational Step That Turns Language Into Data

Comments
3 min read
Linear Algebra for AI — Part 1

Linear Algebra for AI — Part 1

1
Comments
2 min read
🦄 When ML Models Go Wild: Unintentional Art Created by Neural Networks

🦄 When ML Models Go Wild: Unintentional Art Created by Neural Networks

Comments 1
5 min read
Transformers and Attention: How LLMs Actually Process Text

Transformers and Attention: How LLMs Actually Process Text

3
Comments
19 min read
Building a 75,000-Product Image Feature Dataset for the Amazon ML Challenge 2025

Building a 75,000-Product Image Feature Dataset for the Amazon ML Challenge 2025

1
Comments
4 min read
DragonMemory: Neural Sequence Compression for Production RAG

DragonMemory: Neural Sequence Compression for Production RAG

2
Comments
8 min read
Observations from Finetuning Gemma Model on Strix Halo (Fedora 43)

Observations from Finetuning Gemma Model on Strix Halo (Fedora 43)

Comments
3 min read
How I Reached 84.35% on CIFAR-100 Using ResNet-50 (PyTorch Guide)

How I Reached 84.35% on CIFAR-100 Using ResNet-50 (PyTorch Guide)

Comments 1
2 min read
Nested Learning — My Reflections on a Model That Learns How to Learn

Nested Learning — My Reflections on a Model That Learns How to Learn

3
Comments
8 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.