DEV Community

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Backpropagation explained by coding it

Backpropagation explained by coding it

Comments
2 min read
有人在拆 Transformer:Memory Caching 與 CTM 各拆走了一半

有人在拆 Transformer:Memory Caching 與 CTM 各拆走了一半

Comments
3 min read
SwarmLens Cognitive Index

SwarmLens Cognitive Index

1
Comments
6 min read
Flash Attention: what it does and why it matters

Flash Attention: what it does and why it matters

Comments
8 min read
Flash Attention: what it does and why it matters

Flash Attention: what it does and why it matters

Comments
8 min read
SFT Offline RL Online RL: The Three-Stage Training Pipeline Behind Mano-P

SFT Offline RL Online RL: The Three-Stage Training Pipeline Behind Mano-P

1
Comments
8 min read
A11: A Structured Way to Not Lie to Yourself During Reasoning

A11: A Structured Way to Not Lie to Yourself During Reasoning

Comments
3 min read
Day 8 — Beginning My Journey into Neural Networks

Day 8 — Beginning My Journey into Neural Networks

Comments
1 min read
Deep Learning Is More Logistic Regression Than You Think

Deep Learning Is More Logistic Regression Than You Think

Comments
4 min read
Better Data Beats Better Algorithms: Before Changing the Model, Change the Data

Better Data Beats Better Algorithms: Before Changing the Model, Change the Data

Comments
3 min read
Understanding Attention in Transformers — Intuition Before Equations

Understanding Attention in Transformers — Intuition Before Equations

Comments
3 min read
PyTorch from Scratch — Part 1: Tensors, Gradients & Activations

PyTorch from Scratch — Part 1: Tensors, Gradients & Activations

1
Comments
5 min read
What Does a Product Data Scientist Actually Do?

What Does a Product Data Scientist Actually Do?

Comments
2 min read
A11: A Structural Answer to AI Collapse

A11: A Structural Answer to AI Collapse

Comments
3 min read
Gemma 4 12B shows how far local multimodal AI has moved

Gemma 4 12B shows how far local multimodal AI has moved

Comments
5 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.