DEV Community

# costoptimization

Practical strategies and stories about reducing cloud infrastructure costs.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
I Processed 2.4 Billion Tokens Across 52 AI Models for $0.52. Here's the Full Breakdown.

I Processed 2.4 Billion Tokens Across 52 AI Models for $0.52. Here's the Full Breakdown.

Comments
3 min read
How to optimize costs without adding servers: a cloud cost optimization guide

How to optimize costs without adding servers: a cloud cost optimization guide

Comments
3 min read
Model routing by task type: the savings math, the classifier overhead, and the A/B that proves it

Model routing by task type: the savings math, the classifier overhead, and the A/B that proves it

Comments
12 min read
4 Pitfalls Discovered After Migrating from Anthropic to Gemini

4 Pitfalls Discovered After Migrating from Anthropic to Gemini

Comments
4 min read
Vertex AI Grounding Cost Gap: Diagnosing the Missing $1300 on My Solo VM

Vertex AI Grounding Cost Gap: Diagnosing the Missing $1300 on My Solo VM

Comments
3 min read
Problem Framing

Problem Framing

Comments
5 min read
Your AI bill, minus the AI you've already paid for

Your AI bill, minus the AI you've already paid for

Comments
5 min read
The hidden cost of streaming LLMs: caches you can't use, bills you don't expect, and complexity you don't need

The hidden cost of streaming LLMs: caches you can't use, bills you don't expect, and complexity you don't need

Comments 1
11 min read
We Cut Our AI Agent Costs by 60%. Here's What Worked.

We Cut Our AI Agent Costs by 60%. Here's What Worked.

Comments 2
3 min read
CloudFront Cache Invalidation Costs Are Eating Into Our AWS Budget — Here's the Fix We Wish We Knew

CloudFront Cache Invalidation Costs Are Eating Into Our AWS Budget — Here's the Fix We Wish We Knew

2
Comments
4 min read
Kubernetes 1.36 Pod-Level Resource Managers: Advanced Resource Optimization in Production

Kubernetes 1.36 Pod-Level Resource Managers: Advanced Resource Optimization in Production

Comments
6 min read
Kubernetes 1.36 Pod-Level Resource Managers: Advanced Resource Optimization in Production

Kubernetes 1.36 Pod-Level Resource Managers: Advanced Resource Optimization in Production

Comments
6 min read
ARES: Cut LLM Agent Reasoning Costs 52% Per Step

ARES: Cut LLM Agent Reasoning Costs 52% Per Step

Comments
7 min read
I Cut My LLM API Bill by 73% — Here's the Exact Optimization Playbook

I Cut My LLM API Bill by 73% — Here's the Exact Optimization Playbook

Comments
5 min read
How a fintech startup cut cloud costs 65% with an open-source sovereign stack

How a fintech startup cut cloud costs 65% with an open-source sovereign stack

Comments
2 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.