DEV Community

# vram

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Qwen 3.6 35B-A3B for Local AI in 2026: The 24GB VRAM Line That Gets You 120 tok/s

Qwen 3.6 35B-A3B for Local AI in 2026: The 24GB VRAM Line That Gets You 120 tok/s

Comments
6 min read
How to Tune llama.cpp --n-gpu-layers: A Practical VRAM Guide (2026)

How to Tune llama.cpp --n-gpu-layers: A Practical VRAM Guide (2026)

Comments
3 min read
How to Tune --n-gpu-layers for Your VRAM Budget

How to Tune --n-gpu-layers for Your VRAM Budget

Comments
4 min read
Local LLM Hardware Requirements in 2026: What You Actually Need for Every Model Tier [Guide]

Local LLM Hardware Requirements in 2026: What You Actually Need for Every Model Tier [Guide]

Comments
8 min read
Best Local AI Models for Each VRAM Tier (4 GB to 80 GB) in 2026

Best Local AI Models for Each VRAM Tier (4 GB to 80 GB) in 2026

Comments
6 min read
Best GPU for Llama 70B in 2026 (48GB+ VRAM Required)

Best GPU for Llama 70B in 2026 (48GB+ VRAM Required)

Comments
6 min read
VRAMを増やせば解決する、は物理的に間違っている — HBM・CXL・Unified Memoryが取れなかったもの

VRAMを増やせば解決する、は物理的に間違っている — HBM・CXL・Unified Memoryが取れなかったもの

Comments
4 min read
Q4 KV Cache Fit 32K Context into 8GB VRAM — Only Math Broke

Q4 KV Cache Fit 32K Context into 8GB VRAM — Only Math Broke

Comments
8 min read
I built a duty-cycle throttler for my RTX 4060 (because undervolting wasn't enough)

I built a duty-cycle throttler for my RTX 4060 (because undervolting wasn't enough)

Comments
4 min read
Unleash Large AI Models: Extend GPU VRAM with System RAM (Nvidia Greenboost)

Unleash Large AI Models: Extend GPU VRAM with System RAM (Nvidia Greenboost)

Comments
17 min read
Cloud LLMs vs Local Models: Can 32GB of VRAM Actually Compete with Claude Opus?

Cloud LLMs vs Local Models: Can 32GB of VRAM Actually Compete with Claude Opus?

Comments
4 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.