Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
vram
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Qwen 3.6 35B-A3B for Local AI in 2026: The 24GB VRAM Line That Gets You 120 tok/s
Jovan Chan
Jovan Chan
Jovan Chan
Follow
Jun 11
Qwen 3.6 35B-A3B for Local AI in 2026: The 24GB VRAM Line That Gets You 120 tok/s
#
qwen
#
localllm
#
gpu
#
vram
Comments
Add Comment
6 min read
How to Tune llama.cpp --n-gpu-layers: A Practical VRAM Guide (2026)
Patrick Hughes
Patrick Hughes
Patrick Hughes
Follow
Jun 9
How to Tune llama.cpp --n-gpu-layers: A Practical VRAM Guide (2026)
#
localllm
#
llamacpp
#
gpu
#
vram
Comments
Add Comment
3 min read
How to Tune --n-gpu-layers for Your VRAM Budget
Patrick Hughes
Patrick Hughes
Patrick Hughes
Follow
Jun 8
How to Tune --n-gpu-layers for Your VRAM Budget
#
localllm
#
llamacpp
#
gpu
#
vram
Comments
Add Comment
4 min read
Local LLM Hardware Requirements in 2026: What You Actually Need for Every Model Tier [Guide]
Kunal
Kunal
Kunal
Follow
Jun 7
Local LLM Hardware Requirements in 2026: What You Actually Need for Every Model Tier [Guide]
#
localllm
#
hardware
#
vram
#
gpu
Comments
Add Comment
8 min read
Best Local AI Models for Each VRAM Tier (4 GB to 80 GB) in 2026
Jovan Chan
Jovan Chan
Jovan Chan
Follow
Jun 2
Best Local AI Models for Each VRAM Tier (4 GB to 80 GB) in 2026
#
localai
#
vram
#
hardware
#
gpu
Comments
Add Comment
6 min read
Best GPU for Llama 70B in 2026 (48GB+ VRAM Required)
Thurmon Demich
Thurmon Demich
Thurmon Demich
Follow
May 15
Best GPU for Llama 70B in 2026 (48GB+ VRAM Required)
#
gpu
#
llama
#
70b
#
vram
Comments
Add Comment
6 min read
VRAMを増やせば解決する、は物理的に間違っている — HBM・CXL・Unified Memoryが取れなかったもの
plasmon
plasmon
plasmon
Follow
Apr 14
VRAMを増やせば解決する、は物理的に間違っている — HBM・CXL・Unified Memoryが取れなかったもの
#
llm
#
gpu
#
vram
Comments
Add Comment
4 min read
Q4 KV Cache Fit 32K Context into 8GB VRAM — Only Math Broke
plasmon
plasmon
plasmon
Follow
Apr 8
Q4 KV Cache Fit 32K Context into 8GB VRAM — Only Math Broke
#
llm
#
quantization
#
vram
#
localllm
Comments
Add Comment
8 min read
I built a duty-cycle throttler for my RTX 4060 (because undervolting wasn't enough)
Yaroslav Pristupa
Yaroslav Pristupa
Yaroslav Pristupa
Follow
Apr 6
I built a duty-cycle throttler for my RTX 4060 (because undervolting wasn't enough)
#
softwaredevelopment
#
gpu
#
vram
#
hardware
Comments
Add Comment
4 min read
Unleash Large AI Models: Extend GPU VRAM with System RAM (Nvidia Greenboost)
Umair Bilal
Umair Bilal
Umair Bilal
Follow
Mar 19
Unleash Large AI Models: Extend GPU VRAM with System RAM (Nvidia Greenboost)
#
nvidia
#
gpu
#
vram
#
ai
Comments
Add Comment
17 min read
Cloud LLMs vs Local Models: Can 32GB of VRAM Actually Compete with Claude Opus?
Alan West
Alan West
Alan West
Follow
Mar 25
Cloud LLMs vs Local Models: Can 32GB of VRAM Actually Compete with Claude Opus?
#
localllm
#
claudeopus
#
ollama
#
vram
Comments
Add Comment
4 min read
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account