DEV Community
#localllm
Posts
Qwen 3.6 35B-A3B for Local AI in 2026: The 24GB VRAM Line That Gets You 120 tok/s
Jovan Chan
Jun 11
#qwen #localllm #gpu #vram
6 min read
How to Tune llama.cpp --n-gpu-layers: A Practical VRAM Guide (2026)
Patrick Hughes
Jun 9
#localllm #llamacpp #gpu #vram
3 min read
How to Tune --n-gpu-layers for Your VRAM Budget
Patrick Hughes
Jun 8
#localllm #llamacpp #gpu #vram
4 min read
Open-LLM-VTuber Review: Offline AI Companion with Live2D
Andrew
Jun 8
#openllmvtuber #live2d #localllm #ollama
10 min read
Local LLM Hardware Requirements in 2026: What You Actually Need for Every Model Tier [Guide]
Kunal
Jun 7
#localllm #hardware #vram #gpu
8 min read
Hermes Agent Desktop Free With Local LLMs: The Claude Code Alternative Nobody's Billing You For [2026]
Kunal
Jun 5
#hermesagent #localllm #claudecodealternative #llamacpp
8 min read
[Day 11] I turned my cat into anime art — and the AI drew a human girl instead. One photo through IPAdapter pulls it back to a cat
PEPPERCORN
Jun 4
#localllm #ai #dgxspark #stablediffusion
5 min read
Run Cursor with a Local Model: Privacy-First AI Coding Without a Subscription
Jovan Chan
Jun 2
#aicoding #cursor #localllm #privacy
5 min read
Qwen3-Coder-Next review 2026: 80B params, 3B active, and the cheapest credible coding agent API
Jovan Chan
Jun 2
#qwen #localllm #review #opensource
5 min read
Qwen3-Coder-Next for Local AI in 2026: Which GPU Can Actually Run Alibaba's #1 Coding Agent?
Jovan Chan
Jun 2
#localllm #codingai #gpuguide #qwen
6 min read
RTX 5060 for Local AI in 2026: When 448 GB/s Hits an 8GB Wall
Jovan Chan
Jun 2
#gpu #nvidia #rtx5060 #localllm
6 min read
Gemma 4 12B vs GPT-4o Mini vs Claude Haiku: Is Google's Local LLM Good Enough to Replace API Calls? [2026]
Kunal
Jun 4
#gemma4 #localllm #googleai #ollama
7 min read
We pre-registered, ran, and verified the macro ablation: information per joule, measured
Cheng Qian
Jun 2
#llm #localllm #opensource #ai
3 min read
We ported how brains manage the cost of thinking to LLM systems
Cheng Qian
May 31
#opensource #ai #llm #localllm
2 comments
9 min read
Localmaxxing isn't theory. Here's what my 3-GPU rig actually does.
Patrick Hughes
May 15
#localllm #aieconomics #agentcostcontrol #gpuinference
4 min read