Skip to content

DEV Community

# localllm

👋 Sign in for the ability to sort posts by relevant, latest, or top.

Jovan Chan

Jun 11

Qwen 3.6 35B-A3B for Local AI in 2026: The 24GB VRAM Line That Gets You 120 tok/s

#qwen #localllm #gpu #vram

6 min read

Jun 9

How to Tune llama.cpp --n-gpu-layers: A Practical VRAM Guide (2026)

#localllm #llamacpp #gpu #vram

3 min read

Jun 8

How to Tune --n-gpu-layers for Your VRAM Budget

#localllm #llamacpp #gpu #vram

4 min read

Andrew

Jun 8

Open-LLM-VTuber Review: Offline AI Companion with Live2D

#openllmvtuber #live2d #localllm #ollama

10 min read

Kunal

Jun 7

Local LLM Hardware Requirements in 2026: What You Actually Need for Every Model Tier [Guide]

#localllm #hardware #vram #gpu

8 min read

Kunal

Jun 5

Hermes Agent Desktop Free With Local LLMs: The Claude Code Alternative Nobody's Billing You For [2026]

#hermesagent #localllm #claudecodealternative #llamacpp

8 min read

PEPPERCORN

Jun 4

[Day 11] I turned my cat into anime art — and the AI drew a human girl instead. One photo through IPAdapter pulls it back to a cat

#localllm #ai #dgxspark #stablediffusion

5 min read

Jovan Chan

Jun 2

Run Cursor with a Local Model: Privacy-First AI Coding Without a Subscription

#aicoding #cursor #localllm #privacy

5 min read

Jovan Chan

Jun 2

Qwen3-Coder-Next review 2026: 80B params, 3B active, and the cheapest credible coding agent API

#qwen #localllm #review #opensource

5 min read

Jovan Chan

Jun 2

Qwen3-Coder-Next for Local AI in 2026: Which GPU Can Actually Run Alibaba's #1 Coding Agent?

#localllm #codingai #gpuguide #qwen

6 min read

Jovan Chan

Jun 2

RTX 5060 for Local AI in 2026: When 448 GB/s Hits an 8GB Wall

#gpu #nvidia #rtx5060 #localllm

6 min read

Kunal

Jun 4

Gemma 4 12B vs GPT-4o Mini vs Claude Haiku: Is Google's Local LLM Good Enough to Replace API Calls? [2026]

#gemma4 #localllm #googleai #ollama

7 min read

Cheng Qian

Jun 2

We pre-registered, ran, and verified the macro ablation: information per joule, measured

#llm #localllm #opensource #ai

3 min read

Cheng Qian

May 31

We ported how brains manage the cost of thinking to LLM systems

#opensource #ai #llm #localllm

9 min read

May 15

Localmaxxing isn't theory. Here's what my 3-GPU rig actually does.

#localllm #aieconomics #agentcostcontrol #gpuinference

4 min read

👋 Sign in for the ability to sort posts by relevant, latest, or top.