DEV Community

# gpu

Posts

AMD GFX1156 Driver Prep, Intel OIDN 2.5 GPU Gains, NVIDIA RTX Accelerates DiffusionGemma

4 min read
How to Pick a GGUF Quant Level for Your VRAM Budget

3 min read
How ComputePool allocates work across a peer-to-peer GPU mesh in under 50ms

4 min read
Qwen 3.6 35B-A3B for Local AI in 2026: The 24GB VRAM Line That Gets You 120 tok/s

6 min read
INT8 Q/DQ Calibration on Blackwell: 1.8× the TRT 10 + FP16 Baseline

7 min read
Flash Attention: what it does and why it matters

8 min read
Generation-Side Tooling Outpaces Validation-Side Tooling

3 min read
CUDA for AMD Lemonade, Intel Arc Pro Linux Gains, XPU Manager 2.0

3 min read
Vortex 3.0 RISC-V GPGPU, Pragtical SDL GPU Backend, NVIDIA RTX Spark Launch

4 min read
GPU_WORKLOAD_MISMATCH: A Novel Security Finding Category for AI Container Workloads

9 min read
How to Tune llama.cpp --n-gpu-layers: A Practical VRAM Guide (2026)

3 min read
Wave-Level GPU Introspection Was Already in Production (Server Side)

5 min read
How to Tune --n-gpu-layers for Your VRAM Budget

4 min read
I Tested 9 Serverless GPU Providers for AI Inference in 2026. Here's What I'd Actually Use

19 min read