Blog

#hardware

Posts on AI engineering, LLM systems, and software development.

Sort:

Local LlmsApril 24, 2026#7

Every machine can run a local LLM (here's what fits)

A per-tier guide to running local LLMs in 2026, from 8GB integrated graphics to a 192GB Mac Studio. Specific models, specific speeds, specific configs.

AI Hardware LLM Local Llms VRAM

Read →

AI RunningMarch 23, 2026#4

Why Apple Silicon punches above its weight on local LLMs

Unified memory lets the GPU see all of RAM. Here's why that beats a discrete-GPU PC past 32B parameters, what fits in 16/32/64/128/192GB, and where Apple Silicon still loses.

AI AI Running Apple Silicon Hardware LLM

Read →

AI RunningMarch 21, 2026#3

What it takes to run a model on your own machine

Why VRAM is the one number that decides whether a local LLM runs, what quantization really does to a model file, and the hardware ladder from an 8GB laptop to a 192GB workstation.

AI AI Running Hardware LLM Quantization

Read →

AI RunningMarch 16, 2026#1

Where AI actually runs: cloud, local, edge

When you use AI, a model file is sitting on a real machine. There are only three places it can be, and which one decides almost everything else.

AI Edge Hardware Inference LLM

Read →