Apple Silicon LLM Inference Optimization: The Complete Guide to Maximum Performance — @dylan
A comprehensive guide to maximizing LLM inference performance on Apple Silicon — MLX vs llama.cpp benchmarks, quantization formats, RAM requirements, MoE models, speculative decoding, KV cache optimization, and the best models for every Mac configuration.
