Mac Pro's Decade-Old GPUs Now Run LLMs via Vulkan Support

A 2016 Mac Pro with AMD D700 GPUs—once considered obsolete for machine learning—is now running inference workloads after Linux kernel updates enabled Vulkan support for the machine's legacy southern islands GPU architecture.

The original poster reports achieving 11 tokens per second on Qwen 3.5 9B ...

Sign in to read the full analysis

Free — just an email. Get full analysis on LLM unit economics, plus the weekly Cost-of-Inference column.

Get started for free Sign in

Method & sources

Source type: Primary publication (lab/vendor blog) — our analysis + implication
Source link: r/localllama
Published: 2026-05-27 02:41:03 UTC
Byline: By the gotcontext.ai team (editorial standards)
Correction?: corrections@gotcontext.ai

← All Intelligence