The race to build a distributed GPU runtime

Apple Silicon GPU Support in Mojo

Show HN: Run Qwen3-Next-80B on 8GB GPU at 1tok/2s throughput

Gluon: a GPU programming language based on the same compiler stack as Triton

Computer arithmetic: Arbitrary Precision from scratch on a GPU

Java for AI: GPU support from pure-Java inference to land in LangChain4j

Apple's A19 Pro beats Ryzen 9 9950X in single-thread Geekbench tests — iPhone 17 Pro chip packs 11-12% CPU performance bump, GPU performance up 37% over predecessor

AMD’s RDNA4 GPU architecture

Nvidia Dominates GPU Shipments With 94% Share

Evolution of GPU Programming: From Smart Pixels to the Backbone of an AI-driven World