A New Era for GPU Programming: NVIDIA Finally Adds Native Python Support to CUDA

CubeCL: GPU Kernels in Rust for CUDA, ROCm, and WGPU

Lossless LLM compression for efficient GPU inference via dynamic-length float

VRAM Pro: Allocate more GPU memory on your Mac (menubar utility)

The AI industry doesn’t know if Trump just killed its GPU supply

Show HN: Neurox – GPU Observability for AI Infra

Dynamic Register Allocation on AMD's RDNA 4 GPU Architecture

Next-Gen GPU Programming: Hands-On with Mojo and Max Modular HQ

PanVK is officially Vulkan 1.1 conformant on the Arm Mali-G610 GPU

gpu-benchmark: Python CLI tool for benchmarking GPU performance with Stable Diffusion

How is AMD GPU for ML??

Building a Fast, SIMD/GPU-Friendly Random Number Generator for Fun and Profit

GPU.js Isn’t Dead — It’s Powerful, Versatile, and can be paired with other languages (Great for GPU-Accelerated Heatmaps!)

GPU Computing 101

Analyzing Modern NVIDIA GPU cores

Gemma3 – The current strongest model that fits on a single GPU

Asahi Lina Pausing Work on Apple GPU Linux Driver Development

Is the book Mastering GPU Architecture by Edward R. Deforest good for someone who wants to learn GPU arch?

GPU Compiler Interview

Lisa Su Says Radeon RX 9000 Series Is AMD's Most Successful GPU Launch Ever

Bolt Graphics Zeus a New GPU Architecture with Up to 2.25TB of Memory and 800GbE

Nvidia GPU roadmap confirms it: Moore's Law is dead and buried

Minecraft clone showcasing the SDL3 GPU API

Qualcomm GPU compiler engineer position interview

Looking Ahead at Intel's Xe3 GPU Architecture

Zoltan's FLOPs – GPU mini-grant, 1st iteration

AMD's GPU market share in Japan hits all-time high of 45%, aims for 70% | "AMD isn't used to selling so many graphics cards"

Startup Claims Its Upcoming (RISC-V ISA) Zeus GPU is 10X Faster Than Nvidia's RTX 5090

Haiku loves Nvidia (porting Nvidia GPU driver)

Google calls Gemma 3 the most powerful AI model you can run on one GPU
