Intel, NVIDIA, AMD GPU Drivers Finally Play Nice With ReactOS

Right-sizes LLM models to your system's RAM, CPU, and GPU

I pushed an AMD GPU to its limits for ZKPs: 18ms NTT and 2.5s FRI Proving via Zero-Copy and Algorithmic Dimensionality Reduction

Where are the places I can rent GPU?

Show HN: Horizon – GPU-accelerated infinite-canvas terminal in Rust

Autoresearch: Agents researching on single-GPU nanochat training automatically

AXIOM: Built a sparse dynamic routing architecture for LLM inference entirely in Rust. No ML frameworks, no GPU, 1.2M parameters

Track real-time GPU and LLM pricing across all cloud and inference providers

I built a computing environment in Rust where every program is AI-generated, compiled to WASM, and GPU-rendered via wgpu

GPU-accelerated declarative plotting in WebGL – introducing Gladly

ORE: A process manager written in Rust to schedule GPU resources and prevent security vulnerabilities and VRAM OOM (using Tokio Semaphores & Axum) for local LLMs

GPU Rack Power Density, 2015–2025

Async/Await on the GPU

How NVIDIA's CuTe replaces GPU index arithmetic with composable layout algebra

Testing "Raw" GPU Cache Latency

Tiny-gpu-compiler: An educational MLIR-based compiler targeting open-source GPU hardware

Parakeet.cpp – Parakeet ASR inference in pure C++ with Metal GPU acceleration

Show HN: A physically-based GPU ray tracer written in Julia

mdpt: Markdown TUI slides with GPU rendering (not terminal-dependent) — Rust

Numr: A high-performance numerical computing library with GPU acceleration

The Future for Tyr, a Rust GPU Driver for Arm Mali Hardware

Attyx – tiny and fast GPU-accelerated terminal emulator written in Zig

Show HN: Llama 3.1 70B on a single RTX 3090 via NVMe-to-GPU bypassing the CPU

Blinc: A declarative, reactive UI system with first-class state machines, spring physics animations, and GPU-accelerated rendering

Which GPU should I get for AI training/inference/fine-tuning?

A browser benchmark that actually uses all your CPU/GPU cores

I got 14.84x GPU speedup by studying how octopus arms coordinate
