Show HN: A new benchmark for testing LLMs for deterministic outputs

Testing OpenGraph on localhost from the CLI before you go public

Meaning in life and the mental health - addiction spiral: testing a unifying model

CDC pauses lab testing of rabies, monkeypox and other diseases

Thermal damage from focused ion beam milling can compromise microscopic cement testing, leading to the creation of optimized low-energy protocols at Czech Technical University in Prague

Waymo's Robot Car Testing Ends in NYC After Permits Expire

A communist Apple II and fourteen years of not knowing what you're testing

Announcing WayDriver — a Rust library for functional testing of Wayland apps (Playwright-style)

Deterministic Primality Testing for Limited Bit Width

Mysteries of Dropbox: Testing of a Distributed Sync Service (2016) [pdf]

Model-Based Testing for Dungeons & Dragons

Show HN: Finalrun – Spec-driven testing using English and vision for mobile apps

soak testing a desktop app in zig

A case study in testing with 100+ Claude agents in parallel

Live Life on the Edge: A Layered Strategy for Testing Data Models

Testing a New Product for Data Science Beginners

Development Driven Testing: Why TDD Is Not the Best Approach

Gopher Glide (gg) — Zero-scripting API load testing in Go with Behavioral Profiling Snapshots, Semantic Diffing, and a JetBrains Plugin

Repos Set up for Testing DLL on Different Architectures

Working software runs locally

Hegel, a universal property-based testing protocol and family of PBT libraries

The Cost of Concurrency Coordination

Jepsen: MariaDB Galera Cluster 12.1.2

Big-Endian Testing with QEMU

RocksDB development finds a CPU bug

Scaling a Monolith to 1M LOC: 113 Pragmatic Lessons from Tech Lead to CTO

Meta starts testing a premium subscription on Instagram

Nvidia "confirms" DLSS 5 relies on 2D frame data as testing reveals hallucinations

Cybersecurity stocks slumped on Friday following a report that Anthropic is testing a powerful new artificial intelligence model that is more advanced in cyber capabilities and also presents potential security risks.

Hegel - a property-based testing library from the authors of Hypothesis

More →