Python Performance: Why 'if not list' is 2x Faster Than Using len()

Hardware-Aware Coding: CPU Architecture Concepts Every Developer Should Know

A Software Engineer's Guide to Reading Research Papers

Context Switching and Performance: What Every Developer Should Know

GPU Computing 101

How Unix spell ran in 64kb RAM

Linux Context Switching Internals: Process State and Memory

The CAP theorem of Clustering: Why Every Algorithm Must Sacrifice Something

CPython's Garbage Collector and Its Impact on Application Performance

An Unreachable Hidden XKCD Easter Egg inside CPython

The Pythonic Emptiness

Disillusioning the Magic of the fork System Call

Disillusioning the Magic of the Fork System Call

What Every Developer Should Know About GPU Computing (2023)

A Selective Survey of Efficient Speculative Decoding Techniques for LLM Inference

CPython Runtime Internals: Key Data Structures & Runtime Bootstrapping

The Design and Implementation of the CPython Virtual Machine

Two Threads, One Core: How Simultaneous Multithreading Works Under the Hood

Recent Performance Improvements in Function Calls in CPython

How simultaneous multithreading works under the hood

All Set! Looking Under the Hood of Python's Set Data Structure

CPython Garbage Collection: The Internal Mechanics and Algorithms

How Python Compares Floats and Ints: When Equals Isn’t Really Equal