CMU TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Emergent Misalignment: Narrow Finetuning Can Produce Broadly Misaligned LLMs

LLM text chat is everywhere. Who’s optimizing its UX?

LLMs as Unbiased Oracles

After months of coding with LLMs, I'm going back to using my brain

GitHub - FireBird-Technologies/Auto-Analyst: Open-source AI-powered data science platform. LLM agnostic/MIT license, freemium

Show HN: Merliot – plugging physical devices into LLMs

New Life Hack: Using LLMs to Generate Constraint Solver Programs for Personal Logistics Tasks

Run LLMs on Apple Neural Engine (ANE)

Ask HN: Anyone working in traditional ML/stats research instead of LLMs?

Show HN: A free AI risk assessment tool for LLM applications

LLM-D: Kubernetes-Native Distributed Inference at Scale

Google Gemini has the worst LLM API

Explain LLMs like I am 5

Show HN: Clippy – 90s UI for local LLMs

Show HN: Min.js style compression of tech docs for LLM context

The Em Dash Conspiracy: More and More of Reddit Is from LLMs

LLM Mental offloading and brain drain

Show HN: Use Third Party LLM API in JetBrains AI Assistant

Introducing doc-scraper: A Go-Based Web Crawler for LLM Documentation

xAI dev leaks API key for private SpaceX, Tesla LLMs

My CLI tool "Lumen" is helping devs give LLMs better project context, 47 stars & 1400+ downloads in 2 weeks!

LLM-God (A way to prompt multiple LLMs at the same time)!

BioStarsGPT – Fine-tuning LLMs on Bioinformatics Q&A Data

New LLM Release (v1.2.8): Voice-to-LLM-to-Voice is now possible!

LLM-D: Kubernetes-Native Distributed Inference

Pure "HTML first" JS library to connect LLMs with input/textarea elements

Bypassing Hallucinations in LLMs

LLM-God (Prompt multiple LLMs at once!)

How are you tracking usage and cost across LLM APIs like OpenAI and Anthropic?
