Loading...

Tag trends are in beta. Feedback? Thoughts? Email me at [email protected]

Beyond Semantics: Unreasonable Effectiveness of Reasonless Intermediate Tokens

A Formal Proof of Complexity Bounds on Diophantine Equations

Using Large Language Models for Commit Message Generation: A Preliminary Study

Why it is (nearly) impossible that we live in a simulation

X X^t can be faster

Byte latent transformer: Patches scale better than tokens

Sharp Knives Reduce Onion-Induced Tears By Limiting Droplet Spray, Study Finds

LLMs are more persuasive than incentivized human persuaders

Comparing Parallel Functional Array Languages: Programming and Performance

SUS backprop: linear backpropagation algorithm for long inputs in transformers

Base Models Beat Aligned Models at Randomness and Creativity

Robin: A multi-agent system for automating scientific discovery

Discord Unveiled: A Comprehensive Dataset of Public Communication (2015-2024)

Stop treating `AGI' as the north-star goal of AI research

Comparing Parallel Functional Array Languages: Programming and Performance

Prime Path Coverage in the GNU Compiler Collection

Harnessing the Universal Geometry of Embeddings

Steepest Descent Density Control for Compact 3D Gaussian Splatting

Can You Trust Code Copilots? Evaluating LLMs from a Code Security Perspec

Type-constrained code generation with language models

Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions

SPFresh: Incremental In-Place Update for Billion-Scale Vector Search

LLMs get lost in multi-turn conversation

Sugar-Coated Poison: Benign Generation Unlocks LLM Jailbreaking

µPC: Scaling Predictive Coding to 100 Layer Networks

TransMLA: Multi-head latent attention is all you need

Scoring the European Citizen in the AI Era

Understanding Transformers via N-gram Statistics

Improving Assembly Code Performance with LLMss via Reinforcement Learning

Toward a Sparse and Interpretable Audio Codec

More →