Implemented the research paper “Memorizing Transformers” from scratch, with my own modifications to the architecture and a customized training pipeline.

Beyond Python: AI Agents in JavaScript with KaibanJS

LLM Embeddings Explained: A Visual and Intuitive Guide

Qwen3-Coder-30B-A3B-Instruct

Qwen3 235B beats Claude on some code benchmarks

Qwen3 30B-A3B

Voxtral-Mini-3B-2507 – Open source speech understanding model

Reachy Mini – The Open-Source Robot for Today's and Tomorrow's AI Builders

Qwen3-235B-A22B-Thinking-2507

Qwen3-235B-A22B-Instruct-2507

DeepSeek-TNG-R1T2-Chimera

Smollm3: Smol, multilingual, long-context reasoner LLM

Kyutai 1.6B Streaming TTS

Open Source 1.7tb Dataset of What AI Crawlers Are Doing

DiffuCoder-7B-CpGRPO: A code generation LLM developed by Apple

Evolutionary Algorithm Automatically Discovers GPU Optimizations Beating Expert Code

Jan-nano-128k: A 4B Model with a Super-Long Context Window (Still Outperforms 671B [in MCP])

Nanonets-OCR-s – OCR model that transforms documents into structured markdown

Show HN: ChatToSTL – AI text-to-CAD for 3D printing

Qwen3 embedding models

Show HN: Penny-1.7B Irish Penny Journal style transfer

Deepseek R1-0528

Qwen3 0.6B now on HuggingFace (quantized)

Hugging Face to sell open-source robots thanks to Pollen Robotics acquisition

FUTO open-sources 1M row keyboard swipe dataset

Understanding MCP Evals: Why Evals Matter for MCP

Qwen2.5-Omni Technical Report

Co-Doodle with Gemini

Open-sourcing 5,000hrs of self-driving dataset

Hugging Face datasets and models for cybersecurity/software vulnerabilities
