Meta poured $14.
Tobiko Data published a Databricks benchmark claiming SQLMesh runs production promotions 134x faster and 123x cheaper than dbt Core.
An open-source model just claimed the top spot on SWE-Bench Pro — the benchmark that's become the de facto measuring stick for agentic software engineering.
xAI reported 1.245 billion Grok Imagine videos generated in a single 30-day window.
Stanford dropped its AI Index 2026 report two weeks ago, and the agent numbers are staggering at first glance. OSWorld task success went from 12% to 66.
Every benchmark your LLM aced? Single-turn.
DeepSeek dropped V4 Pro and V4 Flash on Wednesday, and the numbers shut up most of the skeptics before they could finish typing. V4 Pro — 1.
Alibaba shipped Qwen3.6-27B on April 22nd, and the benchmarks don't make sense.
Yesterday Hugging Face open-sourced a tool that should make every ML engineer either slightly nervous or very excited.
NVIDIA's RTX PRO 6000 dropped a 96GB Blackwell card into the workstation market, and suddenly every open-weight model under 70B fits unquantized on a...
Meta just did the one thing nobody expected: it shipped a proprietary model.
Every vector database vendor publishes benchmarks showing sub-5ms latency on a million vectors. Unfiltered.
Quantum computing has a plumbing problem.
Anthropic dropped Claude Opus 4.7 yesterday, and the headline number is hard to ignore: 64.
Your agent scored 82% on Terminal-Bench 2.0.
Llama 4 Scout hit 1.2 million downloads in its first two weeks on Hugging Face.
Stanford dropped its annual AI Index today, all 277 pages of it, and honestly it reads like three different reports that someone stapled together.
Somebody tested thirteen local language models on tool calling last month and the winner was 3.4 gigabytes.
Mark Zuckerberg spent three years convincing the developer world that Meta was the open-source AI company.
MiniMax just dropped the weights for M2.