A Chinese AI lab just shipped the world's best coding model — 744 billion parameters, MIT license, trained entirely on Huawei chips — and most Western...

glm-5.1z-aiopen-source

Neural Dispatch · Apr 10 ·5 min read

Forget the Trillion Parameters — DeepSeek V4's Memory Architecture Is What Matters

Three days ago, screenshots from DeepSeek's gray-scale test started circulating on Weibo.

deepseekdeepseek-v4engram

Open Weight Weekly · Apr 8 ·4 min read

GLM-5.1 Just Clocked In for an 8-Hour Coding Shift

Z.AI dropped GLM-5.

glm-5.1z-aiswe-bench-pro

Synthetic Media · Apr 7 ·5 min read

Aurora Doesn't Diffuse — xAI's Autoregressive Bet on Image Generation

Most image generators in 2026 still work the same way they did in 2022: start with noise, denoise iteratively, hope the result matches your prompt.

image-generationxaigrok-imagine

Open Weight Weekly · Apr 4 ·4 min read

NVIDIA Snuck Mamba Into a 120B Model and Nobody Blinked

NVIDIA dropped Nemotron 3 Super a few weeks ago, and the discourse moved on within 48 hours. Understandable — March was a firehose of model releases.

nemotron-3-supernvidiamamba

Synthetic Media · Apr 3 ·5 min read

Wan2.2 Splits Its Brain in Half — And That's the Point

Every video diffusion model released in the last year has followed the same playbook: train bigger, throw more VRAM at inference, charge accordingly.

video-generationwan2.2mixture-of-experts

Neural Dispatch · Apr 3 ·4 min read

Gemma 4 Ships Under Apache 2.0 With an Architecture Nobody Expected

Google dropped Gemma 4 on Wednesday — four open-weight models under a genuine Apache 2.0 license, built from the same research behind Gemini 3.

gemma-4googleopen-source

Open Weight Weekly · Apr 3 ·4 min read

Gemma 4's Secret Weapon Isn't the 31B — It's the 26B That Acts Like a 4B

Google shipped Gemma 4 yesterday under Apache 2.

gemma-4mixture-of-expertsapache-2

Neural Dispatch · Apr 1 ·5 min read

Forget the 119B — Mistral Small 4's Killer Feature Is a Single API Parameter

Mistral shipped a model with 119 billion parameters and called it "Small." Under Apache 2.

mistralmixture-of-expertsopen-source

Neural Dispatch · Mar 31 ·5 min read

Nemotron 3 Super: 120B Parameters, 12B Active, and the Architecture Agents Actually Need

NVIDIA dropped Nemotron 3 Super a few weeks ago and it flew under the radar — buried by the Mythos leak drama and GPT-5.4's benchmark parade.

nvidianemotronmamba

← Prev 2 / 2