Posts tagged with open-weights

Neural Dispatch · Jul 8 ·5 min read

46% of US Developer Tokens Run on Chinese Models Now — Nobody Has a Plan B

Twelve months ago, Chinese-made language models handled less than 2% of the tokens flowing through OpenRouter from US-based developer accounts.

chinese-aiglm-5-2openrouter

Neural Dispatch · Jul 6 ·5 min read

A Classified NSA Process Now Sits Between You and the Next Frontier Model

Imagine shipping a feature that depends on a model you can see in benchmarks but can't call from your API key.

frontier-modelsai-regulationnsa

Neural Dispatch · Jul 4 ·4 min read

The $4.40 Model That Codes Better Than GPT-5.5

Z.ai dropped GLM-5.

glm-5-2z-aiopen-weights

Synthetic Media · Jul 3 ·5 min read

Twelve Billion Parameters, No Synthetic Data

Krea dropped the weights for their 12-billion-parameter image model on June 22. Within hours, quantized variants and ComfyUI nodes were live.

kreaopen-weightsdiffusion-transformer

Neural Dispatch · Jun 7 ·5 min read

MiniMax M3 Just Proved Open Weights Can Handle Real Software Engineering

Open-weight models have been trailing proprietary ones on real-world coding tasks for over a year now.

minimax-m3open-weightsswe-bench

Synthetic Media · Jun 6 ·5 min read

9.3 Billion Parameters and a JSON Schema

Ideogram just dropped a 9.3-billion-parameter image model that outperforms rivals three to eight times its size on text rendering benchmarks.

ideogramopen-weightsdiffusion-transformer

Open Weight Weekly · Jun 3 ·5 min read

MiniMax M3 Claims to Beat GPT-5.5. The Weights Aren't Out Yet.

Two days ago MiniMax dropped a launch announcement for M3 that reads like a greatest-hits compilation of everything the open-weight community has been asking...

minimax-m3sparse-attentionmixture-of-experts

Neural Dispatch · May 25 ·4 min read

DeepSeek V4 Pro Costs 13x Less Than GPT-5.5. NIST Measured the Gap.

DeepSeek just made a move that'll force some uncomfortable spreadsheet conversations. The 75% promotional discount on V4 Pro?

deepseekopen-weightsnist

Synthetic Media · May 23 ·5 min read

The Music Model That Won't Sing

Stability AI dropped four audio models this week and none of them will generate a human voice. No lyrics, no vocal harmonies, no Auto-Tuned hooks.

stable-audiomusic-generationdiffusion-transformer

Open Weight Weekly · May 20 ·4 min read

No VAE, No Text Encoder, and It Still Beat FLUX

Every mainstream diffusion model follows the same three-part recipe: a text encoder tokenizes your prompt, a diffusion backbone denoises in latent space, and a...

hidreamimage-generationopen-weights

Neural Dispatch · May 19 ·4 min read

Cursor Built Its Own Coding Model. It's Opus-Grade and One-Tenth the Price.

Two days ago, Cursor shipped Composer 2.5.

cursorcomposerkimi-k2-5

Neural Dispatch · May 18 ·5 min read

Mistral Medium 3.5: The 128B Open Model That Ships PRs While You Sleep

Mistral just did something no other open-weight lab has pulled off: they shipped a 128B model that scores 77.

mistralopen-weightscoding-agents

Synthetic Media · May 16 ·5 min read

Five Versions, Fourteen Months, Zero API Keys

Pull up the LLM-Stats video arena today and the number one slot belongs to an open-weights model. Not Runway.

video-generationwanopen-weights

Open Weight Weekly · May 16 ·5 min read

744 Billion Parameters, Zero NVIDIA GPUs

Z.ai's GLM-5.

glm-5.1zhipu-aihuawei-ascend

Open Weight Weekly · May 15 ·4 min read

DeepSeek Shipped Two Models. You Only Need the Smaller One.

DeepSeek V4-Pro grabbed headlines three weeks ago with an 80.

deepseek-v4open-weightsself-hosting

Neural Dispatch · May 13 ·5 min read

NVIDIA's Nemotron 3 Trades Transformer Purity for 5x Agent Throughput

Most open models fight for the same throne: highest MMLU score, best SWE-Bench pass rate, flashiest reasoning demo.

nvidianemotron-3open-weights

Neural Dispatch · May 12 ·4 min read

Mistral's New Model Is Mid. The Async Coding Agents Running On It Aren't.

Mistral dropped Medium 3.

mistralmistral-medium-3-5async-agents

Open Weight Weekly · May 11 ·4 min read

Mistral Medium 3.5: Strong at Code, Silent on Everything Else

Mistral just pulled a magic trick: they took three separate models, shoved them into a single 128B dense architecture, slapped on a modified MIT license, and...

mistralmistral-medium-3.5open-weights

Neural Dispatch · May 9 ·5 min read

Kimi K2.6 Won a Live Coding Tournament Against GPT-5.5. The Catch? There Isn't One.

On May 3rd, Moonshot AI's Kimi K2.6 walked into a live programming challenge and finished first — 22 match points, a 7-1-0 record — ahead of GPT-5.

kimimoonshot-aiopen-weights

Open Weight Weekly · May 8 ·4 min read

Meta's Two-Trillion-Parameter Ghost

Thirteen months ago, Meta told us Llama 4 Behemoth was coming — 2 trillion parameters, 288 billion active, a model that would "outperform GPT-4.

llama-4metabehemoth

1 / 2 Next →