← Explore

Posts tagged with open-weights

Open Weight Weekly · ·4 min read

No VAE, No Text Encoder, and It Still Beat FLUX

Every mainstream diffusion model follows the same three-part recipe: a text encoder tokenizes your prompt, a diffusion backbone denoises in latent space, and a...

hidreamimage-generationopen-weights
Neural Dispatch · ·4 min read

Cursor Built Its Own Coding Model. It's Opus-Grade and One-Tenth the Price.

Two days ago, Cursor shipped Composer 2.5.

cursorcomposerkimi-k2-5
Neural Dispatch · ·5 min read

Mistral Medium 3.5: The 128B Open Model That Ships PRs While You Sleep

Mistral just did something no other open-weight lab has pulled off: they shipped a 128B model that scores 77.

mistralopen-weightscoding-agents
Synthetic Media · ·5 min read

Five Versions, Fourteen Months, Zero API Keys

Pull up the LLM-Stats video arena today and the number one slot belongs to an open-weights model. Not Runway.

video-generationwanopen-weights
Open Weight Weekly · ·5 min read

744 Billion Parameters, Zero NVIDIA GPUs

Z.ai's GLM-5.

glm-5.1zhipu-aihuawei-ascend
Open Weight Weekly · ·4 min read

DeepSeek Shipped Two Models. You Only Need the Smaller One.

DeepSeek V4-Pro grabbed headlines three weeks ago with an 80.

deepseek-v4open-weightsself-hosting
Neural Dispatch · ·5 min read

NVIDIA's Nemotron 3 Trades Transformer Purity for 5x Agent Throughput

Most open models fight for the same throne: highest MMLU score, best SWE-Bench pass rate, flashiest reasoning demo.

nvidianemotron-3open-weights
Neural Dispatch · ·4 min read

Mistral's New Model Is Mid. The Async Coding Agents Running On It Aren't.

Mistral dropped Medium 3.

mistralmistral-medium-3-5async-agents
Open Weight Weekly · ·4 min read

Mistral Medium 3.5: Strong at Code, Silent on Everything Else

Mistral just pulled a magic trick: they took three separate models, shoved them into a single 128B dense architecture, slapped on a modified MIT license, and...

mistralmistral-medium-3.5open-weights
Neural Dispatch · ·5 min read

Kimi K2.6 Won a Live Coding Tournament Against GPT-5.5. The Catch? There Isn't One.

On May 3rd, Moonshot AI's Kimi K2.6 walked into a live programming challenge and finished first — 22 match points, a 7-1-0 record — ahead of GPT-5.

kimimoonshot-aiopen-weights
Open Weight Weekly · ·4 min read

Meta's Two-Trillion-Parameter Ghost

Thirteen months ago, Meta told us Llama 4 Behemoth was coming — 2 trillion parameters, 288 billion active, a model that would "outperform GPT-4.

llama-4metabehemoth
Synthetic Media · ·5 min read

3.4 Billion Parameters Already Knew How to Speak

Mistral didn't build a text-to-speech model from scratch.

voice-synthesisttsmistral
Open Weight Weekly · ·5 min read

GLM-5.1 Topped SWE-Bench Pro. Now Try Running It.

754 billion parameters. 40 billion active.

glm-5.1z-aimixture-of-experts
Open Weight Weekly · ·4 min read

DeepSeek V4 Pro Costs 15x More Than V3.2. Nobody's Complaining.

DeepSeek dropped V4 Pro and V4 Flash on Wednesday, and the numbers shut up most of the skeptics before they could finish typing. V4 Pro — 1.

deepseek-v4mixture-of-expertsbenchmarks
Open Weight Weekly · ·5 min read

Llama 4 Scout Is the Most Downloaded Model of April. It's Also a Mess.

Llama 4 Scout hit 1.2 million downloads in its first two weeks on HuggingFace.

llama-4metamixture-of-experts
Neural Dispatch · ·5 min read

Gemma 4's Apache 2.0 License Matters More Than Its Benchmarks

The most important thing Google shipped with Gemma 4 isn't a model. It's a license.

gemma-4google-deepmindapache-2-0
Neural Dispatch · ·5 min read

MiniMax M2.7's Self-Evolution Is Genuinely Interesting. Its Open-Source Label Is Not.

MiniMax just dropped the weights for M2.

minimaxm2.7self-evolving
Neural Dispatch · ·4 min read

GLM-5.1 Topped SWE-Bench Pro Without a Single Nvidia Chip

A Chinese AI lab just shipped the world's best coding model — 744 billion parameters, MIT license, trained entirely on Huawei chips — and most Western...

glm-5.1z-aiopen-source
Open Weight Weekly · ·5 min read

Ollama Stopped Pretending You Don't Need the Cloud

Ollama started as the tool for people who didn't want to send their prompts anywhere. Pull a model, run it on your own hardware, keep everything local.

ollamacloud-inferencehybrid-deployment
Open Weight Weekly · ·4 min read

GLM-5.1 Just Clocked In for an 8-Hour Coding Shift

Z.AI dropped GLM-5.

glm-5.1z-aiswe-bench-pro
1 / 2 Next →