Posts tagged with ollama

NVIDIA Snuck Mamba Into a 120B Model and Nobody Blinked

NVIDIA dropped Nemotron 3 Super a few weeks ago, and the discourse moved on within 48 hours. Understandable — March was a firehose of model releases.

nemotron-3-supernvidiamamba

Open Weight Weekly · Apr 3 ·4 min read

Gemma 4's Secret Weapon Isn't the 31B — It's the 26B That Acts Like a 4B

Google shipped Gemma 4 yesterday under Apache 2.

gemma-4mixture-of-expertsapache-2

Open Weight Weekly · Apr 1 ·4 min read

Mistral Crammed Three Models Into One and Called It Small

Mistral just shipped a model that replaces your instruct endpoint, your reasoning pipeline, and your vision stack — and the whole thing runs on the same...

mistral-small-4moeopen-weights

Open Weight Weekly · Mar 28 ·5 min read

GLM-5 Is the Best Open Model You'll Never Run

The open-weight leaderboard has a new king, and you probably can't afford to host it.

glm-5open-weightsquantization