2026-05-13 5 min read
Fastino’s 300M-parameter GLiGuard reframes moderation as classification instead of generation. If the benchmarks hold up, the lesson is simple: safety rails should be cheap enough to run everywhere, not another heavyweight model call.
2026-04-29 5 min read
Unsloth’s Mistral 3.5 run guide turns a model launch into a hardware reality check: this is open local inference, not laptop magic.
2026-04-28 5 min read
NVIDIA’s new open multimodal model is pitched as a cheaper perception layer for agents that need to read screens, documents, video, and audio without stitching four models together.
2026-04-24 4 min read
DeepSeek V4’s preview models pair million-token context with aggressive economics. Closed labs can sell mystique, but builders will be doing the math.
2026-04-16 3 min read
Ollama’s new JSON-schema constraints bring sanity to local AI, replacing fragile regex parsing with actual validation boundaries.
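The constraint mechanism the teaser refers to works by passing a JSON Schema in the request's `format` field, which Ollama uses to constrain decoding so the reply parses as valid JSON. A minimal sketch of such a request body; the model name and schema here are illustrative, not from the article:

```python
import json

# Illustrative schema: force the model to emit a typed record instead of
# free text that would otherwise need fragile regex scraping.
schema = {
    "type": "object",
    "properties": {
        "sentiment": {"type": "string", "enum": ["positive", "negative", "neutral"]},
        "confidence": {"type": "number"},
    },
    "required": ["sentiment", "confidence"],
}

# Body for POST /api/chat against a local Ollama server (default
# http://localhost:11434). The `format` field carries the schema; the
# server constrains generation so the response validates against it.
payload = {
    "model": "llama3.2",  # any locally pulled model; name is an assumption
    "messages": [
        {"role": "user", "content": "Classify: 'Great launch, runs fast.'"}
    ],
    "format": schema,
    "stream": False,
}

body = json.dumps(payload)  # ready to send with any HTTP client
```

The payload is built but not sent here, so the sketch runs without a live Ollama server; swap in `requests.post` or `urllib` to actually issue the call.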
2026-04-07 3 min read
The launch of Llama 4 Maverick and Scout is thrilling for the open ecosystem, promising MoE scale and multimodality. Now builders need to stop clapping and start testing hardware reality.
2026-03-24 3 min read
Qwen3’s open-weight release spans dense models, big MoEs, and hybrid thinking modes under an Apache 2.0 license. The real feature isn’t magic; it’s total control over your inference budget.
2026-03-12 3 min read
Mistral Small 3.1 proves that the most important open models aren’t the largest ones, but the ones you can actually afford to deploy locally.
2026-03-02 3 min read
DeepSeek R1 combines MIT-licensed weights, distilled checkpoints, and aggressive pricing to make open reasoning a practical engineering option rather than just a philosophical debate.