<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Useful Machines</title>
    <link>https://usefulmachines.ai</link>
    <atom:link href="https://usefulmachines.ai/rss.xml" rel="self" type="application/rss+xml" />
    <description>Useful Machines covers practical AI news, workflows, tools, and strategy. Useful leverage, not hype.</description>
    <language>en-us</language>
    <lastBuildDate>Wed, 13 May 2026 20:59:00 GMT</lastBuildDate>
    <generator>Useful Machines Astro RSS endpoint</generator>
    
    <item>
      <title>GLiGuard is a tiny safety model with the right kind of ambition</title>
      <link>https://usefulmachines.ai/posts/gliguard-fastino-small-guard-model/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/gliguard-fastino-small-guard-model/</guid>
      <description>Fastino’s 300M-parameter GLiGuard reframes moderation as classification instead of generation. If the benchmarks hold up, the lesson is simple: safety rails should be cheap enough to run everywhere, not another heavyweight model call.</description>
      <pubDate>Wed, 13 May 2026 20:59:00 GMT</pubDate>
      <source url="https://arxiv.org/abs/2605.07982">Fastino AI / arXiv</source>
      <category>Fastino</category><category>GLiGuard</category><category>Open Models</category><category>AI Safety</category><category>Guardrails</category><category>LLM Infrastructure</category>
    </item>
    <item>
      <title>Gemini on Android is Google’s agent distribution play, not just a phone feature</title>
      <link>https://usefulmachines.ai/posts/gemini-intelligence-android-agent-distribution-play/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/gemini-intelligence-android-agent-distribution-play/</guid>
      <description>Google’s Gemini Intelligence turns Android into a proactive agent surface for app automation, Chrome, Autofill, voice cleanup, and custom widgets. The useful question is not whether it demos well. It is where control actually lives.</description>
      <pubDate>Tue, 12 May 2026 17:34:00 GMT</pubDate>
      <source url="https://blog.google/products-and-platforms/platforms/android/gemini-intelligence/">Google Gemini Blog</source>
      <category>Google</category><category>Android</category><category>Gemini</category><category>AI Agents</category><category>Mobile AI</category><category>Personal AI</category>
    </item>
    <item>
      <title>SAP’s NVIDIA agent deal is not about faster GPUs. It is about the leash.</title>
      <link>https://usefulmachines.ai/posts/nvidia-sap-agent-trust-openshell/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/nvidia-sap-agent-trust-openshell/</guid>
      <description>NVIDIA and SAP are embedding OpenShell into SAP’s agent platform so business agents get isolation, policy controls, and production guardrails. That is the useful part: less magic demo, more containment plan.</description>
      <pubDate>Tue, 12 May 2026 13:00:00 GMT</pubDate>
      <source url="https://blogs.nvidia.com/blog/sap-specialized-agents/">NVIDIA Blog</source>
      <category>NVIDIA</category><category>SAP</category><category>AI Agents</category><category>Enterprise AI</category><category>Governance</category><category>Open Source</category>
    </item>
    <item>
      <title>Useful Signals: open models, realtime voice, and GPUs you can actually reserve</title>
      <link>https://usefulmachines.ai/posts/useful-signals-zaya1-realtime-voice-gpu-capacity/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/useful-signals-zaya1-realtime-voice-gpu-capacity/</guid>
      <description>Today’s useful pile: Zyphra’s open ZAYA1 preview, OpenAI’s realtime voice push, AWS trying to make short GPU bursts less cursed, AgentCore Browser leaving the DOM, Gemini Flash-Lite going GA, and ChatGPT adding a trusted-contact safety rail.</description>
      <pubDate>Fri, 08 May 2026 03:24:00 GMT</pubDate>
      <source url="https://usefulmachines.ai/posts/useful-signals-zaya1-realtime-voice-gpu-capacity/">Zyphra / OpenAI / AWS / Google / Simon Willison</source>
      <category>Useful Signals</category><category>OpenAI</category><category>AWS</category><category>Gemini</category><category>Open Models</category><category>AI Agents</category>
    </item>
    <item>
      <title>Anthropic handed Petri to Meridian. Now the evals need to earn trust.</title>
      <link>https://usefulmachines.ai/posts/anthropic-petri-meridian-alignment-evals-trust/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/anthropic-petri-meridian-alignment-evals-trust/</guid>
      <description>Petri 3.0 turns Anthropic’s open alignment-testing tool into a more hackable, more realistic eval stack under Meridian Labs. Useful, if buyers treat it as a test harness instead of a trust sticker.</description>
      <pubDate>Fri, 08 May 2026 00:28:13 GMT</pubDate>
      <source url="https://www.anthropic.com/research/donating-open-source-petri">Anthropic / Meridian Labs</source>
      <category>Anthropic</category><category>Petri</category><category>Meridian Labs</category><category>AI Evaluation</category><category>Alignment</category><category>AI Safety</category>
    </item>
    <item>
      <title>ImageMining tests whether visual agents can actually search with their eyes</title>
      <link>https://usefulmachines.ai/posts/zai-imagemining-visual-agent-benchmark/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/zai-imagemining-visual-agent-benchmark/</guid>
      <description>Z.ai’s new ImageMining benchmark asks multimodal agents to inspect images, crop details, search outward, and reason across sources. That is a better test for many real visual workflows than another captioning score.</description>
      <pubDate>Thu, 07 May 2026 18:58:12 GMT</pubDate>
      <source url="https://github.com/zai-org/ImageMining">Z.ai ImageMining GitHub repository</source>
      <category>Z.ai</category><category>ImageMining</category><category>Multimodal AI</category><category>AI Benchmarks</category><category>Visual Agents</category><category>Deep Search</category>
    </item>
    <item>
      <title>AWS’s GRPO tutorial turns reward design into the main event</title>
      <link>https://usefulmachines.ai/posts/aws-grpo-verifiable-rewards-training-reality-check/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/aws-grpo-verifiable-rewards-training-reality-check/</guid>
      <description>AWS shows how verifiable rewards and GRPO can improve a small model on grade-school math. The useful lesson is not the benchmark bump — it is where reward functions are finally testable enough to trust.</description>
      <pubDate>Thu, 07 May 2026 15:59:20 GMT</pubDate>
      <source url="https://aws.amazon.com/blogs/machine-learning/overcoming-reward-signal-challenges-verifiable-rewards-based-reinforcement-learning-with-grpo-on-sagemaker-ai/">AWS Machine Learning Blog</source>
      <category>AWS</category><category>SageMaker</category><category>Reinforcement Learning</category><category>GRPO</category><category>RLVR</category><category>Model Training</category>
    </item>
    <item>
      <title>Anthropic’s Project Glasswing is a cyber alarm with a repair plan</title>
      <link>https://usefulmachines.ai/posts/anthropic-project-glasswing-ai-cybersecurity-repair-plan/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/anthropic-project-glasswing-ai-cybersecurity-repair-plan/</guid>
      <description>Anthropic says Claude Mythos Preview can find and exploit serious software flaws at a new scale. Project Glasswing is its attempt to put that capability in defenders’ hands before attackers get the same advantage.</description>
      <pubDate>Thu, 07 May 2026 15:28:10 GMT</pubDate>
      <source url="https://www.anthropic.com/glasswing">Anthropic</source>
      <category>Anthropic</category><category>Project Glasswing</category><category>Claude Mythos</category><category>Cybersecurity</category><category>Open Source Security</category><category>AI Safety</category>
    </item>
    <item>
      <title>AWS gave agents a wallet. The hard part is the leash.</title>
      <link>https://usefulmachines.ai/posts/aws-agentcore-payments-wallet-with-a-leash/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/aws-agentcore-payments-wallet-with-a-leash/</guid>
      <description>Amazon Bedrock AgentCore Payments brings Coinbase, Stripe, x402, budgets, and observability into agent workflows. The useful question is not whether agents can pay — it is who controls when they are allowed to.</description>
      <pubDate>Thu, 07 May 2026 12:59:20 GMT</pubDate>
      <source url="https://aws.amazon.com/blogs/machine-learning/agents-that-transact-introducing-amazon-bedrock-agentcore-payments-built-with-coinbase-and-stripe/">AWS Machine Learning Blog</source>
      <category>AWS</category><category>Amazon Bedrock</category><category>AI Agents</category><category>Payments</category><category>x402</category><category>Stripe</category><category>Coinbase</category>
    </item>
    <item>
      <title>Google’s agent codelab makes the demo look like integration work</title>
      <link>https://usefulmachines.ai/posts/google-gemini-enterprise-codelab-integration-work/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/google-gemini-enterprise-codelab-integration-work/</guid>
      <description>Google’s Cloud Next ’26 codelab shows Gemini Enterprise coordinating Cloud Run agents, BigQuery, Veo, Drive, and Gemini CLI. The useful lesson is not magic autonomy; it is where shared context and handoffs actually have to live.</description>
      <pubDate>Wed, 06 May 2026 22:28:10 GMT</pubDate>
      <source url="https://codelabs.developers.google.com/next26/gen-keynote/unified-intelligence">Google Codelabs</source>
      <category>Google Cloud</category><category>Gemini Enterprise</category><category>AI Agents</category><category>Cloud Run</category><category>Gemini CLI</category>
    </item>
    <item>
      <title>ChatGPT’s new default model is a memory test, not a victory lap</title>
      <link>https://usefulmachines.ai/posts/gpt-5-5-instant-chatgpt-default-memory-test/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/gpt-5-5-instant-chatgpt-default-memory-test/</guid>
      <description>OpenAI is replacing GPT-5.3 Instant with GPT-5.5 Instant as ChatGPT’s default. The useful story is not just the claim of fewer hallucinations; it is whether memory, personalization, and model retirement become safer defaults.</description>
      <pubDate>Tue, 05 May 2026 17:30:10 GMT</pubDate>
      <source url="https://techcrunch.com/2026/05/05/openai-releases-gpt-5-5-instant-a-new-default-model-for-chatgpt/">TechCrunch / OpenAI</source>
      <category>OpenAI</category><category>ChatGPT</category><category>GPT-5.5</category><category>AI Models</category><category>Personalization</category>
    </item>
    <item>
      <title>Google’s April AI recap is a product strategy hiding in a list</title>
      <link>https://usefulmachines.ai/posts/google-april-ai-recap-product-strategy/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/google-april-ai-recap-product-strategy/</guid>
      <description>Google’s monthly AI roundup is not just a pile of announcements. It shows how the company is turning Gemini into a cross-product operating layer, from Cloud agents to Vids, Colab, Translate, Fitbit, and healthcare training.</description>
      <pubDate>Mon, 04 May 2026 17:58:52 GMT</pubDate>
      <source url="https://blog.google/innovation-and-ai/technology/ai/google-ai-updates-april-2026/">Google Blog</source>
      <category>Google</category><category>Gemini</category><category>AI Agents</category><category>Google Workspace</category><category>Developer Tools</category>
    </item>
    <item>
      <title>Google’s Gemini Enterprise Agent Platform makes Vertex AI the agent factory</title>
      <link>https://usefulmachines.ai/posts/google-gemini-enterprise-agent-platform-vertex-ai-shift/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/google-gemini-enterprise-agent-platform-vertex-ai-shift/</guid>
      <description>Google is folding Vertex AI’s future into a governed enterprise agent platform, which says the next AI fight is less about demos and more about identity, runtime, memory, and observability.</description>
      <pubDate>Wed, 29 Apr 2026 22:27:20 GMT</pubDate>
      <source url="https://cloud.google.com/blog/products/ai-machine-learning/introducing-gemini-enterprise-agent-platform">Google Cloud Blog</source>
      <category>Google Cloud</category><category>Gemini Enterprise</category><category>AI Agents</category><category>Vertex AI</category><category>Enterprise AI</category>
    </item>
    <item>
      <title>Mistral Medium 3.5 is local, if your local machine has 80GB to spare</title>
      <link>https://usefulmachines.ai/posts/mistral-medium-3-5-local-hardware-reality-check/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/mistral-medium-3-5-local-hardware-reality-check/</guid>
      <description>Unsloth’s Mistral 3.5 run guide turns a model launch into a hardware reality check: this is open local inference, not laptop magic.</description>
      <pubDate>Wed, 29 Apr 2026 15:57:42 GMT</pubDate>
      <source url="https://unsloth.ai/docs/models/mistral-3.5">Unsloth Documentation</source>
      <category>Mistral AI</category><category>Open Models</category><category>Local LLMs</category><category>Unsloth</category><category>GGUF</category>
    </item>
    <item>
      <title>Google’s Agent Skills repo is a quiet attack on context bloat</title>
      <link>https://usefulmachines.ai/posts/google-agent-skills-repo-context-bloat/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/google-agent-skills-repo-context-bloat/</guid>
      <description>Google’s new official Agent Skills repository gives agents compact, task-specific instructions for Cloud products instead of stuffing whole documentation sites into context.</description>
      <pubDate>Tue, 28 Apr 2026 16:58:45 GMT</pubDate>
      <source url="https://cloud.google.com/blog/topics/developers-practitioners/level-up-your-agents-announcing-googles-official-skills-repository">Google Cloud Blog</source>
      <category>Google Cloud</category><category>AI Agents</category><category>Agent Skills</category><category>MCP</category><category>Developer Tools</category>
    </item>
    <item>
      <title>NVIDIA’s Nemotron 3 Nano Omni wants to be the eyes and ears of agents</title>
      <link>https://usefulmachines.ai/posts/nvidia-nemotron-3-nano-omni-agent-eyes-ears/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/nvidia-nemotron-3-nano-omni-agent-eyes-ears/</guid>
      <description>NVIDIA’s new open multimodal model is pitched as a cheaper perception layer for agents that need to read screens, documents, video, and audio without stitching four models together.</description>
      <pubDate>Tue, 28 Apr 2026 16:28:21 GMT</pubDate>
      <source url="https://huggingface.co/unsloth/NVIDIA-Nemotron-3-Nano-Omni-30B-A3B-Reasoning-GGUF">Unsloth / Hugging Face</source>
      <category>NVIDIA</category><category>Nemotron</category><category>Open Models</category><category>Multimodal AI</category><category>AI Agents</category>
    </item>
    <item>
      <title>Talkie is a 1930 language model with a modern contamination problem</title>
      <link>https://usefulmachines.ai/posts/talkie-vintage-language-model-1930-clean-data-test/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/talkie-vintage-language-model-1930-clean-data-test/</guid>
      <description>A 13B model trained on pre-1931 text is less a nostalgia demo than a practical test bed for clean data, synthetic tuning, and what language models really learn from the web.</description>
      <pubDate>Tue, 28 Apr 2026 02:58:17 GMT</pubDate>
      <source url="https://talkie-lm.com/introducing-talkie">Talkie LM</source>
      <category>Language Models</category><category>Training Data</category><category>Open Models</category><category>AI Research</category><category>Data Contamination</category>
    </item>
    <item>
      <title>NVIDIA Dynamo is a reality check on the broken economics of agentic coding</title>
      <link>https://usefulmachines.ai/posts/nvidia-dynamo-inference-agentic-coding-economics/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/nvidia-dynamo-inference-agentic-coding-economics/</guid>
      <description>NVIDIA is rebuilding the inference stack with KV-aware routing because traditional architectures cannot survive the hidden cost of agentic API loops.</description>
      <pubDate>Sat, 25 Apr 2026 18:37:00 GMT</pubDate>
      <source url="https://x.com/NVIDIAAI/status/2048069526000934986">X / @NVIDIAAI</source>
      <category>NVIDIA</category><category>Agentic Coding</category><category>Infrastructure</category><category>Economics</category><category>KV-cache</category>
    </item>
    <item>
      <title>From Siri to the 17 Pro: Tim Cook’s 15-Year AI Hardware Reality Check</title>
      <link>https://usefulmachines.ai/posts/9to5mac-the-first-and-last-flagship-iphone-launched-under-tim-cook-4s-vs-17-pro/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/9to5mac-the-first-and-last-flagship-iphone-launched-under-tim-cook-4s-vs-17-pro/</guid>
      <description>Apple&apos;s first and last flagship iPhones under Tim Cook are separated by a decade and a half of hardware iteration, but they share the exact same pitch: putting a chatbot in your pocket.</description>
      <pubDate>Sat, 25 Apr 2026 17:37:26 GMT</pubDate>
      <source url="https://x.com/9to5mac/status/2048054927495135703">9to5Mac</source>
      <category>Apple</category><category>iPhone</category><category>Siri</category><category>AI Hardware</category><category>Tim Cook</category>
    </item>
    <item>
      <title>OpenAI merged Codex into the main model. Stop waiting for a specialized coding brain.</title>
      <link>https://usefulmachines.ai/posts/openai-codex-merged-gpt-5-5/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/openai-codex-merged-gpt-5-5/</guid>
      <description>Romain Huet confirmed that OpenAI&apos;s dedicated Codex line is dead. The main model and the coding model are now the same system, changing how builders should evaluate GPT-5.5.</description>
      <pubDate>Sat, 25 Apr 2026 12:35:00 GMT</pubDate>
      <source url="https://simonwillison.net/2026/Apr/25/romain-huet/#atom-everything">Simon Willison / Romain Huet</source>
      <category>OpenAI</category><category>GPT-5.5</category><category>Codex</category><category>Agentic Coding</category><category>AI Workflows</category>
    </item>
    <item>
      <title>GPT-5.5 is in the API. Stop rewriting your retry logic.</title>
      <link>https://usefulmachines.ai/posts/openai-gpt-5-5-api-availability-1m-context/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/openai-gpt-5-5-api-availability-1m-context/</guid>
      <description>OpenAI pushed GPT-5.5 to Chat Completions and Responses with a 1M context window, while putting GPT-5.5-pro behind Responses. The real product is fewer retries — and a nudge off legacy chat endpoints.</description>
      <pubDate>Sat, 25 Apr 2026 11:06:42 GMT</pubDate>
      <source url="https://x.com/OpenAIDevs/status/2047742566410736090">X / @OpenAIDevs</source>
      <category>OpenAI</category><category>GPT-5.5</category><category>API</category><category>Responses API</category><category>Infrastructure</category>
    </item>
    <item>
      <title>Perplexity makes GPT-5.5 its orchestration default, because tool-calling is the only benchmark that matters</title>
      <link>https://usefulmachines.ai/posts/perplexity-gpt-5-5-orchestration-model/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/perplexity-gpt-5-5-orchestration-model/</guid>
      <description>Perplexity is deploying GPT-5.5 as the default orchestrator for its agentic tier. It proves the next phase of AI architecture is a barbell: heavy routers delegating to cheap generators.</description>
      <pubDate>Sat, 25 Apr 2026 10:37:11 GMT</pubDate>
      <source url="https://x.com/perplexity_ai/status/2047748486767272243">X / @perplexity_ai</source>
      <category>Perplexity</category><category>GPT-5.5</category><category>Infrastructure</category><category>Economics</category><category>Agentic Workflows</category>
    </item>
    <item>
      <title>Google Gemini 3.1 TTS introduces audio tags to end the retry tax</title>
      <link>https://usefulmachines.ai/posts/google-gemini-3-1-tts-audio-tags-retry-tax/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/google-gemini-3-1-tts-audio-tags-retry-tax/</guid>
      <description>The introduction of inline audio tags in Gemini 3.1 TTS isn&apos;t just a formatting trick. It is a fundamental shift from probabilistic guessing to deterministic steering, aimed directly at the hidden costs of inference.</description>
      <pubDate>Sat, 25 Apr 2026 09:35:00 GMT</pubDate>
      <source url="https://x.com/GoogleAI/status/2047377023656436013">Google AI</source>
      <category>Google</category><category>Gemini 3.1 TTS</category><category>Infrastructure</category><category>Economics</category><category>Text-to-Speech</category>
    </item>
    <item>
      <title>OpenAI&apos;s GPT-5.5 prompting guide proves your legacy prompts are a liability</title>
      <link>https://usefulmachines.ai/posts/gpt-5-5-prompting-guide/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/gpt-5-5-prompting-guide/</guid>
      <description>OpenAI released detailed guidance on prompting GPT-5.5, and the primary lesson is demolition. Treat it as a new model family, delete your bloated prompt preambles, and keep the users of your tools updated while the model thinks.</description>
      <pubDate>Sat, 25 Apr 2026 08:35:58 GMT</pubDate>
      <source url="https://simonwillison.net/2026/Apr/25/gpt-5-5-prompting-guide/#atom-everything">Simon Willison</source>
      <category>OpenAI</category><category>GPT-5.5</category><category>Prompt Engineering</category><category>LLMs</category><category>API</category>
    </item>
    <item>
      <title>xAI drops Grok Voice Think Fast 1.0 to handle your actual, noisy life</title>
      <link>https://usefulmachines.ai/posts/xai-grok-voice-think-fast-1-0-launch/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/xai-grok-voice-think-fast-1-0-launch/</guid>
      <description>xAI’s new voice model claims the top spot on the Tau Voice Bench, promising to survive background noise and interruptions. But a capable voice model still needs you to know what you want it to do.</description>
      <pubDate>Sat, 25 Apr 2026 05:36:50 GMT</pubDate>
      <source url="https://x.com/xai/status/2047441173569216721">X / @xai</source>
      <category>xAI</category><category>Grok</category><category>Voice AI</category><category>Generative AI</category><category>AI Workflows</category>
    </item>
    <item>
      <title>OpenAI&apos;s GPT-5.5 prompt guide has one instruction: stop micromanaging</title>
      <link>https://usefulmachines.ai/posts/gpt-5-5-prompting-start-over/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/gpt-5-5-prompting-start-over/</guid>
      <description>The new prompt guidance for GPT-5.5 is an exercise in demolition. The advice isn&apos;t to add new magic words; it&apos;s to clear out legacy prompt debt and define the destination rather than the path.</description>
      <pubDate>Sat, 25 Apr 2026 04:39:24 GMT</pubDate>
      <source url="https://developers.openai.com/api/docs/guides/prompt-guidance?model=gpt-5.5">OpenAI Docs</source>
      <category>OpenAI</category><category>GPT-5.5</category><category>Prompt Engineering</category><category>API</category><category>AI Workflows</category>
    </item>
    <item>
      <title>GPT-5.5 in the API turns OpenAI’s launch into a routing problem</title>
      <link>https://usefulmachines.ai/posts/gpt-5-5-api-workflow-decision/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/gpt-5-5-api-workflow-decision/</guid>
      <description>API access means teams can stop admiring GPT-5.5 from the showroom and start deciding where it actually deserves production budget.</description>
      <pubDate>Sat, 25 Apr 2026 01:35:50 GMT</pubDate>
      <source url="https://x.com/OpenAI/status/2047743592278745425">OpenAI on X</source>
      <category>OpenAI</category><category>GPT-5.5</category><category>API</category><category>Developer Tools</category><category>AI Workflows</category>
    </item>
    <item>
      <title>Simon Willison&apos;s llm 0.31 brings GPT-5.5 into the boring test loop</title>
      <link>https://usefulmachines.ai/posts/llm-0-31-gpt-5-5-terminal-workflow/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/llm-0-31-gpt-5-5-terminal-workflow/</guid>
      <description>The latest release of the llm CLI adds GPT-5.5 support plus useful knobs for verbosity and image detail. It isn&apos;t flashy, but repeatable terminal tools are how you avoid vibe-based evaluations.</description>
      <pubDate>Sat, 25 Apr 2026 00:35:50 GMT</pubDate>
      <source url="https://simonwillison.net/2026/Apr/24/llm/#atom-everything">Simon Willison</source>
      <category>LLM</category><category>GPT-5.5</category><category>OpenAI</category><category>Developer Tools</category><category>Builder Workflow</category>
    </item>
    <item>
      <title>ChatGPT workspace agents are a handoff test, not an autonomy victory lap</title>
      <link>https://usefulmachines.ai/posts/chatgpt-workspace-agents-practical-handoff/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/chatgpt-workspace-agents-practical-handoff/</guid>
      <description>OpenAI’s workspace agents sound autonomous, but the useful test is much duller: can they take a real workflow, preserve context, and return an artifact that is actually reviewable?</description>
      <pubDate>Sat, 25 Apr 2026 00:05:48 GMT</pubDate>
      <source url="https://openai.com/index/introducing-workspace-agents-in-chatgpt">OpenAI News</source>
      <category>OpenAI</category><category>ChatGPT</category><category>Workspace Agents</category><category>Codex</category><category>AI Workflows</category>
    </item>
    <item>
      <title>GPT-5.5 is OpenAI&apos;s push toward messier work and fewer rescue prompts</title>
      <link>https://usefulmachines.ai/posts/gpt-5-5-messier-work-launch/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/gpt-5-5-messier-work-launch/</guid>
      <description>OpenAI pitches its new model as better at complex coding and data analysis. The real test is whether it can navigate messy workflows without requiring constant human cleanup.</description>
      <pubDate>Fri, 24 Apr 2026 21:35:52 GMT</pubDate>
      <source url="https://openai.com/index/introducing-gpt-5-5">OpenAI News</source>
      <category>OpenAI</category><category>GPT-5.5</category><category>ChatGPT</category><category>Coding Agents</category><category>AI Workflows</category>
    </item>
    <item>
      <title>LiteParse proves the best AI workflow might avoid a model call entirely</title>
      <link>https://usefulmachines.ai/posts/liteparse-browser-pdf-workflow/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/liteparse-browser-pdf-workflow/</guid>
      <description>A browser-based LiteParse demo turns PDF extraction into a local-first workflow, proving that deterministic preprocessing should happen close to the user before inviting expensive models to guess.</description>
      <pubDate>Fri, 24 Apr 2026 19:05:52 GMT</pubDate>
      <source url="https://simonwillison.net/2026/Apr/23/liteparse-for-the-web/#atom-everything">Simon Willison</source>
      <category>LiteParse</category><category>PDF</category><category>Browser Tools</category><category>OCR</category><category>Builder Workflow</category>
    </item>
    <item>
      <title>Claude Code’s $100 pricing jump-scare is a lesson in developer trust</title>
      <link>https://usefulmachines.ai/posts/claude-code-pricing-trust-test/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/claude-code-pricing-trust-test/</guid>
      <description>Anthropic explained the visible pricing confusion as a small test, but developers heard a warning to keep an exit ramp. Pricing stability is rollout infrastructure for coding tools.</description>
      <pubDate>Fri, 24 Apr 2026 17:01:39 GMT</pubDate>
      <source url="https://simonwillison.net/2026/Apr/22/claude-code-confusion/#atom-everything">Simon Willison</source>
      <category>Anthropic</category><category>Claude Code</category><category>Pricing</category><category>Developer Trust</category><category>Coding Agents</category>
    </item>
    <item>
      <title>GPT-5.5 landing in Codex before the API reveals OpenAI&apos;s product strategy</title>
      <link>https://usefulmachines.ai/posts/gpt-5-5-codex-before-api/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/gpt-5-5-codex-before-api/</guid>
      <description>GPT-5.5’s early path through Codex and ChatGPT says OpenAI wants the new model tested inside controlled workflows first. Builders should evaluate the access path as much as the model itself.</description>
      <pubDate>Fri, 24 Apr 2026 15:01:00 GMT</pubDate>
      <source url="https://simonwillison.net/2026/Apr/23/gpt-5-5/#atom-everything">Simon Willison</source>
      <category>OpenAI</category><category>GPT-5.5</category><category>Codex</category><category>APIs</category><category>Builder Workflow</category>
    </item>
    <item>
      <title>DeepSeek V4 applies open-model pricing pressure to closed labs</title>
      <link>https://usefulmachines.ai/posts/deepseek-v4-price-performance-shift/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/deepseek-v4-price-performance-shift/</guid>
      <description>DeepSeek V4’s preview models pair million-token context with aggressive economics. Closed labs can sell mystique, but builders will be doing the math.</description>
      <pubDate>Fri, 24 Apr 2026 12:59:00 GMT</pubDate>
      <source url="https://simonwillison.net/2026/Apr/24/deepseek-v4/#atom-everything">Simon Willison</source>
      <category>DeepSeek</category><category>Open Models</category><category>Open Weights</category><category>Pricing</category><category>Local AI</category>
    </item>
    <item>
      <title>OpenAI’s Codex push admits that enterprise AI requires installers</title>
      <link>https://usefulmachines.ai/posts/codex-enterprise-services-layer/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/codex-enterprise-services-layer/</guid>
      <description>OpenAI is pushing Codex through massive consulting firms like Accenture and PwC. It’s an admission that enterprise software needs governance, training, and a lot of meetings to survive.</description>
      <pubDate>Fri, 24 Apr 2026 00:10:00 GMT</pubDate>
      <source url="https://openai.com/index/scaling-codex-to-enterprises-worldwide">OpenAI</source>
      <category>OpenAI</category><category>Codex</category><category>Enterprise</category><category>Developer Tools</category>
    </item>
    <item>
      <title>ChatGPT Images 2.0 requires you to actually have some taste</title>
      <link>https://usefulmachines.ai/posts/chatgpt-images-2-0-creative-ops/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/chatgpt-images-2-0-creative-ops/</guid>
      <description>The new image model is definitely stronger, but the real lesson is that AI generation only works when teams apply constraints, budgets, and a review process.</description>
      <pubDate>Thu, 23 Apr 2026 19:40:00 GMT</pubDate>
      <source url="https://simonwillison.net/2026/Apr/21/gpt-image-2/">Simon Willison</source>
      <category>OpenAI</category><category>Images</category><category>Creative Ops</category><category>Tips and Tricks</category>
    </item>
    <item>
      <title>OpenAI’s workspace agents are an enterprise Trojan horse</title>
      <link>https://usefulmachines.ai/posts/workspace-agents-enterprise-boundary/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/workspace-agents-enterprise-boundary/</guid>
      <description>OpenAI’s workspace agents aren&apos;t just about doing more chores. They are a deliberate march into the enterprise control layer, where permissions and approvals rule the world.</description>
      <pubDate>Thu, 23 Apr 2026 19:10:00 GMT</pubDate>
      <source url="https://openai.com/index/introducing-workspace-agents-in-chatgpt">OpenAI</source>
      <category>OpenAI</category><category>ChatGPT</category><category>Enterprise</category><category>Agents</category>
    </item>
    <item>
      <title>LiteParse in the browser is actually a story about production plumbing</title>
      <link>https://usefulmachines.ai/posts/liteparse-browser-pdf-stack/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/liteparse-browser-pdf-stack/</guid>
      <description>Simon Willison ported LiteParse to the browser, proving once again that AI document workflows usually fail long before the model even sees the text.</description>
      <pubDate>Thu, 23 Apr 2026 19:05:00 GMT</pubDate>
      <source url="https://simonwillison.net/2026/Apr/23/liteparse-for-the-web/">Simon Willison</source>
      <category>PDF</category><category>Document Parsing</category><category>Tools</category><category>Tips and Tricks</category>
    </item>
    <item>
      <title>GPT-5.5&apos;s real feature is fewer cries for help</title>
      <link>https://usefulmachines.ai/posts/gpt-5-5-practical-take/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/gpt-5-5-practical-take/</guid>
      <description>OpenAI is pitching GPT-5.5 as a smarter model, but the practical upgrade is supposed to be less hand-holding. If we don&apos;t have to hover over it while it works, that&apos;s an actual feature.</description>
      <pubDate>Thu, 23 Apr 2026 18:30:00 GMT</pubDate>
      <source url="https://openai.com/index/introducing-gpt-5-5">OpenAI</source>
      <category>OpenAI</category><category>GPT-5.5</category><category>Models</category><category>Agents</category>
    </item>
    <item>
      <title>Privacy tools are finally becoming part of the AI product experience</title>
      <link>https://usefulmachines.ai/posts/privacy-tools-everyday-ai-boundaries/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/privacy-tools-everyday-ai-boundaries/</guid>
      <description>OpenAI’s Privacy Filter sends a clear cultural message: useful AI needs boundaries that are visible enough for users to actually trust it with their real work.</description>
      <pubDate>Thu, 23 Apr 2026 18:05:00 GMT</pubDate>
      <source url="https://openai.com/index/introducing-openai-privacy-filter">OpenAI</source>
      <category>Privacy</category><category>AI Culture</category><category>OpenAI</category><category>Trust</category>
    </item>
    <item>
      <title>ChatGPT workspace agents are gunning for the office sludge</title>
      <link>https://usefulmachines.ai/posts/workspace-agents-chatgpt/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/workspace-agents-chatgpt/</guid>
      <description>OpenAI is wrapping agent language around the most boring parts of enterprise life—shared chores, routing, and approvals. It&apos;s not glamorous, but it is unfortunately essential.</description>
      <pubDate>Thu, 23 Apr 2026 17:45:00 GMT</pubDate>
      <source url="https://openai.com/index/introducing-workspace-agents-in-chatgpt">OpenAI</source>
      <category>OpenAI</category><category>ChatGPT</category><category>Agents</category><category>Workflows</category><category>Enterprise</category>
    </item>
    <item>
      <title>OpenAI&apos;s Privacy Filter is the plumbing that keeps Legal off your back</title>
      <link>https://usefulmachines.ai/posts/openai-privacy-filter/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/openai-privacy-filter/</guid>
      <description>OpenAI&apos;s new open-weight Privacy Filter isn&apos;t a flashy demo. It&apos;s the upstream scrubber you need before your logs and evals start spraying personally identifiable information everywhere.</description>
      <pubDate>Thu, 23 Apr 2026 16:30:00 GMT</pubDate>
      <source url="https://openai.com/index/introducing-openai-privacy-filter">OpenAI</source>
      <category>OpenAI</category><category>Privacy</category><category>Security</category><category>Tools</category>
    </item>
    <item>
      <title>Google’s new TPUs prove that agentic AI is mostly a billing problem</title>
      <link>https://usefulmachines.ai/posts/google-tpus-agentic-era/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/google-tpus-agentic-era/</guid>
      <description>Google’s TPU 8i and 8t announcement sounds like a hardware story. It&apos;s actually a confession that AI agents turn latency and serving costs into your biggest product bottlenecks.</description>
      <pubDate>Thu, 23 Apr 2026 15:15:00 GMT</pubDate>
      <source url="https://blog.google/innovation-and-ai/infrastructure-and-cloud/google-cloud/tpus-8t-8i-cloud-next/">Google AI Blog</source>
      <category>Google</category><category>Infrastructure</category><category>TPU</category><category>Agents</category>
    </item>
    <item>
      <title>The Claude Code pricing scare shows how fragile developer trust is</title>
      <link>https://usefulmachines.ai/posts/claude-code-pricing-trust/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/claude-code-pricing-trust/</guid>
      <description>Anthropic&apos;s brief pricing confusion around Claude Code was quickly resolved, but developers reacted by doing what they always do: looking for the exit.</description>
      <pubDate>Thu, 23 Apr 2026 14:00:00 GMT</pubDate>
      <source url="https://simonwillison.net/2026/Apr/22/claude-code-confusion/">Simon Willison</source>
      <category>Anthropic</category><category>Claude</category><category>Claude Code</category><category>Developer Tools</category>
    </item>
    <item>
      <title>Grok&apos;s new audio APIs: Voice gets chopped into useful plumbing</title>
      <link>https://usefulmachines.ai/posts/grok-stt-tts-audio-api-push/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/grok-stt-tts-audio-api-push/</guid>
      <description>xAI broke Grok into standalone Speech to Text and Text to Speech APIs. The talking bot is the circus; the modular APIs are the actual infrastructure developers can ship.</description>
      <pubDate>Sat, 18 Apr 2026 15:05:00 GMT</pubDate>
      <source url="https://x.ai/news/grok-stt-and-tts-apis">xAI</source>
      <category>xAI</category><category>Grok</category><category>Speech to Text</category><category>Text to Speech</category><category>Voice AI</category>
    </item>
    <item>
      <title>Office agents need receipts, or they&apos;re just interns with root access</title>
      <link>https://usefulmachines.ai/posts/agent-observability-office-work/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/agent-observability-office-work/</guid>
      <description>OpenAI’s new agent observability tools sound like developer jargon, but they represent the difference between useful delegation and finding out your bot rearranged the CRM while you were asleep.</description>
      <pubDate>Fri, 17 Apr 2026 16:05:00 GMT</pubDate>
      <source url="https://openai.com/index/new-tools-for-building-agents/">OpenAI</source>
      <category>Agents</category><category>OpenAI</category><category>Operations</category><category>AI Workflows</category><category>Trust</category>
    </item>
    <item>
      <title>AI assurance is just trust after it stops being a mood board</title>
      <link>https://usefulmachines.ai/posts/ai-assurance-trust-infrastructure/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/ai-assurance-trust-infrastructure/</guid>
      <description>Partnership on AI’s take on assurance reminds us that public trust isn’t built on launch demos. It’s built on standards, monitoring, and the boring machinery that proves an AI isn&apos;t hallucinating its way through your data.</description>
      <pubDate>Thu, 16 Apr 2026 14:15:00 GMT</pubDate>
      <source url="https://partnershiponai.org/can-assurance-help-build-ai-systems-that-we-can-trust/">Partnership on AI</source>
      <category>AI Assurance</category><category>Trust</category><category>Policy</category><category>Standards</category><category>AI Culture</category>
    </item>
    <item>
      <title>OpenAI’s Agents SDK update brings the seatbelts your bots desperately need</title>
      <link>https://usefulmachines.ai/posts/agents-sdk-sandbox-production-harness/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/agents-sdk-sandbox-production-harness/</guid>
      <description>With native sandboxes, filesystem tools, and workspace manifests, OpenAI is admitting that agents need unglamorous harnesses to keep them from becoming clever incident generators.</description>
      <pubDate>Thu, 16 Apr 2026 13:40:00 GMT</pubDate>
      <source url="https://openai.com/index/the-next-evolution-of-the-agents-sdk/">OpenAI</source>
      <category>OpenAI</category><category>Agents SDK</category><category>Developers</category><category>Sandboxes</category><category>Agent Infrastructure</category>
    </item>
    <item>
      <title>Ollama structured outputs finally tell local models to stop freelancing JSON</title>
      <link>https://usefulmachines.ai/posts/ollama-structured-outputs-local-json/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/ollama-structured-outputs-local-json/</guid>
      <description>Ollama’s new JSON-schema constraints bring sanity to local AI, replacing fragile regex parsing with actual validation boundaries.</description>
      <pubDate>Thu, 16 Apr 2026 13:15:00 GMT</pubDate>
      <source url="https://ollama.com/blog/structured-outputs">Ollama Blog</source>
      <category>Ollama</category><category>Local AI</category><category>Structured Outputs</category><category>Open Models</category><category>Developer Tools</category>
    </item>
    <item>
      <title>Anthropic&apos;s MCP admits that AI agents need standardized plumbing to survive</title>
      <link>https://usefulmachines.ai/posts/mcp-standard-plumbing-reality/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/mcp-standard-plumbing-reality/</guid>
      <description>The Model Context Protocol won’t magically fix unreliable agents, but it might replace the nightmare of bespoke integrations with a shared standard for connecting AI to your data.</description>
      <pubDate>Wed, 15 Apr 2026 17:25:00 GMT</pubDate>
      <source url="https://www.anthropic.com/news/model-context-protocol">Anthropic</source>
      <category>Anthropic</category><category>MCP</category><category>Claude</category><category>AI Agents</category><category>Developer Tools</category>
    </item>
    <item>
      <title>GitHub Copilot’s coding agent puts the AI exactly where it belongs: in a pull request</title>
      <link>https://usefulmachines.ai/posts/copilot-coding-agent-issue-loop/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/copilot-coding-agent-issue-loop/</guid>
      <description>Instead of demanding a new workflow, GitHub’s coding agent starts at an issue, works in a cloud environment, and submits a reviewable PR. It turns out the best AI interface is the one developers already use.</description>
      <pubDate>Tue, 14 Apr 2026 15:30:00 GMT</pubDate>
      <source url="https://github.blog/changelog/2025-05-19-github-copilot-coding-agent-in-public-preview/">GitHub Changelog</source>
      <category>GitHub Copilot</category><category>Coding Agents</category><category>Developer Workflow</category><category>GitHub Actions</category><category>Code Review</category>
    </item>
    <item>
      <title>Deep research only works if your AI isn&apos;t treating the entire internet like a junk drawer</title>
      <link>https://usefulmachines.ai/posts/deep-research-needs-source-discipline/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/deep-research-needs-source-discipline/</guid>
      <description>OpenAI’s deep research tool lets you restrict sources and interrupt runs. The real lesson isn&apos;t that AI can summarize the web, but that research is useless if you can&apos;t defend the citations later.</description>
      <pubDate>Fri, 10 Apr 2026 12:45:00 GMT</pubDate>
      <source url="https://openai.com/index/introducing-deep-research/">OpenAI</source>
      <category>Research</category><category>OpenAI</category><category>MCP</category><category>Productivity</category><category>AI Workflows</category>
    </item>
    <item>
      <title>Claude for Education hopes to be a tutor instead of a homework vending machine</title>
      <link>https://usefulmachines.ai/posts/claude-education-learning-mode/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/claude-education-learning-mode/</guid>
      <description>Anthropic&apos;s push into universities includes a &apos;Learning mode&apos; designed to guide students rather than just hand them the answers. It’s a noble idea that is about to collide with actual college students.</description>
      <pubDate>Wed, 08 Apr 2026 14:35:00 GMT</pubDate>
      <source url="https://www.anthropic.com/news/introducing-claude-for-education">Anthropic</source>
      <category>Anthropic</category><category>Claude</category><category>Education</category><category>AI Tutoring</category><category>Higher Education</category>
    </item>
    <item>
      <title>Llama 4 brings massive context windows and open-weight ambition</title>
      <link>https://usefulmachines.ai/posts/llama4-long-context-open-weights-check/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/llama4-long-context-open-weights-check/</guid>
      <description>The launch of Llama 4 Maverick and Scout is thrilling for the open ecosystem, promising MoE scale and multimodality. Now builders need to stop clapping and start testing hardware reality.</description>
      <pubDate>Tue, 07 Apr 2026 14:35:00 GMT</pubDate>
      <source url="https://huggingface.co/blog/llama4-release">Hugging Face</source>
      <category>Llama</category><category>Hugging Face</category><category>Open Weights</category><category>Long Context</category><category>Multimodal AI</category>
    </item>
    <item>
      <title>Chatbots are becoming a news habit, but trust hasn&apos;t packed a bag</title>
      <link>https://usefulmachines.ai/posts/ai-chatbots-news-trust-gap/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/ai-chatbots-news-trust-gap/</guid>
      <description>The Reuters Institute&apos;s Digital News Report highlights a familiar media crisis and a new behavior: people are asking chatbots for the news. The interface is changing faster than the trust rituals can adapt.</description>
      <pubDate>Fri, 03 Apr 2026 15:50:00 GMT</pubDate>
      <source url="https://reutersinstitute.politics.ox.ac.uk/digital-news-report/2025">Reuters Institute for the Study of Journalism</source>
      <category>News</category><category>AI Culture</category><category>Media</category><category>Trust</category><category>Chatbots</category>
    </item>
    <item>
      <title>OpenAI&apos;s Codex pay-as-you-go seats lower the enterprise drawbridge</title>
      <link>https://usefulmachines.ai/posts/codex-pay-as-you-go-teams/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/codex-pay-as-you-go-teams/</guid>
      <description>Codex-only seats for Business and Enterprise teams are a pricing move designed to make coding-agent pilots easier to start, measure, and quietly expand without terrifying the finance department.</description>
      <pubDate>Fri, 03 Apr 2026 12:35:00 GMT</pubDate>
      <source url="https://openai.com/index/codex-flexible-pricing-for-teams/">OpenAI</source>
      <category>OpenAI</category><category>Codex</category><category>Pricing</category><category>ChatGPT Business</category><category>Enterprise AI</category>
    </item>
    <item>
      <title>Agentspace is Google selling the boring prerequisite to enterprise AI</title>
      <link>https://usefulmachines.ai/posts/agentspace-enterprise-knowledge-layer/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/agentspace-enterprise-knowledge-layer/</guid>
      <description>Google’s Agentspace isn&apos;t pitching a humanoid robot coworker. It’s pitching permission-aware search, enterprise knowledge graphs, and Chrome distribution—the dry infrastructure where enterprise AI actually survives.</description>
      <pubDate>Thu, 02 Apr 2026 16:15:00 GMT</pubDate>
      <source url="https://cloud.google.com/blog/products/ai-machine-learning/google-agentspace-enables-the-agent-driven-enterprise">Google Cloud Blog</source>
      <category>Google Cloud</category><category>Agentspace</category><category>Enterprise AI</category><category>AI Agents</category><category>Search</category>
    </item>
    <item>
      <title>Mistral OCR is the ingestion layer your AI agents keep pretending they have</title>
      <link>https://usefulmachines.ai/posts/mistral-ocr-docs-as-prompt/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/mistral-ocr-docs-as-prompt/</guid>
      <description>Mistral’s new OCR API turns complex PDFs and images into structured, ordered text. For developers, it’s a reminder that no reasoning model can reliably recover structure that the parser chewed up.</description>
      <pubDate>Thu, 02 Apr 2026 14:45:00 GMT</pubDate>
      <source url="https://mistral.ai/news/mistral-ocr">Mistral AI</source>
      <category>Mistral</category><category>OCR</category><category>Parsing</category><category>RAG</category><category>Developer Tools</category>
    </item>
    <item>
      <title>Gemini Robotics moves Google’s AI fight into the physical world</title>
      <link>https://usefulmachines.ai/posts/gemini-robotics-embodied-reasoning/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/gemini-robotics-embodied-reasoning/</guid>
      <description>Gemini Robotics and Gemini Robotics-ER bring multimodal reasoning to robots. The lesson isn&apos;t that a robot butler is arriving tomorrow, but that embodied AI leaves no room for demo theater.</description>
      <pubDate>Thu, 26 Mar 2026 14:55:00 GMT</pubDate>
      <source url="https://deepmind.google/blog/gemini-robotics-brings-ai-into-the-physical-world/">Google DeepMind</source>
      <category>Google DeepMind</category><category>Gemini</category><category>Robotics</category><category>Embodied AI</category><category>Multimodal AI</category>
    </item>
    <item>
      <title>ChatGPT&apos;s shopping updates are a play for the messy middle of product discovery</title>
      <link>https://usefulmachines.ai/posts/chatgpt-product-discovery-shopping/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/chatgpt-product-discovery-shopping/</guid>
      <description>OpenAI is expanding ChatGPT&apos;s commerce capabilities with visual browsing and comparisons. The real battle isn&apos;t about owning the checkout button; it&apos;s about influencing the shopper before the cart even appears.</description>
      <pubDate>Wed, 25 Mar 2026 15:10:00 GMT</pubDate>
      <source url="https://openai.com/index/powering-product-discovery-in-chatgpt/">OpenAI</source>
      <category>OpenAI</category><category>ChatGPT</category><category>Commerce</category><category>Shopping</category><category>ACP</category>
    </item>
    <item>
      <title>The Associated Press AI rules remember that fluency is not journalism</title>
      <link>https://usefulmachines.ai/posts/ap-ai-standards-editorial-trust/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/ap-ai-standards-editorial-trust/</guid>
      <description>The AP treats generative AI as unvetted source material and bans it from creating publishable content. It’s an unusually clean defense of human accountability in an era of automated confidence.</description>
      <pubDate>Tue, 24 Mar 2026 18:05:00 GMT</pubDate>
      <source url="https://www.ap.org/the-definitive-source/behind-the-news/standards-around-generative-ai/">The Associated Press</source>
      <category>Media</category><category>Generative AI</category><category>Trust</category><category>Journalism</category><category>AI Culture</category>
    </item>
    <item>
      <title>Qwen3 turns AI reasoning into a budget knob for pragmatic builders</title>
      <link>https://usefulmachines.ai/posts/qwen3-reasoning-budget-open-weights/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/qwen3-reasoning-budget-open-weights/</guid>
      <description>Qwen3’s open-weight release spans dense models, big MoEs, and hybrid thinking modes under an Apache 2.0 license. The real feature isn&apos;t magic; it&apos;s total control over your inference budget.</description>
      <pubDate>Tue, 24 Mar 2026 15:20:00 GMT</pubDate>
      <source url="https://qwenlm.github.io/blog/qwen3/">Qwen</source>
      <category>Qwen</category><category>Open Weights</category><category>Reasoning Models</category><category>Apache 2.0</category><category>Agentic AI</category>
    </item>
    <item>
      <title>Claude&apos;s web search is useful, but please put away the truth confetti</title>
      <link>https://usefulmachines.ai/posts/claude-web-search-citation-gap/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/claude-web-search-citation-gap/</guid>
      <description>Claude can now search the web and cite its sources, bringing much-needed freshness to its answers. But a footnote is just a handle for verification, not a guarantee of absolute truth.</description>
      <pubDate>Fri, 20 Mar 2026 16:45:00 GMT</pubDate>
      <source url="https://claude.com/blog/web-search">Claude Blog</source>
      <category>Anthropic</category><category>Claude</category><category>Web Search</category><category>Citations</category><category>Research</category>
    </item>
    <item>
      <title>Grok Business is xAI trying to put an enterprise suit on the internet gremlin</title>
      <link>https://usefulmachines.ai/posts/grok-business-enterprise-vault/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/grok-business-enterprise-vault/</guid>
      <description>xAI is pitching Grok Business and Grok Enterprise with Drive access, audit controls, and a dedicated Vault. The challenge isn&apos;t building the checklist; it&apos;s convincing buyers the chaos machine can be boring on command.</description>
      <pubDate>Fri, 20 Mar 2026 13:55:00 GMT</pubDate>
      <source url="https://x.ai/news/grok-business">xAI</source>
      <category>xAI</category><category>Grok Business</category><category>Enterprise AI</category><category>Privacy</category><category>RAG</category>
    </item>
    <item>
      <title>MCP gives AI workflows a front door instead of a hole in the fence</title>
      <link>https://usefulmachines.ai/posts/mcp-workflows-need-front-door/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/mcp-workflows-need-front-door/</guid>
      <description>Anthropic&apos;s Model Context Protocol is technical plumbing that gives AI assistants structured access to your company&apos;s data, proving that safely opening the front door is better than throwing agents into the corporate swamp.</description>
      <pubDate>Thu, 19 Mar 2026 15:35:00 GMT</pubDate>
      <source url="https://www.anthropic.com/news/model-context-protocol">Anthropic</source>
      <category>MCP</category><category>Anthropic</category><category>Workflows</category><category>Knowledge Management</category><category>Team Operations</category>
    </item>
    <item>
      <title>MCP is the boring connector layer agents needed before everyone built the same adapter pile twice</title>
      <link>https://usefulmachines.ai/posts/mcp-connector-standard-builders/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/mcp-connector-standard-builders/</guid>
      <description>MCP gives AI tools a standard way to connect to data and systems, replacing bespoke integration nightmares with a unified, boring architecture.</description>
      <pubDate>Thu, 19 Mar 2026 13:25:00 GMT</pubDate>
      <source url="https://www.anthropic.com/news/model-context-protocol">Anthropic</source>
      <category>MCP</category><category>Anthropic</category><category>Agents</category><category>Developer Tools</category><category>Integrations</category>
    </item>
    <item>
      <title>Ironwood is Google saying inference is where the money gets serious</title>
      <link>https://usefulmachines.ai/posts/ironwood-inference-economics/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/ironwood-inference-economics/</guid>
      <description>Google&apos;s Ironwood TPU proves that while training gets the prestige, inference is where the AI economy actually fights for its margins.</description>
      <pubDate>Wed, 18 Mar 2026 13:40:00 GMT</pubDate>
      <source url="https://blog.google/innovation-and-ai/infrastructure-and-cloud/google-cloud/ironwood-tpu-age-of-inference/">Google Blog</source>
      <category>Google Cloud</category><category>TPU</category><category>AI Infrastructure</category><category>Inference</category><category>Agents</category>
    </item>
    <item>
      <title>GPT-5.4 mini and nano are the cost-control models hiding under the glamour layer</title>
      <link>https://usefulmachines.ai/posts/gpt-5-4-mini-nano-cost-latency/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/gpt-5-4-mini-nano-cost-latency/</guid>
      <description>OpenAI’s GPT-5.4 mini and nano models are the unglamorous, cost-controlling workhorses that make complex agent systems economically viable.</description>
      <pubDate>Wed, 18 Mar 2026 13:05:00 GMT</pubDate>
      <source url="https://openai.com/index/introducing-gpt-5-4-mini-and-nano/">OpenAI</source>
      <category>OpenAI</category><category>GPT-5.4</category><category>Small Models</category><category>Codex</category><category>API</category>
    </item>
    <item>
      <title>The EU AI Act says your face should not become a workplace KPI</title>
      <link>https://usefulmachines.ai/posts/eu-ai-act-workplace-boundaries/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/eu-ai-act-workplace-boundaries/</guid>
      <description>The EU AI Act draws a hard line against workplace emotion recognition, rejecting the idea that human faces should be harvested for productivity metrics.</description>
      <pubDate>Sat, 14 Mar 2026 11:25:00 GMT</pubDate>
      <source url="https://digital-strategy.ec.europa.eu/en/policies/regulatory-framework-ai">European Commission</source>
      <category>EU AI Act</category><category>Privacy</category><category>Workplace</category><category>Policy</category><category>AI Culture</category>
    </item>
    <item>
      <title>Claude Code puts the agent in the terminal, which is brave and mildly terrifying</title>
      <link>https://usefulmachines.ai/posts/claude-code-terminal-agent-test/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/claude-code-terminal-agent-test/</guid>
      <description>Anthropic’s Claude Code drops the agent directly into the terminal, proving that the real test of AI is safely navigating a messy codebase.</description>
      <pubDate>Fri, 13 Mar 2026 17:10:00 GMT</pubDate>
      <source url="https://www.anthropic.com/news/claude-3-7-sonnet">Anthropic</source>
      <category>Anthropic</category><category>Claude Code</category><category>Developer Tools</category><category>Coding Agents</category><category>Terminal</category>
    </item>
    <item>
      <title>xAI’s $20B round is the compute arms race removing its indoor voice</title>
      <link>https://usefulmachines.ai/posts/xai-series-e-compute-arms-race/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/xai-series-e-compute-arms-race/</guid>
      <description>xAI’s massive $20B Series E isn&apos;t just a funding round—it&apos;s a clear signal that frontier AI has become a brutal capital-to-compute conversion engine.</description>
      <pubDate>Fri, 13 Mar 2026 15:30:00 GMT</pubDate>
      <source url="https://x.ai/news/series-e">xAI</source>
      <category>xAI</category><category>Funding</category><category>Grok</category><category>Colossus</category><category>AI Infrastructure</category>
    </item>
    <item>
      <title>Mistral Small 3.1 is open-model progress in its most dangerous form: actually deployable</title>
      <link>https://usefulmachines.ai/posts/mistral-small-31-local-workhorse/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/mistral-small-31-local-workhorse/</guid>
      <description>Mistral Small 3.1 proves that the most important open models aren&apos;t the largest ones, but the ones you can actually afford to deploy locally.</description>
      <pubDate>Thu, 12 Mar 2026 13:40:00 GMT</pubDate>
      <source url="https://mistral.ai/news/mistral-small-3-1">Mistral AI</source>
      <category>Mistral</category><category>Open Models</category><category>Apache 2.0</category><category>Multimodal AI</category><category>Local AI</category>
    </item>
    <item>
      <title>The best AI automation still knows when to bother a human</title>
      <link>https://usefulmachines.ai/posts/automation-human-checkpoints-2026/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/automation-human-checkpoints-2026/</guid>
      <description>Zapier&apos;s look at the future of workflow automation emphasizes human-in-the-loop systems, proving that the best AI knows when to step back.</description>
      <pubDate>Thu, 12 Mar 2026 13:10:00 GMT</pubDate>
      <source url="https://zapier.com/resources/events/automation-now-next-ai-agents-and-whats-coming-in-2026">Zapier</source>
      <category>Automation</category><category>Zapier</category><category>AI Workflows</category><category>MCP</category><category>Operations</category>
    </item>
    <item>
      <title>Gemini 2.5 Flash turns “thinking” into a knob developers can price</title>
      <link>https://usefulmachines.ai/posts/gemini-25-flash-thinking-budget/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/gemini-25-flash-thinking-budget/</guid>
      <description>Google&apos;s Gemini 2.5 Flash treats AI reasoning as an adjustable slider, giving developers the power to balance cost, latency, and intelligence.</description>
      <pubDate>Wed, 11 Mar 2026 15:05:00 GMT</pubDate>
      <source url="https://developers.googleblog.com/en/start-building-with-gemini-25-flash/">Google Developers Blog</source>
      <category>Google</category><category>Gemini</category><category>Gemini API</category><category>Developer Tools</category><category>Inference Cost</category>
    </item>
    <item>
      <title>OpenAI&apos;s Responses API makes building agents easier, and leaving much harder</title>
      <link>https://usefulmachines.ai/posts/responses-api-agent-stack/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/responses-api-agent-stack/</guid>
      <description>OpenAI&apos;s new Responses API and built-in tools want to be your entire agent stack. The convenience is undeniable, but it comes at the steep cost of vendor lock-in.</description>
      <pubDate>Tue, 10 Mar 2026 14:00:00 GMT</pubDate>
      <source url="https://openai.com/index/new-tools-for-building-agents/">OpenAI</source>
      <category>OpenAI</category><category>Responses API</category><category>Agents</category><category>APIs</category><category>Developer Tools</category>
    </item>
    <item>
      <title>Grok Imagine API is xAI betting video generation needs speed more than magic</title>
      <link>https://usefulmachines.ai/posts/grok-imagine-api-video-cost-latency/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/grok-imagine-api-video-cost-latency/</guid>
      <description>xAI’s new video API pitches generation and editing on speed and cost. It’s a bet that creative teams care less about the first cinematic demo and more about the economics of the seventeenth revision.</description>
      <pubDate>Mon, 09 Mar 2026 14:45:00 GMT</pubDate>
      <source url="https://x.ai/news/grok-imagine-api">xAI</source>
      <category>xAI</category><category>Grok Imagine</category><category>Video Generation</category><category>Creative Tools</category><category>API</category>
    </item>
    <item>
      <title>The AI copyright fight is really a battle over industrial-scale memory</title>
      <link>https://usefulmachines.ai/posts/copyright-office-ai-training-reckoning/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/copyright-office-ai-training-reckoning/</guid>
      <description>The U.S. Copyright Office’s AI reports provide a public record for the cultural argument artists are making: what happens when human labor becomes the training substrate for its own replacement?</description>
      <pubDate>Fri, 06 Mar 2026 17:30:00 GMT</pubDate>
      <source url="https://www.copyright.gov/ai/">U.S. Copyright Office</source>
      <category>Copyright</category><category>AI Culture</category><category>Policy</category><category>Creative Work</category><category>Trust</category>
    </item>
    <item>
      <title>Claude 3.7 Sonnet correctly turns AI reasoning into a dial, not a whole new brain</title>
      <link>https://usefulmachines.ai/posts/claude-37-hybrid-reasoning-reality/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/claude-37-hybrid-reasoning-reality/</guid>
      <description>Anthropic’s hybrid reasoning model lets users choose whether they want a fast answer or a deep thought. It&apos;s the right product move in a market cluttered with confusing model menus.</description>
      <pubDate>Fri, 06 Mar 2026 15:30:00 GMT</pubDate>
      <source url="https://www.anthropic.com/news/claude-3-7-sonnet">Anthropic</source>
      <category>Anthropic</category><category>Claude</category><category>Claude 3.7 Sonnet</category><category>Reasoning Models</category><category>AI Workflows</category>
    </item>
    <item>
      <title>ChatGPT in Excel is OpenAI volunteering for spreadsheet archaeology</title>
      <link>https://usefulmachines.ai/posts/chatgpt-excel-finance-workflows/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/chatgpt-excel-finance-workflows/</guid>
      <description>Putting ChatGPT inside Excel isn&apos;t about magical insights. It&apos;s about automating the miserable middle of finance work: tracing formulas, building scenarios, and untangling inherited models.</description>
      <pubDate>Fri, 06 Mar 2026 14:20:00 GMT</pubDate>
      <source url="https://openai.com/index/chatgpt-for-excel/">OpenAI</source>
      <category>OpenAI</category><category>ChatGPT</category><category>Excel</category><category>Finance</category><category>Spreadsheets</category>
    </item>
    <item>
      <title>xAI joining SpaceX gives Grok a massive, rocket-powered distribution edge</title>
      <link>https://usefulmachines.ai/posts/xai-spacex-acquisition-distribution-machine/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/xai-spacex-acquisition-distribution-machine/</guid>
      <description>The official note is tiny, but the implications are huge. Grok is moving closer to Starlink, SpaceX operations, and a global hardware network where AI can be tested in real-world extremes.</description>
      <pubDate>Wed, 04 Mar 2026 16:15:00 GMT</pubDate>
      <source url="https://x.ai/news/xai-joins-spacex">xAI</source>
      <category>xAI</category><category>SpaceX</category><category>Grok</category><category>Elon Musk</category><category>Distribution</category>
    </item>
    <item>
      <title>Gemini 2.5 Pro proves Google thinks reasoning should be a baseline, not a special mode</title>
      <link>https://usefulmachines.ai/posts/gemini-25-pro-reasoning-default/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/gemini-25-pro-reasoning-default/</guid>
      <description>Google’s Gemini 2.5 Pro makes thinking behavior a default feature. It&apos;s a strategic bet that long-context workflows and agents require built-in reasoning to avoid compounding errors.</description>
      <pubDate>Wed, 04 Mar 2026 14:20:00 GMT</pubDate>
      <source url="https://blog.google/innovation-and-ai/models-and-research/google-deepmind/gemini-model-thinking-updates-march-2025/">Google Blog</source>
      <category>Google</category><category>Gemini</category><category>Reasoning Models</category><category>Agents</category><category>Long Context</category>
    </item>
    <item>
      <title>Stop drawing AI agent org charts and start writing operating rules</title>
      <link>https://usefulmachines.ai/posts/agent-org-charts-need-operating-rules/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/agent-org-charts-need-operating-rules/</guid>
      <description>Microsoft’s Frontier Firm vision of hybrid AI teams is compelling, but in practice companies need just three things: one human owner, one repeatable workflow, and a clear way to review failures.</description>
      <pubDate>Wed, 04 Mar 2026 14:20:00 GMT</pubDate>
      <source url="https://www.microsoft.com/en-us/worklab/work-trend-index/2025-the-year-the-frontier-firm-is-born">Microsoft WorkLab</source>
      <category>Agents</category><category>Team Operations</category><category>Microsoft</category><category>Productivity</category><category>AI Workflows</category>
    </item>
    <item>
      <title>DeepSeek R1 forces closed AI labs to justify their reasoning premium</title>
      <link>https://usefulmachines.ai/posts/deepseek-r1-open-reasoning-price/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/deepseek-r1-open-reasoning-price/</guid>
      <description>DeepSeek R1 combines MIT-licensed weights, distilled checkpoints, and aggressive pricing to make open reasoning a practical engineering option rather than just a philosophical debate.</description>
      <pubDate>Mon, 02 Mar 2026 14:05:00 GMT</pubDate>
      <source url="https://api-docs.deepseek.com/news/news250120">DeepSeek API Docs</source>
      <category>DeepSeek</category><category>Open Models</category><category>Reasoning Models</category><category>MIT License</category><category>Local AI</category>
    </item>
    <item>
      <title>SWE-bench Verified has maxed out, and it&apos;s time to build your own private coding evals</title>
      <link>https://usefulmachines.ai/posts/swe-bench-verified-benchmark-ceiling/</link>
      <guid isPermaLink="true">https://usefulmachines.ai/posts/swe-bench-verified-benchmark-ceiling/</guid>
      <description>OpenAI is moving on from SWE-bench Verified because the benchmark has degraded. It’s a harsh reminder that public leaderboards cannot replace private evaluations based on your actual codebase.</description>
      <pubDate>Sun, 01 Mar 2026 15:10:00 GMT</pubDate>
      <source url="https://openai.com/index/why-we-no-longer-evaluate-swe-bench-verified/">OpenAI</source>
      <category>Benchmarks</category><category>SWE-bench</category><category>Coding Agents</category><category>OpenAI</category><category>Developer Tools</category>
    </item>
  </channel>
</rss>
