Reviews · Featured review · MAY 5, 2026
Reviewed: GPT-5.5 Instant ships as ChatGPT's new default with a 52.5% hallucination-reduction claim
OpenAI's May 5 update to the default ChatGPT model promises sharper answers on medicine, law, and finance. The headline number is internal; the rollout is universal.
Latest from the desk
-
Multimodal · MAY 19, 2026
Google Gemini Omni: world-understanding multimodal at scale, any-input-to-any-output
Announced at Google I/O on May 19, Gemini Omni is positioned as a leap in world understanding, multimodality, and editing — generating any output from any input, starting with video.
-
Infrastructure · MAY 12, 2026
vLLM v0.20.2 ships Model Runner V2: up to 56% higher throughput on GB200
The May 2026 stable release of vLLM bundles a new GPU-native Triton kernel async-scheduling stack, FP8 inference, and continuous batching as the default.
-
Reviews · MAY 6, 2026
Claude Code goes agentic at Code w/ Claude: Managed Agents, higher rate limits, and self-hosted sandboxes
Anthropic used the May 6 opening of its developer conference to ship a coordinated coding-platform release — the most significant one since Claude Code's general availability last spring.
-
Benchmarks · MAY 5, 2026
Claude Opus 4.7 leads Vals AI's Finance Agent benchmark at 64.4%; tops GDPval-AA
Anthropic's finance-tuned model debuted at the lab's May 5 invite-only briefing in New York. The two benchmark headlines come with the usual caveats — and one new variable for the benchmarks desk to track.