For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
ModelsChatRankingsDocs
DocsAPI ReferenceClient SDKsAgent SDKCookbookChangelog
DocsAPI ReferenceClient SDKsAgent SDKCookbookChangelog
LogoLogo
ModelsChatRankingsDocs
On this page
  • May 21, 2026
  • Product changes
  • New models
  • May 20, 2026
  • Product changes
  • New models
  • May 19, 2026
  • Product changes
  • New models
  • May 18, 2026
  • Product changes
  • New models
  • May 17, 2026
  • Product changes
  • May 16, 2026
  • Product changes
  • May 15, 2026
  • Product changes
  • New models
  • May 14, 2026
  • Product changes
  • New models
  • May 13, 2026
  • Product changes
  • May 12, 2026
  • Product changes
  • New models

Changelog

A daily log of product changes, improvements, and new model releases on OpenRouter
May 21, 2026
May 21, 2026

May 20, 2026
May 20, 2026

May 19, 2026
May 19, 2026

May 18, 2026
May 18, 2026

May 17, 2026
May 17, 2026

May 16, 2026
May 16, 2026

May 15, 2026
May 15, 2026

May 14, 2026
May 14, 2026

May 13, 2026
May 13, 2026

May 12, 2026
May 12, 2026

Older posts

Next
Built with

Product changes

  • Export all chats from playground — Added an “Export All Chats” option to the chatroom sidebar menu to download all conversations as a single JSON file.
  • Filter menu badge counts — The activity filter menu now displays numeric badge counts for active selections per category and uses checkmarks instead of dots for selected items.
  • API keys on profile page — All API keys across workspaces are now visible on the user profile page with click-through navigation to each key’s detail page.
  • Improved coding agent attribution — Requests from Cursor, GitHub Copilot, Cline, RooCode, Kilo Code, Zed, and OpenCode are now properly identified and labeled in activity logs.
  • Fixed: chat export missing rooms — Exporting chats now includes all rooms, including those not recently visited.

New models

  • Qwen: Qwen3.7 Max

Product changes

  • Web fetch: Parallel engine — The web fetch server tool now supports Parallel as an engine option, bringing engine parity with web search.
  • Delete all chats in chatroom — Added a “Delete All Chats” option in the chatroom sidebar with a confirmation dialog for bulk-deleting all chat rooms at once.
  • Pareto Router default settings — The Pareto Router can now be configured with a default coding quality tier in workspace plugin settings, removing the need to pass min_coding_score on every API request.
  • Fixed: HTML-encoded characters in xAI tool-call arguments — xAI Grok models that intermittently emit &, <, or > inside tool-call argument JSON now have those entities decoded server-side so downstream clients receive clean arguments.
  • Fixed: compare page horizontal scroll at 4–5 models — The compare page now properly enables horizontal scrolling when comparing four or five models.

New models

  • xAI: Grok Build 0.1

Product changes

  • Download fusion responses as markdown — Fusion responses in the chatroom can now be downloaded as .md files with human-readable filenames based on the prompt title and timestamp.
  • Profile page improvements — User profiles now show per-model breakdowns for token and prompt counts, a discoverable profile picture change button, and a refreshed header layout.
  • Request ID filter in logs — The logs page now displays request IDs in generation details and supports filtering by request ID alongside generation ID and session ID.
  • Unified date range picker on activity page — The activity page now uses the same date range preset picker as the logs page for consistent date selection.
  • Collapsible citation icons in chatroom — When a chatroom response has more than five citation icons, only the first three are shown with an expander button to reveal the rest.
  • Model pinning for router sessions — Router models like openrouter/auto now pin to the same concrete model across requests sharing a session_id, improving prompt cache hit rates for multi-turn conversations.
  • Fixed: router fallback blocked by generation start events — Protocol-level generation start events no longer prevent provider fallback, so the router correctly retries with another provider when the first one errors before delivering content.

New models

  • Google: Gemini 3.5 Flash

Product changes

  • Move API keys between workspaces — You can now transfer API keys from one workspace to another, making it easy to reorganize keys as your team structure evolves.
  • Presets API for creating presets from inference requests — Added POST /api/v1/presets/:slug/chat/completions for creating or versioning a preset directly from a standard chat completions request body, now also available in the TypeScript and Python SDKs.
  • Date format preference — Added a date format selector in Settings with Default, Short, ISO, and Relative options that apply across the dashboard, logs, and API key pages.
  • Long-horizon agents cookbook — New cookbook guide covering how to build long-running agent tasks with the OpenRouter Agent SDK, including stop-condition ceilings, resumable state, streaming progress, and voice input.
  • Compare page empty state redesign — The compare page now shows interactive slot cards in the empty state, letting you select models directly without a separate search step.
  • Profile page layout — Profile pages now display name, avatar, email, role, and workspace info at the top of the page for a cleaner layout.
  • Fixed: Anthropic native web_fetch passthrough — Anthropic models with native web_fetch support now correctly receive the tool in upstream requests, restoring full web content fetching capability.
  • Fixed: compare page section headers not sticking on scroll — Section headers on the compare page now correctly dock under the model cards while scrolling.
  • Fixed: playground side panel losing generation state on close — Closing and reopening the playground side panel on model detail pages now preserves in-progress generation output.

New models

  • xAI: Grok Imagine Video
  • xAI: Grok Imagine Image Quality

Product changes

  • Fixed: rankings filter state not persisted in URL — Filters on the rankings page are now stored in the URL via query parameters, making filtered views shareable and bookmarkable.

Product changes

  • Bring-your-own-key management API — New REST endpoints for programmatically listing, retrieving, creating, updating, and deleting BYOK credentials across workspaces via management key authentication.
  • Observability destinations management API — New REST endpoints for listing, retrieving, creating, updating, and deleting observability destinations (Datadog, Langfuse, LangSmith, and more) via management key authentication.
  • Fixed: dashboard growth percentages exceeding display bounds — Very large growth percentages on the activity dashboard are now capped at >999% for readability.

Product changes

  • Cursor integration guide — Added a guide for configuring Cursor to route through OpenRouter via the Override OpenAI Base URL feature.
  • Fixed: /api/v1/models under-reporting context_length — Models whose cheapest endpoint has a lower context window than the model itself (e.g. moonshotai/kimi-k2.6 showing 32K instead of 256K) now correctly report the full model context length in the models API.
  • Fixed: web search unbounded context growth — The web search server tool now defaults max_total_results to 50 when not specified, preventing runaway token accumulation in agentic loops. Docs
  • Fixed: Mistral speech-to-text billing unit — Corrected the billing unit for Mistral speech-to-text models from per-second to per-minute to match the provider’s actual pricing. Docs

New models

  • Mistral: Voxtral Mini Transcribe

Product changes

  • Playground side panel on model pages — Model detail pages now include a “Playground” button in the header that opens a slide-out panel for testing the model inline, with support for all available modalities.
  • OpenCode integration guide — Added a guide for configuring OpenCode with OpenRouter to the coding agents documentation.
  • Fixed: BYOK model filter showing incomplete results — Bring-your-own-key provider pages now show all models a provider serves, not just models whose cheapest endpoint is from that provider.
  • Fixed: 500 errors for router model variant mismatches — Requests to router models with unavailable variants (e.g. openrouter/auto:thinking) now return a descriptive 404 instead of a generic 500 error.
  • Fixed: vertical scrollbar on models page modality tabs — Removed an unwanted vertical scrollbar on the models page modality filter tabs.

New models

  • xAI: Grok Voice TTS 1.0

Product changes

  • Fusion server tool now available via API and Chatroom — Fusion is now available as an API plugin, a server tool, and in the chatroom composer — it routes your prompt to multiple models in parallel and synthesizes their responses into a single, higher-quality answer.
  • Automatic prompt caching in the Responses API — The Responses API now accepts top-level cache_control for Anthropic Claude models, closing a gap where cached input tokens stayed at zero on /api/v1/responses. Docs
  • Logs page pagination toggle — The Generations tab on the logs page now lets you switch between “Load more” and traditional paged navigation, with your preference saved across sessions.
  • Replit community guide — Added a community guide for configuring your OpenRouter API key in Replit projects.
  • Fixed: Codex multi-turn reasoning context lost through OpenRouter — Codex CLI and Desktop conversations proxied through OpenRouter no longer lose reasoning context across turns, fixing issues where the model would repeat completed steps.
  • Fixed: chatroom artifacts ignoring prior prompts — Follow-up artifact generations now include the full prompt history, preventing the model from generating unrelated content when iterating.
  • Fixed: routing preferences ignored with fallbacks disabled — preferred_min_throughput and preferred_max_latency now take effect when allow_fallbacks is set to false. Docs
  • Fixed: mobile sidebar on iPad-size viewports — The sidebar drawer on tablet-sized screens no longer renders behind the backdrop overlay.
  • Fixed: Recraft vector model SVG output — Recraft vector model variants now return properly encoded image/svg+xml data URLs instead of incorrectly labeled raster output. Docs

Product changes

  • Private Models Beta — Published a new Private Models page covering the beta for routing to your own custom, fine-tuned, or dedicated model endpoints via the standard completions and responses endpoints.
  • Anthropic Claude Opus 4.7 Fast mode — speed: "fast" now routes Claude Opus 4.7 to Anthropic’s new fast-mode endpoint; the fast permaslug is also directly callable as anthropic/claude-opus-4.7-fast.
  • BYOK keys grouped by priority — The BYOK provider detail page now splits keys into a Prioritized section (tried in order) and a Fallback section (tried only after OpenRouter credits), each independently reorderable via drag-and-drop. Docs
  • Adaptive Exa highlights for web search — The default web-search server tool and web plugin now request adaptive highlights from Exa (roughly 2–4K characters per result) instead of a fixed 15K-character cap; set search_context_size to low, medium, or high to opt back into a larger fixed budget.
  • Fixed: playground preserves uploaded images after video generation — Submitting a video generation in the playground no longer clears the images you previously attached to the prompt.
  • Fixed: model compare page on mobile — The model compare page now scrolls horizontally with a sticky header row on narrow screens, and the “Add model” trigger no longer surfaces Quick suggestions.

New models

  • Anthropic: Claude Opus 4.7 (Fast)
  • Perceptron: Perceptron Mk1