Gateway routing, agents ship, Mistral TTS drops

Tool of the Week

AI Gateway routing rules block and redirect models

Apply firewall-style model rewrites and denials at the gateway level—no code changes needed when models fail or get retired.

Eliminates deploy cycles for model switching. When a model goes down or you need to enforce team policies, a single CLI command applies instantly across all requests using those credentials.

Replaces in-app fallback logic and manual code deployments for model substitution. Requires Vercel AI Gateway and CLI access; works immediately for Rewrite (swap one model for another) and Deny (block requests with 403). Ready now in beta—low friction if already on Vercel stack.

“Routing rules are firewall-style rules that control which models your team can use, applied at the gateway level instead of in your application code”
“When a model goes down or gets retired, you usually have to ship a code change to move off it. With routing rules, you push one rule and every request reroutes instantly”
“Rules apply to every request made with your team's AI Gateway credentials”
“Routing rules are in beta”

ai-gatewayroutingmodel-fallbackverceldeployments

Dev Signal

Get issues like this in your inbox — free, every weekday.

Quick Signals

Nano Banana 2 Lite replaces first-gen image model

Swappable image model generates 1K images in 4 seconds at $0.034/1K; pair with Gemini Omni Flash ($0.10/sec video) to chain image-to-video workflows.

Reduces latency for interactive prototyping and cuts per-image costs by optimizing for speed-first pipelines. Omni Flash adds video generation and natural-language editing to the same API surface, enabling end-to-end multimedia workflows without external tools.

Nano Banana 2 Lite is a drop-in replacement for gemini-2.5-flash-image; use it now for drafting and ideation. Omni Flash (gemini-omni-flash-preview) is production-ready for video generation but limited to 10-second outputs and lacks audio/scene extension on API. Start with image generation first, then test Omni Flash if your workflow needs video.

“Delivers text-to-image outputs in 4 seconds”
“Cost-efficient choice for developers focused on drafting, ideating, managing operational budgets or low-bandwidth usage”
“priced competitively at $0.10 per second of video output”
“it's our recommended replacement for developers currently using our first version of Nano Banana (gemini-2.5-flash-image), you can swap it out now for immediate benefits”
“Omni offers 10-second video generations currently, with longer durations coming soon”

image-generationvideo-apigeminilatency-optimizationmulti-modal

Ornith-1.0 open-source coding agents ship four sizes

MIT-licensed agentic models (9B–397B) trained with RL to optimize both solution rollouts and search scaffolds, available in dense and MoE variants with 256K context and OpenAI-compatible serving.

Data Point

Agents fail to teach users what they want

Interactive recommender agents achieve only 56% accuracy because they don't expand user knowledge during conversation—the bottleneck is preference formation, not item search.

If you're building agentic recommendation or preference-elicitation systems, this paper quantifies a hard constraint: clarifying questions alone don't work when users lack domain knowledge. Your agent needs explicit teaching mechanisms (examples, explanations) to move the needle on task specification.

This doesn't replace existing systems yet—it's a diagnostic. CoShop benchmark reveals that five-turn interactions with frontier models don't actually educate users about their own preferences. If you're shipping an agent that relies on user clarity, this is a warning: invest in knowledge-building dialog actions before optimizing search.

“no agent exceeds 56% accuracy on CoShop despite five turns of interaction”
“Failures stem not from agents' ability to find items, but from how little the interaction expands what users know about what they want”
“Users often lack the domain knowledge to have completely specified preferences”

agentsrecommendation-systemsuser-preferencesbenchmarkdialog-systems

Enjoying Dev Signal? Get every issue in your inbox.

Free forever · 3 issues a week · One-click unsubscribe

Refer a friend →

Earn rewards for every developer you bring in.

Go premium →

Sponsor-free feed · full archive search · $149 lifetime.

Gateway routing, agents ship, Mistral TTS drops

AI Gateway routing rules block and redirect models

Quick Signals

Nano Banana 2 Lite replaces first-gen image model

Ornith-1.0 open-source coding agents ship four sizes

Agents fail to teach users what they want

Vercel Private Blob exits beta, adds OIDC auth

Mistral releases Voxtral TTS with 4B parameters

Mistral releases connectors API for enterprise tool integration