GLM 5.2 Fast + Gemini's Computer Use: Week's Dev Wins

Tool of the Week

GLM 5.2 Fast ships on Wafer via AI Gateway

Wafer-backed GLM 5.2 Fast delivers 2x higher throughput than competing serverless providers, with 170+ tok/s on small context and 200+ tok/s on large context.

Decode speed directly affects streaming latency in production; 2x throughput means faster token generation for sustained workloads without provider switching. AI Gateway unifies billing, retry logic, and usage tracking across models.
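
To put the throughput figures in concrete terms, here is a back-of-the-envelope sketch of decode time for a streamed response. The 1,000-token response length and the 85 tok/s comparison rate are assumptions chosen for illustration, not measurements, and the estimate ignores time-to-first-token, network overhead, and queueing.

```typescript
// Rough decode-time estimate: output tokens divided by sustained decode rate.
function decodeSeconds(outputTokens: number, tokensPerSecond: number): number {
  if (outputTokens < 0 || tokensPerSecond <= 0) {
    throw new RangeError("outputTokens must be >= 0 and tokensPerSecond > 0");
  }
  return outputTokens / tokensPerSecond;
}

// Assumed response length, for illustration only.
const responseTokens = 1_000;

// 170 tok/s is the small-context floor quoted in this item.
const fast = decodeSeconds(responseTokens, 170);
// 85 tok/s is a hypothetical provider running at half that rate.
const baseline = decodeSeconds(responseTokens, 85);

console.log(`170 tok/s: ${fast.toFixed(1)} s`); // prints "170 tok/s: 5.9 s"
console.log(`85 tok/s: ${baseline.toFixed(1)} s`); // prints "85 tok/s: 11.8 s"
```

Under those assumptions, doubling decode throughput roughly halves the time a user waits for the full response, and the saving grows with longer outputs.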

Drop-in replacement via model ID `zai/glm-5.2-fast` in Vercel AI SDK. Requires AI Gateway account; zero platform fee on inference. Worth testing now if you run streaming text generation or have context-heavy workloads.
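
A minimal sketch of what "drop-in" means here: only the `model` field changes and the rest of the request stays as it was. The `TextRequest` shape, the `withGlmFast` helper, and the placeholder starting model ID are hypothetical names for illustration; the only value taken from the announcement is `zai/glm-5.2-fast`.

```typescript
// Minimal request shape for this sketch. Real SDK calls accept more options;
// `model` and `prompt` are enough to show the swap.
interface TextRequest {
  model: string;
  prompt: string;
}

// Model ID from the announcement.
const GLM_FAST_MODEL_ID = "zai/glm-5.2-fast";

// Hypothetical helper: returns a copy of the request pointed at GLM 5.2 Fast,
// leaving every field other than `model` untouched.
function withGlmFast(request: TextRequest): TextRequest {
  return { ...request, model: GLM_FAST_MODEL_ID };
}

const original: TextRequest = {
  model: "some-provider/some-model", // placeholder, not a real model ID
  prompt: "Summarize the release notes in three bullets.",
};
const swapped = withGlmFast(original);

console.log(swapped.model); // prints "zai/glm-5.2-fast"
```

Assuming a recent Vercel AI SDK version that accepts Gateway model ID strings and an AI Gateway key in your environment, an object like `swapped` is what you would hand to the SDK's text-generation call. That call is left out so the sketch runs without network access.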

“Wafer delivers a 2x higher throughput than other providers serving GLM-5.2 on serverless”
“Small context: 170+ tok/s”
“Large context: 200+ tok/s”
“set `model` to `zai/glm-5.2-fast`”

llm-inference · glm-5.2 · serverless · throughput · ai-gateway

Dev Signal

Quick Signals

Claude Tag launches Slack integration for team workflows

Claude joins Slack channels as a multiplayer teammate with persistent context, tool access, and async task execution, replacing the previous Claude Slack app.

Shifts Claude from a single-chat tool to a shared team member that retains channel context and learns from shared work over time, reducing repetitive context-setting and enabling parallel task delegation. Anthropic reports that 65% of its product team's code is now created by its internal version of Claude Tag.

Replaces existing Claude in Slack app (30-day migration window). Requires admin setup: define channel-scoped tool/data access, set spend limits, configure permissions isolation. Ready now for Enterprise and Team customers in beta. Worth adopting immediately if you run multi-person code/data workflows in Slack; opting in triggers introductory launch credits.

“Today, 65% of our product team's code is created by our internal version of Claude Tag”
“@Claude is multiplayer. Within a given Slack channel, there's one Claude that interacts with everyone”
“@Claude learns over time. As Claude follows along with its channel, it builds more context about the work”
“If 'ambient' behavior is enabled, Claude will proactively keep you updated about whatever it thinks you might need to know”
“Claude Tag replaces the existing Claude in Slack app. To migrate, administrators can opt in within 30 days”
“Claude Tag works with Opus 4.8”

slack-integration · team-ai · claude · async-workflows · tool-access

Gemini 3.5 Flash adds native computer use capability

Data Point

Open far-field ASR benchmark measures real acoustic gaps

FFASR Leaderboard quantifies word error rate (WER) degradation across 14 simulated rooms with sim-to-real validation, replacing proprietary evaluation pipelines with reproducible far-field metrics.

Models scoring well on clean-speech benchmarks often degrade substantially in deployment. This leaderboard exposes acoustic robustness gaps and speed-accuracy tradeoffs that matter for voice agents, robotics, and in-car systems before production.
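
For readers who have not computed it by hand, WER is conventionally the word-level edit distance between a reference transcript and the model's output, divided by the number of reference words. The sketch below implements that standard definition. The transcripts are made up for illustration, and the leaderboard's own scoring pipeline (text normalization, tokenization) may differ.

```typescript
// Word error rate: (substitutions + deletions + insertions) / reference words,
// computed here as word-level Levenshtein distance.
function wordErrorRate(reference: string, hypothesis: string): number {
  const ref = reference.trim().split(/\s+/).filter(Boolean);
  const hyp = hypothesis.trim().split(/\s+/).filter(Boolean);
  if (ref.length === 0) {
    throw new RangeError("reference must contain at least one word");
  }
  // prev[j] holds the distance between the reference prefix processed so far
  // and the first j hypothesis words.
  let prev: number[] = Array.from({ length: hyp.length + 1 }, (_, j) => j);
  for (let i = 1; i <= ref.length; i++) {
    const curr: number[] = [i];
    for (let j = 1; j <= hyp.length; j++) {
      const substitution = prev[j - 1] + (ref[i - 1] === hyp[j - 1] ? 0 : 1);
      const deletion = prev[j] + 1;
      const insertion = curr[j - 1] + 1;
      curr.push(Math.min(substitution, deletion, insertion));
    }
    prev = curr;
  }
  return prev[hyp.length] / ref.length;
}

// Made-up transcripts: the same utterance as heard up close and across a room.
const spoken = "turn on the kitchen lights";
const nearField = "turn on the kitchen lights";
const farField = "turn on kitchen light";

console.log(wordErrorRate(spoken, nearField)); // prints 0
console.log(wordErrorRate(spoken, farField)); // prints 0.4
```

In this toy case one dropped word and one misheard word take a five-word command from 0% to 40% WER, which is the kind of gap a clean-speech benchmark will not show.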

Ready now. Submit your ASR model to Hugging Face leaderboard at https://huggingface.co/spaces/treble-technologies/ffasr. Requires inference on NVIDIA L4 GPU under standardized conditions. Replaces ad-hoc far-field testing with reproducible ranking. Moving-source evaluation still in beta; multi-talker scenarios coming later.

“far-field WER at low SNR is consistently several times higher than near-field WER on the same speech content”
“The gap between benchmark performance and real-world deployment is one of the more persistent frustrations in ASR development”
“hybrid wave-based simulation, sim-to-real validation, moving-source splits in beta, held-out audio, and standardized evaluation hardware across all submissions”
“Fourteen fully furnished rooms are included in the benchmark, ranging from 20 to 470 m³”
“the Pareto front view in the Analysis tab makes that tradeoff explicit”
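
As a rough illustration of what a Pareto front view surfaces, the sketch below keeps only the results that no other result beats on both axes at once. The model names, the numbers, and the choice of real-time factor as the speed axis are all assumptions for illustration; the leaderboard's actual axes and data may differ.

```typescript
interface AsrResult {
  name: string;
  wer: number; // word error rate, lower is better
  realTimeFactor: number; // assumed speed metric: processing time / audio time, lower is better
}

// `a` dominates `b` when it is at least as good on both axes
// and strictly better on at least one.
function dominates(a: AsrResult, b: AsrResult): boolean {
  const noWorse = a.wer <= b.wer && a.realTimeFactor <= b.realTimeFactor;
  const strictlyBetter = a.wer < b.wer || a.realTimeFactor < b.realTimeFactor;
  return noWorse && strictlyBetter;
}

// The Pareto front is the set of results that no other result dominates.
function paretoFront(results: AsrResult[]): AsrResult[] {
  return results.filter(
    (candidate) => !results.some((other) => dominates(other, candidate)),
  );
}

// Made-up results, not leaderboard data.
const results: AsrResult[] = [
  { name: "model-a", wer: 0.12, realTimeFactor: 0.5 },
  { name: "model-b", wer: 0.18, realTimeFactor: 0.1 },
  { name: "model-c", wer: 0.2, realTimeFactor: 0.3 },
  { name: "model-d", wer: 0.15, realTimeFactor: 0.25 },
];

console.log(paretoFront(results).map((r) => r.name).join(", "));
// prints "model-a, model-b, model-d"
```

Here `model-c` drops out because `model-b` is both more accurate and faster. The remaining three are each a defensible pick depending on how much latency you can trade for accuracy.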

asr · far-field-audio · benchmark · acoustic-robustness · voice-agents

LangSmith Engine clusters agent failures automatically

Cloudflare Email Service enters public beta
