self-hosted-ai air-gapped-networks open-source-models compliance inference-infrastructure

GitLab 19.0 expands self-hosted open source model support

Air-gapped deployments can now run Mistral, GLM, Kimi, and MiniMax models on local inference hardware via vLLM, keeping code in-network while maintaining agentic capability.

Summary

Teams under data residency or compliance constraints no longer sacrifice AI capability—you can match model size to task complexity (routine work vs. complex reasoning) without sending code to third-party APIs. Eliminates the single-model bottleneck that previously forced isolated environments to choose between overkill or underpowered.

Why it matters

Implementation verdict

Replaces cloud-dependent Duo Agent setups for regulated environments. Requires: vLLM serving platform, on-premises GPU hardware (or GPU VMs in VPC), and GitLab Duo Agent Platform Self-Hosted add-on. Ready now if you have infrastructure; hybrid deployments (mixing self-hosted + GitLab-managed models per feature) are supported. Contact sales to validate hardware requirements per model.

Sources

1.air-gapped environments have historically been the last to realize AI productivity gains
2.The newly supported models include: Mistral Devstral 2 123B, GLM-5.1, Kimi-K2.6, MiniMax-M2.7
3.GitLab Duo Agent Platform Self-Hosted supports mixing self-hosted models with GitLab-managed models per feature
4.The primary pattern is on-premises hardware running vLLM, GitLab's recommended serving platform for open source models

Dev Signal

Get briefs like this in your inbox — free, every weekday.

100+ sources compressed into one 4-minute read. Ranked, cited, implementation-ready.

Read the full issue →All briefs