← Back to LLM Wiki
LLM Wiki · Open Weights · Self-Hostable

Qwen

Alibaba's open-weights family — Qwen3, Qwen3-Coder — paired with the Qwen-Agent framework. The reference open model for coding agents.
Qwen is the open-weights family that consistently ships strongest on tool use and coding. Qwen3-Coder rivals closed frontier models on code benchmarks, and the Qwen-Agent framework bundles tool calling, MCP support, and browser / code-interpreter primitives. Apache 2.0 licensing removes the scale restrictions that ship with Llama.
Alibaba Open Weights Agent Framework Coder Apache 2.0

Quick Facts

Vendor
Alibaba Cloud (Hangzhou)
Released
Qwen (2023); Qwen3 (2025)
Current line
Qwen3 (0.6B–235B) · Qwen3-Coder · Qwen-Agent
License
Apache 2.0 (most variants)
Hosting
Self-hosted (vLLM, Ollama); hosted via DashScope, Together, Fireworks
Context window
32K–1M tokens depending on variant
Modalities
Text, vision (Qwen-VL), audio (Qwen-Audio)
Architecture
Dense and MoE variants

Summary

Qwen is Alibaba's open-weights LLM family, first released in 2023. Qwen3 (2025) shipped a tiered lineup from 0.6B to 235B parameters, with mixture-of-experts at the top and dense models filling out the smaller tiers. Qwen3-Coder is the specialized coding variant and, on most benchmarks, the strongest open model for agentic code tasks.

The Qwen-Agent framework is as important as the weights. It wraps tool calling, MCP server integration, a Python code interpreter, a browsing agent, and retrieval into a cohesive agent primitive — the same design patterns Claude Agent SDK and Stripe's Minions use, but on open infrastructure. For teams building managed AI services that can't rely on closed APIs, Qwen + Qwen-Agent is the closest thing to a turnkey stack.

Model Lineup

Where Qwen Fits

Qwen is the default open model for coding agents and for workloads where tool-use reliability matters more than raw benchmark scores. Apache 2.0 licensing sidesteps the enterprise-scale restrictions in Llama's license. Within the Claw ecosystem, Qwen3-Coder powers OpenClaw and NanoClaw — the agent runtimes that handle interior code loops on customer hardware. Qwen-Agent is the framework the ecosystem is named after.

Tradeoffs

Deployment Notes

Qwen3-Coder is the engine behind OpenClaw and NanoClaw. It runs on customer Mac Minis via vLLM or llama.cpp, with Qwen-Agent wrapping the agent loop and MCP bridges connecting to Claude Code Slack, FrawdBot, and the observability stack. For teams that want a pure-open stack — no closed API calls — Qwen is typically the highest-capability choice. See The Agent Infrastructure Stack for the full topology.

References

  1. Qwen — Alibaba
  2. QwenLM on GitHub
  3. Qwen-Agent framework
  4. The Agent Infrastructure Stack — Organized AI