Overview
Qwen AI Chat is useful when you want a practical read on the current Qwen release before switching a workflow. This page helps you separate Plus, Flash, core benchmarks, and the first tasks worth testing.
Hosted release
The April 2, 2026 launch centers on Qwen3.6-Plus, the hosted lane with 1M context and the broadest benchmark story.
The main April 2, 2026 launch is Qwen3.6-Plus. That is the hosted model with the 1M default context window and the broad benchmark story across code, files, tools, and visual inputs.
Open lane
The April 15, 2026 follow up adds Qwen3.6-35B-A3B, an open model that often appears in examples as qwen3.6-flash.
April 15, 2026 adds Qwen3.6-35B-A3B. It is the open model follow up and the lane that often appears in examples under the qwen3.6-flash API name.
Release lanes
This page separates the hosted lane from the open lane and the API name.
Plus is the hosted flagship and the cleanest place to start a serious evaluation. qwen3.6-flash is the API facing name that points to the 35B-A3B lane, which is better framed as a lighter or more open deployment option.
Context
It explains what long context changes for files, research notes, and planning.
It helps when you need one run to carry long files, research notes, product specs, screenshots, and follow up questions together. The value is less context switching and fewer manual summaries between steps.
Benchmarks
It highlights the rows that matter most for repo repair and terminal work.
Terminal-Bench 2.0 at 61.6, SWE-bench Pro at 56.6, and QwenClawBench at 57.2 make Plus worth testing on repo and terminal tasks.
Files
It covers document parsing, screenshot reading, and diagram reasoning.
OmniDocBench1.5 at 91.2 and AI2D_TEST at 94.4 make the file and diagram lane relevant for daily work.
Demos
It includes the best official examples for interface work and rapid prototyping.
Official web demos and QwenWebBench show how fast the model gets to a reviewable interface.
Migration
It calls out OpenAI compatible and Anthropic compatible interfaces for easier migration.
Qwen supports OpenAI compatible chat completions and responses APIs plus an Anthropic compatible interface, so teams can test without rebuilding their stack.
Why It Matters
The release now has enough public data to support a serious team test. Code, tools, files, diagrams, and screen tasks all appear in the official material.
Terminal-Bench 2.0 at 61.6, SWE-bench Pro at 56.6, and QwenClawBench at 57.2 make Plus worth testing on repo and terminal tasks.
TAU3-Bench at 70.7, DeepPlanning at 41.5, MCPMark at 48.2, and WideSearch at 74.3 show that multi-step tasks are part of the release.
OmniDocBench1.5 at 91.2 and AI2D_TEST at 94.4 make the file and diagram lane relevant for daily work.
Qwen supports OpenAI compatible chat completions and responses APIs plus an Anthropic compatible interface, so teams can test without rebuilding their stack.
Coverage
A useful SEO page should answer the questions behind the search, not just repeat the model name.
What this page separates
What to test first
Workflow fit
Official Data
These figures are the fastest way to see where Plus leads, where it stays close, and what deserves a real prompt test.
Three workflows reveal most of the answer quickly. Test one repo task, one long file task, and one screenshot or PDF task before you pick a default lane.


Terminal-Bench 2.0 at 61.6, SWE-bench Pro at 56.6, and QwenClawBench at 57.2 make Plus worth testing on repo and terminal tasks.
TAU3-Bench at 70.7, DeepPlanning at 41.5, MCPMark at 48.2, and WideSearch at 74.3 show that multi-step tasks are part of the release.
OmniDocBench1.5 at 91.2 and AI2D_TEST at 94.4 make the file and diagram lane relevant for daily work.
Official Figures
These figures are the fastest way to see where Plus leads, where it stays close, and what deserves a real prompt test.
| Benchmark | Score Official figure Plus | Workflow read What to test | Why it matters Rollout context |
|---|---|---|---|
Default context window Hosted flagship | 1M | Carry long files together | Plus is presented as a hosted model with a 1M context window by default, which matters for long files and mixed evidence. |
Terminal-Bench 2.0 Repo repair | 61.6 | Review diffs and fixes | The official coding table reports 61.6, ahead of Claude Opus 4.5 at 59.3 and ahead of the Qwen3.5 baseline at 52.5. |
QwenClawBench Terminal agents | 57.2 | Judge terminal execution | On the internal Claw benchmark, Plus reaches 57.2, ahead of Claude Opus 4.5 at 52.3. |
TAU3-Bench Planning and tool use | 70.7 | Test multi-step plans | The general agent table reports 70.7 for long horizon planning and tool use. |
MCPMark MCP workflows | 48.2 | Check MCP-heavy flows | For MCP heavy tool workflows, the official table reports 48.2, which leads the comparison set. |
OmniDocBench1.5 Complex documents | 91.2 | Read PDFs and policy files | For complex document understanding, Plus reaches 91.2 and leads every model shown in the official comparison. |
AI2D_TEST Diagrams and charts | 94.4 | Test diagrams and screenshots | The vision table reports 94.4 on diagram and chart reasoning, ahead of GPT-5.2 at 92.2. |
VideoMME (w. sub.) Video reasoning | 87.8 | Check video analysis | The official multimodal evaluation reports 87.8 with subtitles, which keeps Qwen competitive on video reasoning. |
ScreenSpot Pro Screen grounding | 68.2 | Use UI and screen tasks | For screen grounding, Plus reaches 68.2, ahead of the Qwen3.5 baseline at 65.6 and well ahead of Claude 4.5 Opus at 45.7. |
OSWorld-Verified Real computer tasks | 62.5 | Verify computer-use realism | On OSWorld-Verified, Plus reports 62.5, slightly behind Claude 4.5 Opus at 66.3 but still ahead of GPT-5.2. |
For code and repo work, Terminal-Bench 2.0 and QwenClawBench are the fastest read. For files and diagrams, OmniDocBench1.5 and AI2D_TEST say more than a general chatbot score.
Overview
Qwen AI Chat is useful when you want a practical read on the current Qwen release before switching a workflow. This page helps you separate Plus, Flash, core benchmarks, and the first tasks worth testing.

Official Demo
Use a diff review, small fix, or repo planning task first. That is where Plus has the clearest benchmark support.
Official Demo
Official web demos and QwenWebBench show how fast the model gets to a reviewable interface.
Official Demo
Use a long PDF, policy file, or diagram set to test whether 1M context and file reasoning help in practice.

Official Demo
Visual workflow demos make it easier to judge live screen understanding, richer media input, and tool assisted reasoning.
FAQ
These answers stay focused on rollout decisions, migration work, and what the official data really says.
Qwen AI Chat is the workspace and entry point on this site. Inside it, you can test the current Qwen lane you care about, compare prompts, and judge whether Plus or Flash fits the task.
The main April 2, 2026 launch is Qwen3.6-Plus. That is the hosted model with the 1M default context window and the broad benchmark story across code, files, tools, and visual inputs.
April 15, 2026 adds Qwen3.6-35B-A3B. It is the open model follow up and the lane that often appears in examples under the qwen3.6-flash API name.
Plus is the hosted flagship and the cleanest place to start a serious evaluation. qwen3.6-flash is the API facing name that points to the 35B-A3B lane, which is better framed as a lighter or more open deployment option.
FAQ
These answers stay focused on rollout decisions, migration work, and what the official data really says.
It is an open mixture of experts model with 35B total parameters and 3B active parameters. In practice, that means you get a more deployable lane without confusing it with the hosted Plus release.
It helps when you need one run to carry long files, research notes, product specs, screenshots, and follow up questions together. The value is less context switching and fewer manual summaries between steps.
Not exactly. Plus is positioned as the default 1M context hosted lane, while the official 35B-A3B model card says the open model supports 262,144 tokens natively and can be extended up to 1,010,000 tokens.
For code and repo work, Terminal-Bench 2.0 and QwenClawBench are the fastest read. For files and diagrams, OmniDocBench1.5 and AI2D_TEST say more than a general chatbot score.
FAQ
These answers stay focused on rollout decisions, migration work, and what the official data really says.
Turn it on when the task needs more than one reasoning step, tool use, or a long chain of follow up questions. It is usually less important for short lookups and more useful for planning, file analysis, and screen based work.
Often yes. The official docs say Qwen supports chat completions and responses APIs that follow the OpenAI style, and it also offers an Anthropic compatible interface for teams that want lower migration effort.
Run one repo or code review task, one long file task, and one screenshot or PDF task. If the model stays grounded across all three, the benchmark story is probably turning into real workflow value.
Coverage
A useful SEO page should answer the questions behind the search, not just repeat the model name.
This page separates the hosted lane from the open lane and the API name.
See release lanesIt explains what long context changes for files, research notes, and planning.
See official dataIt highlights the rows that matter most for repo repair and terminal work.
See repo workflowIt covers document parsing, screenshot reading, and diagram reasoning.
See document workflowIt includes the best official examples for interface work and rapid prototyping.
See web demoIt calls out OpenAI compatible and Anthropic compatible interfaces for easier migration.
See migration pathIt points readers back to current official access rules when trial or quota terms matter.
Open pricingTry Qwen AI Chat
Use this guide for context, then run one code task, one long file task, and one screenshot or document task before you choose a default lane.