Qwen3.6. This is it.
A developer watched the 35B parameter model code, test, and fix bugs in real-time using MCP screenshots.
A developer's viral demonstration shows Alibaba's Qwen3.6-35B-A3B model autonomously coding, testing, and debugging a complete tower defense game. The user gave a single instruction, and the AI agent proceeded to write the code, utilize the Model Context Protocol (MCP) to take screenshots of the installed application for verification, and independently test features like upgrades. Crucially, the model demonstrated meta-cognitive reasoning by identifying its own bugs—noting when the game canvas failed to render and spotting an error in wave completion logic—and then implementing fixes. This was run using the llama.cpp server with a massive 120,384 token context window and specific quantization (Q6_K_XL), highlighting the technical infrastructure enabling such complex, long-horizon tasks.
The event signals a significant shift from AI as a coding assistant to a proactive, executing agent. The model's ability to follow a high-level goal, break it into sub-tasks (coding, screenshot verification, testing, debugging), and course-correct based on observed outcomes mirrors a software engineer's workflow. The developer's shock, noting "I can't imagine what the Qwen Coder that's following will be able to do," underscores the perceived leap in capability. This demonstration was performed with a 35-billion-parameter model, suggesting that highly capable agentic AI is becoming more accessible and efficient, moving beyond pure chat interfaces into actionable, tool-using systems.
- The Qwen3.6-35B-A3B model built a functional tower defense game from a single prompt, showcasing agentic workflow execution.
- It used MCP screenshots for visual verification and autonomously identified and fixed bugs in its own code, including canvas rendering and game logic.
- The demo ran on llama.cpp with a 120K context window, indicating powerful, locally executable AI coding agents are now a reality.
Why It Matters
This moves AI from a coding copilot to an autonomous engineer, potentially automating significant portions of software development and testing cycles.