🖼️ Native vision-language design with early multimodal fusion trained on massive text-image-video data
🌍 Expanded multilingual coverage to 201 languages & dialects
🤖 Demonstrates visual agent workflows across mobile & desktop interfaces beyond passive chat
📊 First open-weight release: Qwen3.5-397B-A17B ultra-sparse MoE model
🔬 Large-scale reinforcement learning across multi-agent environments for real-world adaptability