Alibaba’s Qwen team has announced the launch of QwQ-32B, an AI model with 32 billion parameters. This new model showcases performance that rivals the significantly larger DeepSeek-R1, which has 671 billion parameters (with 37 billion activated). The introduction of QwQ-32B emphasizes the potential of scaling Reinforcement Learning (RL) within robust foundation models. The innovative model […]