Samsung has introduced TRUEBench, a new system that addresses the limitations of existing benchmarks for evaluating the real-world productivity of enterprise AI models. Developed by Samsung Research, TRUEBench aims to bridge the gap between theoretical AI performance and its practical utility in workplace settings. As organizations increasingly adopt large language models (LLMs) […]
Introducing Deep Cogito v2: The Open-Source AI Revolutionizing Reasoning Skills
Deep Cogito has unveiled Cogito v2, a series of open-source AI models designed to enhance their own reasoning capabilities. This latest family of models includes four hybrid reasoning versions, with two mid-range models featuring 70B and 109B parameters, and two larger models with 405B and 671B parameters. The standout in this lineup is the 671B […]
Tencent Enhances Creative AI Model Testing with Innovative New Benchmark
Tencent has launched a new benchmark called ArtifactsBench to improve the evaluation of creative AI models. Traditionally, AI models have been assessed on their ability to generate functionally correct code, an approach that often misses the crucial aspects of visual appeal and user experience. For instance, AI might produce a website or a chart that technically works, […]
Anthropic Claude 4: Ushering in a New Era of Intelligent Agents and AI Coding
Anthropic has introduced its latest models in the Claude 4 family, which promise significant advancements for developers and AI assistant builders. The highlights are Claude Opus 4, hailed as the most powerful coding model to date, and Claude Sonnet 4, which aims to be an agile all-rounder. Anthropic positions Claude Opus 4 as an innovative […]
Exploring GPT-4o: Delivering Human-like AI Interaction Across Text, Audio, and Vision
Ryan Daws is a senior editor at TechForge Media with over a decade of experience in tech journalism. He identifies the latest technological trends, dissects complex topics, and writes compelling narratives around cutting-edge developments. His articles and interviews with leading industry figures have gained him recognition as a key influencer by […]
Unveiling the Reasons: Why Most AI Benchmarks Offer Limited Information
On Tuesday, startup Anthropic released a family of generative AI models that it claims achieve best-in-class performance. Just a few days later, rival Inflection AI unveiled a model that it asserts comes close to matching some of the most capable models out there, including OpenAI’s GPT-4, in quality. Anthropic and Inflection are by no means […]






