
Tech • IA • Crypto
OpenAI has unveiled GPT-5.6 in a limited partner preview, withholding a public release amid heightened scrutiny. The model introduces a three-tier system—Sol, Terra, and Luna—tailored for different performance and cost profiles. Sol Ultra reportedly uses hidden sub-agents to parallelize reasoning, hinting at more autonomous workflows. Pricing near $5 input / $30 output per million tokens and efficiency claims of 3× fewer tokens raise questions about true capability gains.
Z.AI’s GLM 5.2 is emerging as a credible challenger to leading U.S. models, particularly in cybersecurity tasks. Researchers report strong performance in vulnerability detection, placing it among the top 10 most-used models shortly after release. Its rapid adoption underscores narrowing capability gaps between Chinese and American systems. The launch has drawn attention from policymakers concerned about strategic competition.
Unlike closed systems from OpenAI or Anthropic, GLM 5.2 is released as an open-weight model. This allows unrestricted downloading, modification, and deployment, appealing to enterprises seeking autonomy. However, experts warn it could enable malicious actors to operate outside regulatory oversight. The debate highlights growing tension between openness and safety in advanced AI distribution.
Anthropic has launched Claude Design 2.0, a major overhaul focused on usability and integration. The platform introduces a prompt-first interface with templates for prototypes, slides, and documents. A new direct canvas editing feature allows manual adjustments without prompting, blending AI generation with traditional design control. The update positions Claude as a broader creative and development environment.
Claude Design 2.0 consolidates usage limits תחת shared plans like Pro and Max, replacing fragmented quotas. Activity now draws from a unified pool across Claude Code, Claude Co-Work, and design tools. This shift addresses developer frustration with restrictive caps and enables more fluid workflows. It also signals Anthropic’s push toward a tightly integrated ecosystem.
Leading researchers estimate a roughly 60% probability that recursive AI self-improvement could emerge before 2028. The concept involves systems like Claude 10 helping design successors such as Claude 11, creating compounding progress loops. Early forms already exist in AI-assisted coding, where feedback cycles close in seconds. If realized, the bottleneck shifts from human labor to compute, infrastructure, and governance.
Traditional AI benchmarks are facing growing skepticism as imperfect measures of real-world performance. Platforms like LM Arena, which rely on user preference comparisons, are gaining influence. Models such as GLM 5.2 may rank highly in these environments despite mixed benchmark results. This shift reflects a broader move toward experiential evaluation over standardized testing.
AI-generated search summaries, known as AI Overviews, are set to expand into new regions including France. These systems prioritize synthesized answers over traditional link-based results. The rollout is expected to significantly alter user behavior and web traffic patterns. It also raises concerns about information sourcing, visibility, and publisher economics.