
Tech • IA • Crypto
OpenAI launched GPT-5.5 on April 23, positioning it as a shift toward sustained autonomous task execution. The model emphasizes real-world usefulness over raw benchmark hype, focusing on long-running workflows. It matches prior latency despite increased scale, signaling major infrastructure gains. The release intensifies competition with Claude Opus 4.7 and Gemini 3.1 Pro.
GPT-5.5 leads across multiple evaluations, including Terminal Bench 2.0 (82.7%) and GDP Val (84.9%). It also edges rivals in OSWorld Verified (78.7%), narrowly beating Claude Opus 4.7. Gains are especially strong in math and complex reasoning tasks. The results reinforce OpenAI’s focus on applied performance rather than narrow test optimization.
OpenAI Codex now operates as an autonomous coding colleague with real-time desktop control. It can see screens, click interfaces, write code, and manage workflows without constant prompts. The system shifts from reactive assistance to proactive task ownership. This marks a significant step toward fully agentic software development environments.
The latest Codex deploys multiple agents simultaneously, splitting tasks like debugging, testing, and deployment. These agents operate continuously, accelerating development cycles dramatically. Integration spans over 90 tools including Slack, Notion, Jira, and Google Drive. Persistent memory allows Codex to adapt to user habits and workflows over time.
Claude introduces modular “skills” as reusable automation units stored in Markdown. These keyword-triggered workflows function like pre-trained employees, executing consistent tasks without retraining. A shared ecosystem, including directories like skills.sh, enables discovery of vetted community tools. This lowers the barrier to building complex AI-driven pipelines.
Effective use of Claude code depends on mastering skills, commands, hooks, and file organization. Skills run automatically, while slash commands allow controlled execution of multi-step processes. Hooks add deeper automation by triggering scripts under defined conditions. File structure acts as Claude’s contextual memory, shaping how it retrieves and uses information.
Popular “second brain” setups using Obsidian fall short of true RAG (Retrieval-Augmented Generation) systems. The platform relies on keyword matching rather than semantic vector search. Without embeddings or reranking, retrieval quality is limited and prone to irrelevant context. This gap undermines claims of advanced AI knowledge systems built on simple note graphs.
Palantir sparked controversy with a 22-point manifesto warning of “technofascism” and urging alignment with U.S. interests. CEO Alex Karp framed technology as a tool to defend institutions and national security. The document drew sharp criticism across France and the UK, with calls to reconsider government contracts. The episode highlights rising tensions over AI, power, and geopolitics.