ENFR
8news

Tech • IA • Crypto

BriefingToday's VideosVideo briefingsTopicsToday's Top 50Daily Summaries

GPT-5.5 Launch, OpenAI Codex Agents, Palantir Manifesto Fallout

AISaturday, April 25, 2026· 6 videos

Briefing

GPT-5.5 debuts with autonomous gains

OpenAI launched GPT-5.5 on April 23, positioning it as a shift toward sustained autonomous task execution. The model emphasizes real-world usefulness over raw benchmark hype, focusing on long-running workflows. It matches prior latency despite increased scale, signaling major infrastructure gains. The release intensifies competition with Claude Opus 4.7 and Gemini 3.1 Pro.

GPT-5.5 posts benchmark dominance

GPT-5.5 leads across multiple evaluations, including Terminal Bench 2.0 (82.7%) and GDP Val (84.9%). It also edges rivals in OSWorld Verified (78.7%), narrowly beating Claude Opus 4.7. Gains are especially strong in math and complex reasoning tasks. The results reinforce OpenAI’s focus on applied performance rather than narrow test optimization.

Codex evolves into autonomous agent system

OpenAI Codex now operates as an autonomous coding colleague with real-time desktop control. It can see screens, click interfaces, write code, and manage workflows without constant prompts. The system shifts from reactive assistance to proactive task ownership. This marks a significant step toward fully agentic software development environments.

Codex runs parallel multi-agent workflows

The latest Codex deploys multiple agents simultaneously, splitting tasks like debugging, testing, and deployment. These agents operate continuously, accelerating development cycles dramatically. Integration spans over 90 tools including Slack, Notion, Jira, and Google Drive. Persistent memory allows Codex to adapt to user habits and workflows over time.

Claude skills enable reusable automation

Claude introduces modular “skills” as reusable automation units stored in Markdown. These keyword-triggered workflows function like pre-trained employees, executing consistent tasks without retraining. A shared ecosystem, including directories like skills.sh, enables discovery of vetted community tools. This lowers the barrier to building complex AI-driven pipelines.

Claude workflows hinge on structure

Effective use of Claude code depends on mastering skills, commands, hooks, and file organization. Skills run automatically, while slash commands allow controlled execution of multi-step processes. Hooks add deeper automation by triggering scripts under defined conditions. File structure acts as Claude’s contextual memory, shaping how it retrieves and uses information.

Obsidian hype exposes RAG limitations

Popular “second brain” setups using Obsidian fall short of true RAG (Retrieval-Augmented Generation) systems. The platform relies on keyword matching rather than semantic vector search. Without embeddings or reranking, retrieval quality is limited and prone to irrelevant context. This gap undermines claims of advanced AI knowledge systems built on simple note graphs.

Palantir manifesto triggers political backlash

Palantir sparked controversy with a 22-point manifesto warning of “technofascism” and urging alignment with U.S. interests. CEO Alex Karp framed technology as a tool to defend institutions and national security. The document drew sharp criticism across France and the UK, with calls to reconsider government contracts. The episode highlights rising tensions over AI, power, and geopolitics.

Videos covered

Previous briefings · AI