
Tech • IA • Crypto
OpenAI has launched real-time voice AI models alongside a new supercomputing network system, highlighting rapid product advances amid a still-uncertain impact on jobs.
OpenAI introduced three developer-facing systems: GPT Realtime 2, GPT Realtime Translate, and GPT Realtime Whisper. The goal is to make voice assistants behave less like scripted menus and more like responsive agents capable of understanding intent and completing tasks during live conversations. These models are designed to handle interruptions, corrections, and complex multi-step requests in real time.
GPT Realtime 2 integrates near GPT-5–level reasoning into spoken interactions. It can call multiple tools simultaneously, track long conversational context, and respond while processing background actions. The context window has expanded to 128,000 tokens, enabling longer, more coherent exchanges across use cases such as customer support, tutoring, and medical workflows.
Benchmarks show major improvements, with accuracy rising to 96.6% on Big Bench Audio, up from 81.4% in prior versions. Developers can tune reasoning intensity across five levels, balancing speed and depth. The system also adapts tone—such as calm or upbeat—and handles specialized terminology more effectively.
GPT Realtime Translate supports over 70 input languages and produces output in 13 languages, enabling near-instant multilingual conversations. Early testing includes Deutsche Telekom using it for customer support. Meanwhile, GPT Realtime Whisper provides live transcription, powering captions, summaries, and automated workflows during meetings and events.
OpenAI outlined three emerging patterns: “voice-to-action,” where spoken commands trigger real-world tasks; “systems-to-voice,” where software communicates live updates; and “voice-to-voice,” enabling seamless multilingual communication. Pricing has been disclosed, with translation at $0.034 per minute and transcription at $0.017 per minute, signaling readiness for commercial deployment.
Behind these models is a new networking protocol, Multi-Path Reliable Connection (MRC), designed for massive AI supercomputers. Developed with partners including Microsoft, Nvidia, AMD, and Intel, MRC optimizes how thousands of GPUs exchange data, reducing bottlenecks and preventing costly slowdowns during training.
MRC distributes data across multiple paths instead of relying on single routes, improving throughput and reliability. It can reroute around failures in microseconds and maintain operations even during hardware disruptions. The system enables clusters of up to 131,000 GPUs with fewer switches, cutting infrastructure costs while improving speed.
Despite rapid technological progress, evidence of widespread job displacement is mixed. Surveys from the National Bureau of Economic Research show nearly 90% of executives reporting no workforce impact so far, and labor data shows limited macroeconomic change through early 2026. However, projections vary widely, with some leaders warning of significant reductions in entry-level roles.
Research indicates a 13% decline in employment for early-career workers in AI-exposed roles, while experienced workers remain stable. At the same time, companies have cited AI in layoffs, though some executives acknowledge this may mask broader economic pressures rather than direct automation effects.
OpenAI’s latest releases highlight both the rapid evolution of real-time AI capabilities and the growing importance of underlying infrastructure, while the broader economic impact on jobs remains uncertain and uneven.