ENFR

Tech • IA • Crypto

Today Topics Videos Crypto Archives Favorites

Google’s New Omni and Spark Just Changed AI Forever

10/10

AIAI RevolutionMay 21, 2026 at 12:31 AM18:12

Audio player

0:00 / 0:00

TL;DR

Google I/O 2026 showcased rapid AI scaling, new Gemini models, and a shift toward autonomous agents embedded across products and infrastructure.

KEY POINTS

Explosive AI usage growth

Google reported processing over 3.2 quadrillion tokens per month, up from 480 trillion a year earlier and 9.7 trillion two years ago. The Gemini app surpassed 900 million monthly users, more than doubling year over year, while AI-powered search features reached billions of users. This signals a transition from experimental AI to global, everyday infrastructure.

Gemini 3.5 Flash challenges top models

The newly introduced Gemini 3.5 Flash outperformed prior flagship models on multiple benchmarks, including 76.2% on Terminal Bench 2.1 and 1,656 ELO on GDP Val AA. It competes with systems like GPT-5.5 and Claude Opus 4.7, while delivering speeds near 280 tokens per second, roughly four times faster than rivals. Google positioned it as both high-performance and cost-efficient.

Major cost reductions for enterprises

Google stated that Flash delivers similar capabilities at less than half the price of competing frontier models. Large-scale users shifting workloads could save over $1 billion annually, highlighting a growing emphasis on efficiency as AI adoption scales.

Introduction of Gemini Omni world model

Gemini Omni represents a step toward artificial general intelligence, combining text, audio, image, and video understanding in a single system. Unlike traditional generators, it models physical consistency, enabling realistic outputs such as accurate protein folding animations and synchronized audiovisual scenes.

Advanced video editing and generation

Omni enables iterative, conversation-driven editing where scenes retain continuity, physics, and character consistency. Demonstrations included transforming objects, modifying environments, and generating structured multimedia sequences with coherent audio and visuals.

AI safety and watermarking expansion

All generated content includes SynthID watermarking, now applied to over 100 billion images and videos and 60,000 years of audio. Adoption by companies like OpenAI, Nvidia, and ElevenLabs signals movement toward an industry-wide transparency standard.

Next-generation TPU infrastructure

Google unveiled eighth-generation TPUs, including TPU8T for training and TPU8 for inference. Training can now scale across over one million TPUs, reducing model development timelines from months to weeks. Efficiency improved to 2x performance per watt, alongside major latency gains.

Massive capital investment

Annual capital expenditure is projected at $180–190 billion, up from $31 billion in 2022, underscoring the scale of infrastructure required to sustain AI growth.

Rise of autonomous agent platforms

The Antigravity 2.0 platform expands into a full ecosystem for building and orchestrating AI agents. Combined with Gemini 3.5, agents can execute complex workflows, automate development tasks, and operate across environments with minimal setup via APIs and SDKs.

Developer ecosystem upgrades

Google AI Studio now supports full-stack app development, Kotlin integration, and direct deployment. Tools like Android migration agents can convert apps across platforms in hours, while WebMCP aims to standardize how web agents interact with online tools.

Gemini Spark and personal AI agents

Gemini Spark introduces persistent, cloud-based agents that operate continuously, managing tasks like scheduling, research, and communication. It integrates with Google services and third-party tools, reflecting a broader shift toward always-on digital assistants.

AI integrated across consumer products

New features include Docs Live for voice-driven document creation, Ask YouTube for context-aware video navigation, Daily Brief for personalized summaries, and enhanced Google Maps interactions. Search is evolving into a dynamic, task-oriented interface with interactive outputs.

New creative and hardware initiatives

Tools like Google Pix enable object-level image editing, while AI-powered glasses—developed with Warby Parker and Gentle Monster—bring real-time assistance, translation, and media capture into wearable devices.

CONCLUSION

Google’s announcements highlight a decisive shift toward fast, cost-efficient models and autonomous agents embedded across products, signaling an industry-wide move from passive AI tools to systems that actively execute tasks.

Full transcript

More from AI