Skip to content Skip to sidebar Skip to footer

OpenAI

Introducing DiffusionGemma

Why diffusion for text? While the AI research community has explored diffusion-based text generation for years, applying it to large models has remained a challenge. DiffusionGemma changes this by shifting how models use hardware. The trade-off with traditional models Most language models act like a typewriter, generating one token at a time from left to…

Read More

New AI Tools for the Future of Science

For centuries, the scientific method has been the greatest engine of human progress. At Google, our mission is deeply rooted in building tools to accelerate it. We believe that a new era of discovery won’t come from narrow, specialized models, but general agents that empower researchers across every scientific field. That’s why we are introducing…

Read More

Introducing Gemini Omni

Last year, Nano Banana brought Gemini's intelligence to image generation and editing. Since then, it’s helped millions of people restore old photos, design from sketches and visualize ideas in ways that weren’t possible before. From the start we built Gemini to be natively multimodal from the ground up, and now we’re taking the next step.…

Read More

We’re launching the Google DeepMind Accelerator program in Asia Pacific to tackle environmental risks.

The Asia-Pacific region is a global engine for economic growth, but it's also highly vulnerable to climate change. While green technologies are gaining momentum, a recent report shows they aren’t scaling fast enough to keep up with the region’s rising environmental risks. To help innovators tackle these environmental challenges, we’re launching an inaugural Google DeepMind…

Read More

Gemini 3.1 Flash TTS: New text-to-speech AI model

Today, we’re introducing Gemini 3.1 Flash TTS, the latest text-to-speech model that delivers improved controllability, expressivity and quality — empowering developers, enterprises and everyday users to build the next generation of AI-speech applications. Starting today, 3.1 Flash TTS is rolling out: Improved speech quality and controllability We’ve improved the overall speech quality of Gemini 3.1…

Read More

New video generation model updates

Today, Veo is getting more expressive, with improvements that help you create more fun, creative, high-quality videos based on ingredient images, built directly for the mobile format. We’re excited to bring new creative possibilities for everyone from casual storytellers to professional filmmakers. We’re releasing: Improvements to Veo 3.1 Ingredients to Video, our capability that lets…

Read More

FACTS Benchmark Suite: a new way to systematically evaluate LLMs factuality

Large language models (LLMs) are increasingly becoming a primary source for information delivery across diverse use cases, so it’s important that their responses are factually accurate. In order to continue improving their performance on this industry-wide challenge, we have to better understand the types of use cases where models struggle to provide an accurate response…

Read More