Audio streaming, image, and video model cards arranged on a developer migration calendar
Audio streaming, image, and video model cards arranged on a developer migration calendar
+ Google News

Gemini API adds TTS streaming as media model shutdown dates arrive

Google's Gemini API changelog added streaming speech generation for a preview TTS model and set near-term shutdown dates for older Imagen and Veo model IDs.

13 minutes ago

Google’s Gemini API release notes added streaming support for speech generation on June 17, 2026, and set fresh shutdown dates for older image and video generation models on June 15. The changelog is a small post, but it is the kind of update that can break production workflows if teams treat model IDs as permanent.

The new capability is for gemini-3.1-flash-tts-preview: Google says streaming through streamGenerateContent and stream: true in the Interactions API is now supported. The migration work is on the media side. Google says Imagen 4 and Gemini 3 Image model IDs will shut down on August 17, 2026, and several Veo 2.0 and Veo 3.0 model IDs will shut down on June 30, 2026.

The dates are the story

This is not a flashy model launch. It is a calendar item for builders. If your product calls the older Veo IDs, June 30 is the deadline. If it calls the listed Imagen 4 or Gemini 3 Image IDs, August 17 is the deadline.

ChangeDate in Google’s changelogAffected surface
TTS streaming addedJune 17, 2026gemini-3.1-flash-tts-preview
Veo model shutdown announcedJune 15, 2026veo-2.0-generate-001, veo-3.0-generate-001, veo-3.0-fast-generate-001
Imagen/Gemini image shutdown announcedJune 15, 2026imagen-4.0-generate-001, imagen-4.0-ultra-generate-001, imagen-4.0-fast-generate-001

Google’s suggested path for the Veo shutdown is to move to Veo 3.1 preview IDs or the 3.1 GA models available through the Gemini Enterprise Agent Platform. For image generation, Google points developers to the Gemini deprecations page and newer stable or preview endpoints.

Streaming speech changes product feel

TTS streaming matters because speech interfaces are latency-sensitive. A non-streaming voice response can feel slow even when the final audio is high quality. Streaming lets an application begin playback while generation continues.

The changelog does not turn this into a broad voice platform announcement. It names one preview model and two API surfaces. That is enough to matter for developers building assistants, language-learning tools, customer support, or agent interfaces where spoken responses need to arrive quickly.

The preview label is also important. Teams should test quality, latency, fallback behavior, and API shape before treating it as a stable dependency.

Media models are moving faster than integrations

The deprecation pattern is now normal across AI APIs: a model launches, improves, moves from preview to GA or gets replaced, and old model IDs receive shutdown dates. The risk is that applications bind too tightly to one ID and do not keep a migration lane open.

For builders, the basic checklist is short:

Gemini media migration check

  1. Inventory model IDs. Search production code, prompt configs, worker jobs, and vendor settings for the affected Imagen and Veo IDs.
  2. Map replacements. Test the newer image endpoints and the Veo 3.1 preview or GA path that fits your account and product.
  3. Compare outputs. Re-run representative prompts and human-review outputs before switching traffic.
  4. Add monitoring. Track generation failures, latency, content-policy changes, and cost after the migration.

The point is not only to avoid a shutdown. New media models can change style, motion, prompt sensitivity, safety filters, and output timing. A migration that compiles can still change the product.

What to watch next

The next checkpoint is whether Google moves gemini-3.1-flash-tts-preview toward general availability and whether Veo 3.1 becomes the default developer path outside the enterprise platform. The second checkpoint is how much notice future media-model deprecations give builders. Shorter windows make model abstraction and regression testing more important.

For readers tracking the model and company layers behind Gemini, see our AI model leaderboard and Google company tracker.

Sources

The AI Feed Desk

The AI Feed Desk

Editorial desk

The AI Feed Desk tracks AI provider updates, model releases, agent tooling, and enterprise adoption, turning fast-moving announcements into source-linked context for builders and operators.

Noticed a typo, incorrect information, or translation error?

Tell us so we can fix it.

Help Improve This Article

Related Articles

Gemini 3.5 Flash beats last year's Pro on the work builders ship

Google's Gemini 3.5 Flash beats last year's 3.1 Pro on coding and agentic benchmarks at ~40% lower cost — with reasoning and 1M-context limits worth testing.

The AI Feed Desk

By The AI Feed Desk

Google rolls out Gemini Omni Flash for video generation

Gemini Omni Flash turns mixed inputs into video and is rolling into Gemini, Flow, YouTube Shorts, and YouTube Create before the API arrives.

The AI Feed Desk

By The AI Feed Desk

Google releases DiffusionGemma for faster local text generation

Google's DiffusionGemma is an experimental open text-diffusion model that generates blocks of text in parallel for lower-latency local workflows.

The AI Feed Desk

By The AI Feed Desk

Google brings Gemini models to Apple developers

Google says Apple developers can call Gemini models through Apple's Foundation Models framework and use Gemini inside Xcode.

The AI Feed Desk

By The AI Feed Desk

Google releases Gemma 4 12B for local multimodal agents

Google's Gemma 4 12B is a 12B-parameter open model for local multimodal work, with 16GB memory guidance, native audio inputs, and a 256K-token context window.

The AI Feed Desk

By The AI Feed Desk