Google Veo 3 Video Generation Hands-On: Exploring New Creative Territory in 8 Seconds
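At the 2025 Google I/O developer conference, Google released a series of eye-catching image and video generation tools. Today I want to share my experience with Veo 3 video generation, which has been hugely popular lately. I plan to try Imagen 4 and the Flow platform in more depth later and will write those up separately. First, a brief introduction to Veo 3.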

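- Veo 3 is Google's latest video generation model. According to Google, it has a stronger understanding of physics and produces smoother, more realistic motion.
- Veo 3 natively supports audio generation, including ambient sound, sound effects, and even character dialogue, which makes AI-generated video more immersive and realistic.
- The model is available to AI Pro and AI Ultra subscribers.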
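
The platform I used is Gemini, which currently supports only text-to-video: https://gemini.google.com/
Flow works as well, and it additionally supports image-to-video and first/last-frame references: https://labs.google/fx/tools/flow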

Using it on Gemini:

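Using text-to-video on Flow:
Select [Text to Video], then enter your prompt.

Flow offers the following generation modes:
- Text to video: generate a video from a descriptive text prompt.
- Image to video: generate motion from a first-frame, last-frame, or first-and-last-frame reference (as of May 25, 2025, uploading external images is supported).
- Video from combined elements: extract content and style from multiple images and combine them with a prompt to generate a video.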

Pay attention to this setting:

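When exporting from Flow, you can choose GIF or 720p. To export at 1080p, you need to run the upscaling step first and then export. Flow also offers features such as extending a video and editing it online.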
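Here is my overall approach. A clip is only 8 seconds long, and text-to-video is especially unpredictable: there is no way to lock in the overall style and subject with an image up front. The result is a bit like a gacha pull, and it falls apart easily. My plan was:
1. Pack as much content and as many constraints into the prompt as possible. The prompt includes, but is not limited to, the visual style and a story summary, plus descriptions that try to trigger the model's newest capabilities, such as voice-over and subtitles.
2. Eight seconds is short, but there is still room for change within it. Because a pure text-to-video clip is hard to continue, I want these 8 seconds to convey a feeling quickly. In the prompt I try to split the 8 seconds into 4 segments, with a scene change, an emotional escalation, or a turn every two seconds.

Note that not all of these instructions will actually be followed. This is the ideal case; in practice, getting 70% to 80% of the prompt realized already counts as a good result.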
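
The two ideas above (a prompt that bundles style, story, and audio, with the 8 seconds divided into four 2-second beats) can be sketched as a small prompt-assembly helper. This is only an illustration: the `Beat` class, the function names, and the `0-2s:` time-label layout are my own assumptions, not a documented Veo 3 prompt syntax, and the model may ignore the timing hints.

```python
from dataclasses import dataclass


@dataclass
class Beat:
    """One time slice of the clip (hypothetical helper type)."""
    start: int  # seconds from the start of the clip
    end: int    # seconds from the start of the clip
    description: str


def split_into_beats(total_seconds: int, descriptions: list[str]) -> list[Beat]:
    """Divide the clip evenly, one equal-length slice per description."""
    count = len(descriptions)
    if count == 0 or total_seconds % count != 0:
        raise ValueError("clip length must divide evenly among the beats")
    step = total_seconds // count
    return [
        Beat(i * step, (i + 1) * step, text)
        for i, text in enumerate(descriptions)
    ]


def build_prompt(style: str, story: str, beats: list[Beat], audio: str) -> str:
    """Join style, story, timed beats, and audio notes into one prompt string."""
    lines = [f"Visual style: {style}", f"Story: {story}"]
    for beat in beats:
        # The time-label format is an assumption, not an official syntax.
        lines.append(f"{beat.start}-{beat.end}s: {beat.description}")
    lines.append(f"Audio: {audio}")
    return "\n".join(lines)


beats = split_into_beats(8, [
    "A girl enters a sun-dappled grove.",
    "A colossal forest spirit appears.",
    "She offers it a ripe persimmon.",
    "Luminous petals drift down from the canopy.",
])
prompt = build_prompt(
    style="painterly 2D animation with Ukiyo-e detail",
    story="a girl befriends an ancient forest spirit",
    beats=beats,
    audio="rustling leaves, distant birds, orchestral score",
)
print(prompt)
```

The resulting text is pasted into Gemini or Flow by hand. Keeping the beats in a list makes it easy to swap one segment and regenerate when a pull goes wrong.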

Prompt: A breathtaking, painterly 2D animated continuous visual narrative, rendered with the lush, vibrant, and slightly surreal, almost dreamlike, infused with the intricate, delicate detail of traditional Japanese woodblock prints (Ukiyo-e), follows a young, adventurous, and kind-hearted girl (perhaps with bright, curious eyes and wearing simple, practical, beautifully patterned traditional Japanese farm attire) as she befriends a colossal, gentle, ancient Forest Spirit. The Spirit is a magnificent, awe-inspiring creature, its form a harmonious blend of animal and plant – perhaps with moss-covered, antler-like branches, fur like shimmering leaves that change color with its mood, and eyes like deep, tranquil forest pools. They meet in a sun-dappled, sacred grove deep within an ancient, primeval forest, where impossibly tall, gnarled trees form a living cathedral and tiny, glowing, friendly forest sprites (Kodama-like) peek from behind mossy rocks and giant, fantastical mushrooms. The girl, initially awestruck, offers the massive Spirit a small, carefully cultivated offering – perhaps a perfectly ripe persimmon or a handful of wild berries – her gesture one of pure, innocent respect and affection. The Forest Spirit responds with a slow, gentle inclination of its massive head, its leafy fur rustling like a thousand whispers, and perhaps causes a shower of magical, luminous flower petals to drift down from the canopy, or a tiny, new sapling to sprout at the girl's feet. The animation captures the incredible, detailed textures of the forest, the Spirit's majestic yet gentle presence, and the profound, unspoken emotional connection forming between the child and this ancient guardian of nature. The color palette is a rich symphony of deep forest greens, earthy browns, vibrant floral hues, and the soft, magical glow of the sprites and the Spirit's own subtle luminescence. 
This continuous, sweeping visual journey is a celebration of the profound, often mystical, bond between humanity and nature, the innocence and courage of childhood, and the power of kindness and respect to bridge even the most fantastical of divides, an affectionate, visually intoxicating ode to ecological harmony and interspecies understanding. The only implied sounds are the gentle rustling of leaves, the distant calls of unseen forest birds, the girl's soft, respectful breathing, the Spirit's deep, resonant, almost inaudible hum, and a soaring, emotionally resonant, orchestral score.
Would you use AI to generate your videos?