PRICE
Per Time
INPUT
image
OUTPUT
video
Input
Output
{}Pricing Details
| Resolution | Duration | Price |
|---|---|---|
540p | 1 | $0.024 |
| 2 | $0.032 | |
| 3 | $0.04 | |
| 4 | $0.048 | |
| 5 | $0.056 | |
| 6 | $0.064 | |
| 7 | $0.072 | |
| 8 | $0.08 | |
720p | 1 | $0.032 |
| 2 | $0.04 | |
| 3 | $0.08 | |
| 4 | $0.12 | |
| 5 | $0.16 | |
| 6 | $0.2 | |
| 7 | $0.24 | |
| 8 | $0.28 | |
1080p | 1 | $0.14 |
| 2 | $0.18 | |
| 3 | $0.22 | |
| 4 | $0.26 | |
| 5 | $0.3 | |
| 6 | $0.34 | |
| 7 | $0.38 | |
| 8 | $0.42 |
Examples
If you have been looking for a way to turn static images into cinematic sequences, the Vidu API is probably already on your radar. At GPTProto, we have integrated these powerful video generation models to give you a single access point for high-end visual storytelling. Whether you are building a content automation tool or a creative suite, Vidu offers a specialized range of models that balance speed, cost, and visual fidelity. You can browse Vidu and other models in our library to find the perfect fit for your specific use case.
Vidu stands out in the crowded ai space because it does not just animate images; it understands physical motion. When you send a request to the Vidu API, the system analyzes the visual depth and movement vectors required to produce a natural result. Unlike older models that often suffer from warping or logic errors, Vidu maintains structural integrity across the entire duration of the clip. This reliability makes Vidu a top-tier choice for professional workflows where quality consistency is non-negotiable.
The versatility of the Vidu suite is another major factor. You have access to everything from the classic Viduq1 for stable camera operations to the newer Vidu 2.0 which focuses on rapid generation. This flexibility allows developers to build ai applications that serve different user tiers. You can monitor your Vidu API usage in real time through our dashboard, ensuring you always know exactly how many resources your project is consuming.
Vidu represents a significant step forward in temporal consistency for ai video. The ability to define both a start and an end frame gives developers granular control that prompt-only models simply cannot match.
When you start using the Vidu API through GPTProto, you will notice several model variants. The Viduq3-pro model is the flagship of the lineup. It is designed for creators who need the highest possible fidelity and, crucially, supports audio-video synchronization. This means you can generate video that matches a soundscape or includes dialogue, which is a massive leap for automated content creation.
On the other hand, Viduq3-turbo is built for speed. If your application needs to provide a preview or a quick draft to a user, Viduq3-turbo delivers the results faster than the pro version while maintaining much of the same structural quality. Both of these v3 models support durations up to 16 seconds and resolutions up to 1080p. If you are ready to start building, you can read the full Vidu API documentation to see the specific parameter requirements for each model type.
One of the most impressive features of Vidu is its ability to take two separate images and generate the motion between them. This is not just a simple fade; the Vidu ai engine calculates how an object would realistically move from point A to point B. To get the best results, you should ensure that the aspect ratios of your start and end frames are similar, typically between 0.8 and 1.25.
For developers looking for technical specifics, the start-to-end frame video generation documentation provides the exact requirements for image sizing and base64 encoding. By providing both frames, you eliminate the guesswork often found in text-to-video ai tools. Vidu ensures the character or object in the first frame successfully arrives at the state depicted in the second frame. This makes Vidu ideal for storyboarding or architectural visualizations.
If your priority is sheer throughput, Vidu 2.0 is the model you want. It is optimized for fast generation speeds, making it perfect for applications where users expect near-instant feedback. While it might not have all the bells and whistles of the Viduq3-pro engine, it provides a very reliable baseline for high-volume ai video tasks. You can try Vidu and other intelligent AI agents on our skills page to see the difference in generation speed for yourself.
| Model Name | Best Use Case | Max Duration | Max Resolution |
|---|---|---|---|
| Viduq3-pro | High-quality cinematic clips | 16s | 1080p |
| Viduq3-turbo | Fast, high-quality drafts | 16s | 1080p |
| Vidu 2.0 | Rapid prototyping | 8s | 1080p |
| Viduq2-pro-fast | Budget-conscious projects | 8s | 1080p |
We understand that cost management is vital for any developer using an api. That is why we provide flexible pay-as-you-go pricing for the Vidu suite. There are no monthly credits that expire at the end of the month. Instead, you only pay for the successful calls you make, whether you are using Vidu 2.0 for speed or Viduq3-pro for maximum quality.
While Vidu is great at following visual cues, the prompt parameter allows you to add specific artistic directions. If you want a specific lighting style or a particular mood, adding a detailed prompt alongside your images will guide the Vidu ai to match your vision. Additionally, using the seed parameter is essential if you want to reproduce a specific motion style across multiple generations. By keeping the Vidu seed constant, you can experiment with different prompts while keeping the underlying movement logic similar.
You can also find more tips on how to optimize your prompts by checking out the GPTProto tech blog, where we frequently post tutorials on getting the most out of video generation models. Using Vidu effectively requires a balance of clear visual input and descriptive text, and our guides are here to help you master that balance.
For large-scale projects that are not time-sensitive, the Vidu off_peak mode is a fantastic way to save on costs. When you enable this mode, tasks are processed during lower-traffic periods and can take up to 48 hours to complete, but they consume significantly fewer points. This is a great feature of the Vidu API for batch processing video content for archives or long-term marketing campaigns. To stay updated on task status without constant polling, always configure a callback_url. The Vidu system will push the final video link to your server the moment it is ready, streamlining your entire ai workflow.

Discover how businesses are using Vidu to transform their creative workflows.
Challenge: A retailer needed to turn thousands of static product photos into engaging 5-second video clips for social media. Solution: They used the Vidu API to automate start-to-end frame transitions, taking a front-on photo as the start and a side-profile as the end. Result: Vidu generated smooth, rotating views of products, increasing ad engagement by 45% while reducing video production costs significantly.
Challenge: A production studio needed to visualize scene transitions quickly during the writing phase without hiring an animation team. Solution: The writers used Viduq3-pro to animate their keyframe sketches. Result: The Vidu models provided a cinematic sense of timing and motion, allowing the team to iterate on scene logic in hours rather than weeks.
Challenge: A news agency wanted to create short, visual summaries of news stories using only available press photos. Solution: By integrating Vidu 2.0 into their CMS, editors could select two photos and generate a 'motion news' clip instantly. Result: The agency saw a 60% increase in social shares for their news briefs, powered by the reliable and fast Vidu generation engine.
Follow these simple steps to set up your account, get credits, and start sending API requests to viduq2 turbo via GPT Proto.

Sign up

Top up

Generate your API key

Make your first API call

Stop relying on text prompts alone. The vidu q1 reference-to-video feature gives you absolute control over character consistency. Read the full review.

Discover how Vidu Q3 is revolutionizing the AI video industry by offering superior character consistency, native audio-visual synchronization, and professional-grade 16-second clips for creators worldwide.

Create cinematic AI videos with Vidu Q2's natural expressions and smooth camera work. See how it compares to Sora 2 and turn images into video instantly.

Explore runway gen-4 performance, from character consistency to physics bugs. See how it compares to Sora and master your prompting strategy today.
User Reviews & Technical Feedback