More transparency and control over Gemini API costs


Usage Tiers: Less friction and more transparency as you scale

We’ve completely revamped our Usage Tiers to get you higher capacity faster. While we rely on these tiers to manage aggregate load and help to ensure equitable API access, your progression through them is now automated and transparent. Here is what’s changing:

  • Lower spend qualifications: To make it easier for users with a strong payment history to get higher quotas, we are also reducing the spend qualifications for higher tiers.
  • Automatic and faster upgrades: The system now automatically upgrades you to the next tier as your usage grows and your payment history matures. You get access to higher rate limits and increased monthly quota as soon as the criteria is met.
  • Billing account tier cap: Each Usage Tier will now have a maximum monthly spend limit ($) enforced across your entire billing account (similar to other platforms in the industry). This system-defined cap automatically increases as you graduate to higher tiers, and operates independently of the custom Project Spend Caps you set yourself.

You can see the usage tier limits along with the new criteria in our docs and discover how different tiers impact your rate limit metrics directly within Google AI Studio.

Improved billing flow with enhanced observability and control

Over the past few months, we’ve launched a suite of updates in Google AI Studio to improve our billing experience, observability and cost management, with the goal to give developers an easier and more transparent experience with our paid services. Here’s what’s new:

  • New billing setup directly in Google AI Studio: You can now configure your billing profile and link it to your projects right from the settings, ensuring you can scale your application more seamlessly as your needs grow. No more jumping between 3 different windows and tabs.
  • New rate limit dashboard: The dashboard gives you a clear view of your progress towards rate limits for every project imported into Google AI Studio. You can monitor usage against three key metrics: Requests Per Minute (RPM), Tokens Per Minute (TPM) and Requests Per Day (RPD), view and filter graphs for these metrics to identify traffic spikes and explore rate limits across different models.
  • New cost dashboard: To help you manage your budget, we also launched a Daily Cost Breakdown Graph within the Billing Dashboard. This tool provides a transparent view of your spend, allowing you to track costs per project over different time frames — from the last 7 days to the entire month, and filter by model.
  • New usage dashboard: An expanded, comprehensive view of your system’s performance. Beyond standard request counts, you can now dive into error metrics, token usage and specific generation stats. We’ve also added dedicated graphs for Imagen and Veo requests per day, in addition to tools like Grounding with Google Search and Maps.

We hope these updates help you build more confidently with the Gemini API, and we will continue to make improvements to provide a more reliable and transparent service.



Source link

Share:

Leave a Reply

3 latest news
News Archives
On Key

Related Posts

5 years of impact in Europe

Over the last five years, we’ve invested more than $150 million to help people learn digital skills. We’ve worked with 70 organizations across 41 European

Introducing AI Works for Europe

Maria Teresa Pellegrino knows the value of innovation. The 61-year-old from Andria, Italy, has worked in olive oil production for most of her life. As

Solverwp- WordPress Theme and Plugin