OpenAI’s fastest model, GPT-4o mini is now available on Azure AI | Microsoft Azure Blog (2024)

GPT-4o mini, announced by OpenAI today, is available simultaneously on Azure AI, supporting text processing capabilities with excellent speed and with image, audio, and video coming later.

We are also announcing safety features by default for GPT-4o mini, expanded data residency and service availability, plus performance upgrades to Microsoft Azure OpenAI Service.

GPT-4o mini allows customers to deliver stunning applications at a lower cost with blazing speed. GPT-4o mini is significantly smarter than GPT-3.5 Turbo—scoring 82% on Measuring Massive Multitask Language Understanding (MMLU) compared to 70%—and is more than 60% cheaper.1 The model delivers an expanded 128K context window and integrates the improved multilingual capabilities of GPT-4o, bringing greater quality to languages from around the world.

GPT-4o mini, announced by OpenAI today, is available simultaneously on Azure AI, supporting text processing capabilities with excellent speed and with image, audio, and video coming later. Try it at no cost in the Azure OpenAI Studio Playground.

We’re most excited about the new customer experiences that can be enhanced with GPT-4o mini, particularly streaming scenarios such as assistants, code interpreter, and retrieval which will benefit from this model’s capabilities. For instance, we observed the incredible speed while testing GPT-4o mini on GitHub Copilot, an AI pair programmer that assists you by delivering code completion suggestions in the tiny pauses between keystrokes, rapidly updating recommendations with each new character typed.

We are also announcing updates to Azure OpenAI Service, including extending safety by default for GPT-4o mini, expanded data residency, and worldwide pay-as-you-go availability, plus performance upgrades.

Azure AI brings safety by default to GPT-4o mini

Safety continues to be paramount to the productive use and trust that we and our customers expect.

We’re pleased to confirm that our Azure AI Content Safety features—including prompt shields and protected material detection— are now ‘on by default’ for you to use with GPT-4o mini on Azure OpenAI Service.

We have invested in improving the throughput and speed of the Azure AI Content Safety capabilities—including the introduction of an asynchronous filter—so you can maximize the advancements in model speed while not compromising safety. Azure AI Content Safety is already supporting developers across industries to safeguard their generative AI applications, including game development (Unity), tax filing (), and education (South Australia Department for Education).

In addition, our Customer Copyright Commitment will apply to GPT-4o mini, giving peace of mind that Microsoft will defend customers against third-party intellectual property claims for output content.

Azure AI now offers data residency for all 27 regions

From day one, Azure OpenAI Service has been covered by Azure’s data residency commitments.

Azure AI gives customers both flexibility and control over where their data is stored and where their data is processed, offering a complete data residency solution that helps customers meet their unique compliance requirements. We also provide choice over the hosting structure that meets business, application, and compliance requirements. Regional pay-as-you-go and Provisioned Throughput Units (PTUs) offer control over both data processing and data storage.

We’re excited to share that Azure OpenAI Service is now available in 27 regions including Spain, which launched earlier this month as our ninth region in Europe.

Azure AI announces global pay-as-you-go with the highest throughput limits for GPT-4o mini

GPT-4o mini is now available using our global pay-as-you-go deployment at 15 cents per million input tokens and 60 cents per million output tokens, which is significantly cheaper than previous frontier models.

We are pleased to announce that the global pay-as-you-go deployment option is generally available this month, allowing customers to pay for the resources they consume, making it flexible for variable workloads, while traffic is routed globally to provide higher throughput, and still offering control over where data resides at rest.

Additionally, we recognize that one of the challenges customers face with new models is not being able to upgrade between model versions in the same region as their existing deployments. Now, with global pay-as-you-go deployments, customers will be able to upgrade from existing models to the latest models.

Global pay-as-you-go offers customers the highest possible scale, offering 15M tokens per minute (TPM) throughput for GPT-4o mini and 30M TPM throughput for GPT-4o. Azure OpenAI Service offers GPT-4o mini with 99.99% availability and the same industry leading speed as our partner OpenAI.

Azure AI offers leading performance and flexibility for GPT-4o mini

Azure AI is continuing to invest in driving efficiencies for AI workloads across Azure OpenAI Service.

GPT-4o mini comes to Azure AI with availability on our Batch service this month. Batch delivers high throughput jobs with a 24-hour turnaround at a 50% discount rate by using off-peak capacity. This is only possible because Microsoft runs on Azure AI, which allows us to make off-peak capacity available to customers.

We are also releasing fine-tuning for GPT-4o mini this month which allows customers to further customize the model for your specific use case and scenario to deliver exceptional value and quality at unprecedented speeds. Following our update last month to switch to token based billing for training, we’ve reduced the hosting charges by up to 43%. Paired with our low price for inferencing, this makes Azure OpenAI Service fine-tuned deployments the most cost-effective offering for customers with production workloads.

With more than 53,000 customers turning to Azure AI to deliver breakthrough experiences at impressive scale, we’re excited to see the innovation from companies like Vodafone (customer agent solution), the University of Sydney (AI assistants), and GigXR (AI virtual patients). More than 50% of the Fortune 500 are building their applications with Azure OpenAI Service.

We can’t wait to see what our customers do with GPT-4o mini on Azure AI!

1GPT-4o mini: advancing cost-efficient intelligence | OpenAI

OpenAI’s fastest model, GPT-4o mini is now available on Azure AI | Microsoft Azure Blog (2024)

FAQs

Is GPT-4o available in Azure? ›

GPT-4o mini model available for deployment

GPT-4o mini is the latest Azure OpenAI model first announced on July 18, 2024: "GPT-4o mini allows customers to deliver stunning applications at a lower cost with blazing speed.

Is GPT-4o available now? ›

GPT-4o mini is now available using our global pay-as-you-go deployment at 15 cents per million input tokens and 60 cents per million output tokens, which is significantly cheaper than previous frontier models.

Is OpenAI available on Azure? ›

Azure OpenAI Service offers pricing based on both Pay-As-You-Go and Provisioned Throughput Units (PTUs). Pay-As-You-Go allows you to pay for the resources you consume, making it flexible for variable workloads.

What Azure OpenAI base model can you deploy to access the capabilities of ChatGPT? ›

GPT-3.5. GPT-3.5 models can understand and generate natural language or code. The most capable and cost effective model in the GPT-3.5 family is GPT-3.5 Turbo, which has been optimized for chat and works well for traditional completions tasks as well.

How can I access GPT 40? ›

After you have made a successful payment of $5 or more (usage tier 1), you'll be able to access the GPT-4, GPT-4 Turbo, GPT-4o models via the OpenAI API. Learn more about adding credit to your OpenAI account.

When GPT-4 API will be available? ›

After your first payment, they will make GPT-4 available to you. On July 6, 2023, we gave access to the GPT-4 API (8k) to all API users who have made a successful payment of $1 or more. If you have a newer account, simply purchase $0.50 of credits.

Is GPT-4o better than GPT-4? ›

Performance and efficiency

GPT-4o is also designed to be quicker and more computationally efficient than GPT-4 across the board, not just for multimodal queries. According to OpenAI, GPT-4o is twice as fast as the most recent version of GPT-4.

Is ChatGPT 4o available for free? ›

Yes ! It's free but it's not working all the tools for everyone in free version.

What will GPT-4 cost? ›

gpt-4 models cost $30.00 per 1M input tokens and $60.00 per 1M output tokens. gpt-4-1106-vision-preview (with GPT_VISION) costs $10.00 per 1M input tokens and $30.00 per 1M output tokens. text-embedding-ada-002 (with GPT_MATCH) costs $0.10 for 1M input tokens.

Is Azure AI better than ChatGPT? ›

Which platform is better for natural language processing, Microsoft Azure OpenAI or ChatGPT? Microsoft Azure is great at natural language processing because it has robust tools and services for language understanding, like Azure Cognitive Services and Language Understanding Intelligence Service (LUIS).

What is the difference between OpenAI and Azure OpenAI? ›

OpenAI uses the model keyword argument to specify what model to use. Azure OpenAI has the concept of unique model deployments.

What does GPT-4o do? ›

GPT-4o (“o” for “omni”) is a step towards much more natural human-computer interaction—it accepts as input any combination of text, audio, image, and video and generates any combination of text, audio, and image outputs.

How to use chat gpt-4 for free? ›

Now let's go ahead and learn how to use Bing to access ChatGPT 4 freely.
  1. Head over to bing.com/new (visit) and click on “Chat now”.
  2. Now, switch to the “Creative” mode and ask your questions. ...
  3. You can also install the Bing app (Android / iOS — Free) on your smartphone and enable the “GPT-4” toggle.
Jun 13, 2024

How to get ChatGPT 4? ›

If you are a ChatGPT Plus subscriber, you will have access to GPT-4 via the ChatGPT website. Under the model selector in top left corner of ChatGPT, select "GPT-4" to try it out.

What language is GPT-4 support? ›

Talking about the language, the GPT-4o has improved language capabilities to deliver better quality responses at a faster pace. ChatGPT also now supports more than 50 languages, including Hindi, Bengali, Tamil, Bengali among other Indian languages.

Is GPT-4 Microsoft? ›

Announcing the General Availability of GPT-4 Turbo with Vision on Azure OpenAI Service | Azure updates | Microsoft Azure.

Where can you use GPT-4? ›

How to Use ChatGPT-4 For Free?
  • Microsoft Copilot. The easiest and fastest way to use GPT-4 without paying a subscription is through Microsoft Copilot. ...
  • Perplexity.ai. Perplexity is an AI-based search engine that leverages GPT-4 for a more comprehensive and smarter search experience. ...
  • Poe.com. ...
  • Merlin Chrome Extension.
Feb 16, 2024

Is ChatGPT built on Azure? ›

ChatGPT is now available in Azure OpenAI Service | Microsoft Azure Blog.

Does GPT Plus give access to GPT-4? ›

The core service you pay for with ChatGPT Plus is access to GPT-4. You're not buying an unlimited number of prompts, though. Subscribers are currently limited to inputting 40 prompts to GPT-4 every three hours. After hitting the prompt limit, users can always switch to the GPT-3.5 version.

References

Top Articles
Latest Posts
Recommended Articles
Article information

Author: Terrell Hackett

Last Updated:

Views: 5409

Rating: 4.1 / 5 (52 voted)

Reviews: 91% of readers found this page helpful

Author information

Name: Terrell Hackett

Birthday: 1992-03-17

Address: Suite 453 459 Gibson Squares, East Adriane, AK 71925-5692

Phone: +21811810803470

Job: Chief Representative

Hobby: Board games, Rock climbing, Ghost hunting, Origami, Kabaddi, Mushroom hunting, Gaming

Introduction: My name is Terrell Hackett, I am a gleaming, brainy, courageous, helpful, healthy, cooperative, graceful person who loves writing and wants to share my knowledge and understanding with you.