Introducing the Azure AI Inference SDK: Access More AI Models with the Azure AI Model Catalog

Luis Quintanilla · Aug 13, 2024

AI models are constantly evolving and improving, but keeping up with the latest developments can be challenging.

That’s why we’re introducing the Azure AI Inference SDK for .NET.

This SDK lets you easily access and use a wide range of AI models from the Azure AI Model Catalog for inference tasks like chat, so you can seamlessly integrate AI into your applications that meet your needs.

What is the Azure AI Model Catalog?

Image displaying models in the Azure AI Model Catalog

The Model Catalog in Azure AI Studio makes it easy to browse through various AI models and deploy them.

Models from the catalog can be deployed to Managed Compute or as a Serverless API.

Some key features include:

Model Availability: The model catalog features a diverse collection of models from providers such as Microsoft, Azure OpenAI, Mistral, Meta, and Cohere. This ensures you can find the right model to satisfy your requirements.
Easy to deploy: Serverless API deployments remove the complexity about hosting and provisioning the hardware to run cutting edge models. When deploying models with serverless API, you don’t need quota to host them and you are billed per token.
Responsible AI Built-In: Safety is a priority. Language models from the catalog come with default configurations of Azure AI Content Safety moderation filters which detect harmful content.

For more details, see the Azure AI Model Catalog documentation.

Get Started

Deploy a model like Phi-3. For more details, see the Azure AI Model Catalog deployment documentation.
Create a C# console application and install the Azure.AI.Inference SDK from NuGet.
Add the following code to your application to start making requests to your model service. Make sure to replace your key and endpoint with those provided with your deployment.

Code:

var key = "YOUR-MODEL-API-KEY";
var endpoint = "YOUR-MODEL-ENDPOINT";

var chatClient = new ChatCompletionsClient(
    new Uri(endpoint), 
    new Azure.AzureKeyCredential(key));

var chatHistory = new List<ChatRequestMessage>()
{
    new ChatRequestSystemMessage("You are a helpful assistant that knows about AI.")
};

Console.WriteLine($"System: {
    chatHistory
    .Where(x => x.GetType() == typeof(ChatRequestSystemMessage))
    .Select(x => ((ChatRequestSystemMessage)x).Content)
    .First()}");

while(true)
{
    Console.Write("You: ");
    var userMessage = Console.ReadLine();

    // Exit loop
    if (userMessage.StartsWith("/q"))
    {
        break;
    }

    chatHistory.Add(new ChatRequestUserMessage(userMessage));

    ChatCompletions? response = await chatClient.CompleteAsync(chatHistory);
    ChatResponseMessage? assistantMessage = response.Choices.First().Message;
    chatHistory.Add(new ChatRequestAssistantMessage(assistantMessage));

    Console.WriteLine($"Assistant: {assistantMessage.Content}");
}

Sample console output of a chat with a model from Azure AI model catalog

For more details, see the Azure AI Model Inference API documentation.

Join the AI Community Standup

Interested in learning more about the Azure AI Inference SDK for .NET?

Don’t miss the AI Community Standup session on August 14th, at 10 AM PST the team will dive into all the details.

Conclusion

We’re excited to see what you build! Try out the Azure AI Inference SDK and give us feedback.

The post Introducing the Azure AI Inference SDK: Access More AI Models with the Azure AI Model Catalog appeared first on .NET Blog.

Continue reading...

Introducing the Azure AI Inference SDK: Access More AI Models with the Azure AI Model Catalog

Luis Quintanilla

Guest

What is the Azure AI Model Catalog?​

Get Started​

Join the AI Community Standup​

Conclusion​

What is the Azure AI Model Catalog?

Get Started

Join the AI Community Standup

Conclusion