Testing LLMs

Stephen 85 Reputation points
2025-04-07T13:36:05.1666667+00:00

I am trying to test different LLMs to see which one best addresses our use case.

I was allocated quota on a Standard_NC24ads_A100_v4 with 24 cores and 220 GB RAM.

I was able to run my first LLM test on that. However, when I go to test other LLMs, they all seem to be available only on different SKUs, even though the Standard_NC24ads_A100_v4 should be able to handle them.

What is the best way to test different LLMs? Do I essentially have to go through the quota process for a different machine SKU for each model I want to test?

Thanks,

Stephen Pillow

Azure AI services

Accepted answer
  Manas Mohanty 2,930 Reputation points Microsoft External Staff
    2025-04-07T15:30:39.68+00:00

    Hi Stephen,

    Not all of these models require a high-end GPU.

    Some are available as serverless APIs or run on Azure-managed compute (the Azure OpenAI models), and some can even run on memory-optimized compute.

    You can filter these out in the model catalog with the "Deployment options = Serverless API" filter; a minimal sketch of calling such a deployment follows below.
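
    For example, once a model is deployed as a serverless API, you can test it from Python with the azure-ai-inference package. This is a minimal sketch, not a definitive implementation; the endpoint URL, key, and test prompt are placeholders you would replace with your own deployment's values.

    ```python
    # pip install azure-ai-inference
    from azure.ai.inference import ChatCompletionsClient
    from azure.ai.inference.models import SystemMessage, UserMessage
    from azure.core.credentials import AzureKeyCredential

    # Placeholder endpoint and key for a serverless (pay-as-you-go)
    # deployment created from the model catalog -- replace with your own.
    client = ChatCompletionsClient(
        endpoint="https://<your-deployment>.<region>.models.ai.azure.com",
        credential=AzureKeyCredential("<your-api-key>"),
    )

    # Send the same test prompt to every candidate model so the
    # results stay comparable across deployments.
    response = client.complete(
        messages=[
            SystemMessage(content="You are a helpful assistant."),
            UserMessage(content="<your use-case test prompt here>"),
        ],
        temperature=0.2,
    )

    print(response.choices[0].message.content)
    ```

    Because the client only needs an endpoint and a key, you can point the same script at each serverless deployment in turn without requesting any GPU quota.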

    Yes, for certain LLMs that require managed compute, you may still need to request GPU quota for the specific SKU their deployment requires, as sketched below.
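
    If you do need managed GPU compute, you can check your current per-region quota before raising a request. A minimal sketch using the azure-mgmt-compute package; the subscription ID and region are placeholders, and the family-name prefix filter is an assumption about how GPU family names appear in your subscription.

    ```python
    # pip install azure-mgmt-compute azure-identity
    from azure.identity import DefaultAzureCredential
    from azure.mgmt.compute import ComputeManagementClient

    # Placeholder subscription ID -- replace with your own.
    client = ComputeManagementClient(
        DefaultAzureCredential(), "<subscription-id>"
    )

    # List vCPU usage vs. limit per VM family in the region, keeping
    # only GPU (NC/ND) families; the prefix check is an assumption
    # about the family-name format.
    for usage in client.usage.list(location="eastus"):
        name = usage.name.value
        if name.startswith(("standardNC", "standardND")):
            print(f"{name}: {usage.current_value}/{usage.limit} vCPUs used")
    ```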

    Reference- https://learn.microsoft.com/en-us/azure/machine-learning/concept-model-catalog?view=azureml-api-2

    Hope this addresses your query.

    If it helped, please accept this answer.

    Thank you.

    1 person found this answer helpful.

0 additional answers

