Add client for Azure-hosted Llama models #871

Closed
drdavella opened this issue Oct 8, 2024 · 0 comments · Fixed by #872
We would like to support codemods that use Llama models hosted in Azure.

We have proposed a spec update (pixee/codemodder-specs#39) that introduces two new environment variables:

CODEMODDER_AZURE_LLAMA_API_KEY=<KEY>
CODEMODDER_AZURE_LLAMA_ENDPOINT=<ENDPOINT>

If only one of these variables is present, we should raise an exception.

If both variables are present, we need to create a new Llama client using the Azure AI Inference package.

Use of the client looks something like this:

# pip install azure-ai-inference
import os
from azure.ai.inference import ChatCompletionsClient
from azure.core.credentials import AzureKeyCredential

api_key = os.getenv("<API key>", "")
if not api_key:
    raise Exception("A key should be provided to invoke the endpoint")

client = ChatCompletionsClient(
    endpoint='<endpoint URL>',
    credential=AzureKeyCredential(api_key)
)

The behavior of this client is completely independent of the OpenAI/Azure OpenAI client: one, the other, or both can be configured at the same time. The client should live on the CodemodExecutionContext. Implementing this may lead to some breaking API changes.
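
For illustration only, the wiring on the execution context might look something like the fragment below; the attribute name is an assumption, not the existing codemodder API.

class CodemodExecutionContext:
    """Illustrative fragment only; the real context carries many more fields."""

    def __init__(self) -> None:
        # Configured independently of any OpenAI/Azure OpenAI client that may
        # also live on the context; None when neither variable is set.
        self.azure_llama_llm_client = azure_llama_client_from_env()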
