Open_Source_LLM Tool - Restore Dynamic List + Fixes (#1076)
# Description

Changes:
1. Restore Open_Source_LLM Tool Dynamic list functionality
2. Fixes: WS: 2806189, 2806307

# All Promptflow Contribution checklist:
- [x] **The pull request does not introduce [breaking changes].**
- [x] **CHANGELOG is updated for new features, bug fixes or other
significant changes.**
- [x] **I have read the [contribution guidelines](../CONTRIBUTING.md).**
- [x] **Create an issue and link to the pull request to get dedicated
review from promptflow team. Learn more: [suggested
workflow](../CONTRIBUTING.md#suggested-workflow).**

## General Guidelines and Best Practices
- [x] Title of the pull request is clear and informative.
- [x] There are a small number of commits, each of which have an
informative message. This means that previously merged commits do not
appear in the history of the PR. For more information on cleaning up the
commits in your PR, [see this
page](https://github.com/Azure/azure-powershell/blob/master/documentation/development-docs/cleaning-up-commits.md).

### Testing Guidelines
- [x] Pull request includes test coverage for the included changes.

---------

Co-authored-by: Gerard <[email protected]>
gjwoods and Gerard authored Dec 6, 2023
1 parent 7823185 commit 9199acf
Showing 8 changed files with 1,199 additions and 292 deletions.
3 changes: 2 additions & 1 deletion .cspell.json
@@ -153,7 +153,8 @@
"Abhishek",
"restx",
"httpx",
"tiiuae"
"tiiuae",
"metagenai"
],
"flagWords": [
"Prompt Flow"
54 changes: 35 additions & 19 deletions docs/reference/tools-reference/open_source_llm_tool.md
@@ -2,48 +2,65 @@

## Introduction

- The prompt flow Open Source LLM tool enables you to utilize a variety of Open Source and Foundational Models, such as [Falcon](https://ml.azure.com/models/tiiuae-falcon-7b/version/4/catalog/registry/azureml) or [Llama 2](https://ml.azure.com/models/Llama-2-7b-chat/version/14/catalog/registry/azureml-meta) for natural language processing, in PromptFlow.
+ The Open Source LLM tool enables the utilization of a variety of Open Source and Foundational Models, such as [Falcon](https://ml.azure.com/models/tiiuae-falcon-7b/version/4/catalog/registry/azureml) and [Llama 2](https://ml.azure.com/models/Llama-2-7b-chat/version/14/catalog/registry/azureml-meta), for natural language processing in Azure ML Prompt Flow.

Here's how it looks in action in the Visual Studio Code prompt flow extension. In this example, the tool is used to call a Llama 2 chat endpoint and ask "What is CI?".

- ![Screenshot of the Open Source Llm On vsCode PromptFlow extension](../../media/reference/tools-reference/open_source_llm_on_vscode_promptflow.png)
+ ![Screenshot of the Open Source LLM on the VS Code Prompt Flow extension](../../media/reference/tools-reference/open_source_llm_on_vscode_promptflow.png)

- This prompt flow supports two different LLM API types:
+ This prompt flow tool supports two different LLM API types:

- **Chat**: Shown in the example above. The chat API type facilitates interactive conversations with text-based inputs and responses.
- **Completion**: The Completion API type is used to generate single response text completions based on provided prompt input.
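
For a concrete sense of the difference, the two API types ultimately map to differently shaped scoring requests against the deployed endpoint. The following Python sketch is illustrative only: the payload schema shown matches common AzureML text-generation deployments but varies by model family, and the endpoint URL and key are placeholders.

```python
import json
import urllib.request

# Placeholders for a hypothetical AzureML Online Inferencing endpoint.
ENDPOINT_URL = "https://<endpoint-name>.<region>.inference.ml.azure.com/score"
API_KEY = "<endpoint-api-key>"

# Completion-style request: a plain prompt string in, a text completion out.
completion_payload = {
    "input_data": {
        "input_string": ["CI stands for"],
        "parameters": {"temperature": 0.7, "max_new_tokens": 100},
    }
}

# Chat-style request: role/content messages in, an assistant reply out.
chat_payload = {
    "input_data": {
        "input_string": [{"role": "user", "content": "What is CI?"}],
        "parameters": {"temperature": 0.7, "max_new_tokens": 100},
    }
}

request = urllib.request.Request(
    ENDPOINT_URL,
    data=json.dumps(chat_payload).encode("utf-8"),
    headers={
        "Content-Type": "application/json",
        "Authorization": f"Bearer {API_KEY}",
    },
)
with urllib.request.urlopen(request) as response:
    print(json.loads(response.read()))
```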

## Quick Overview: How do I use the Open Source LLM Tool?

- 1. Choose a Model from the AzureML Model Catalog and deploy.
- 2. Setup and select the connections to the model deployment.
- 3. Configure the tool with the model settings.
+ 1. Choose a Model from the AzureML Model Catalog and get it deployed.
+ 2. Connect to the model deployment.
+ 3. Configure the Open Source LLM tool settings.
4. Prepare the Prompt with [guidance](./prompt-tool.md#how-to-write-prompt).
5. Run the flow (a code-first sketch follows below).
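
For the code-first path, the final step can also be scripted. Below is a minimal sketch using the local `PFClient` from the `promptflow` SDK; the flow folder `./my_flow` and the `question` input are hypothetical placeholders.

```python
from promptflow import PFClient

# Local prompt flow client; no Azure workspace is required for a local test.
pf = PFClient()

# Run the flow once with a sample input. "./my_flow" and "question"
# are placeholders for your flow folder and its declared input.
result = pf.test(
    flow="./my_flow",
    inputs={"question": "What is CI?"},
)
print(result)
```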

## Prerequisites: Model Deployment

1. Pick the model which matches your scenario from the [Azure Machine Learning model catalog](https://ml.azure.com/model/catalog).
2. Use the "Deploy" button to deploy the model to an AzureML Online Inferencing endpoint.
   1. Use one of the pay-as-you-go deployment options.

More detailed instructions can be found in [Deploying foundation models to endpoints for inferencing](https://learn.microsoft.com/en-us/azure/machine-learning/how-to-use-foundation-models?view=azureml-api-2#deploying-foundation-models-to-endpoints-for-inferencing).
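
After the deployment completes, the endpoint's scoring URL and API key (needed for the connection steps below) can also be retrieved programmatically. A sketch using the `azure-ai-ml` SDK, assuming placeholder workspace coordinates and a hypothetical endpoint name:

```python
from azure.ai.ml import MLClient
from azure.identity import DefaultAzureCredential

# Placeholders: substitute your own workspace coordinates.
ml_client = MLClient(
    credential=DefaultAzureCredential(),
    subscription_id="<subscription-id>",
    resource_group_name="<resource-group>",
    workspace_name="<workspace-name>",
)

# "my-llama-endpoint" is a hypothetical endpoint name.
endpoint = ml_client.online_endpoints.get(name="my-llama-endpoint")
keys = ml_client.online_endpoints.get_keys(name="my-llama-endpoint")

print(endpoint.scoring_uri)  # -> endpoint_url
print(keys.primary_key)      # -> endpoint_api_key
```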

- ## Prerequisites: Prompt flow Connections
+ ## Prerequisites: Connect to the Model

- In order for prompt flow to use your deployed model, you will need to setup a Connection. Explicitly, the Open Source LLM tool uses the CustomConnection.
+ In order for prompt flow to use your deployed model, you will need to connect to it. There are several ways to connect.

- 1. Instructions to create a Custom Connection [can be found here.](https://microsoft.github.io/promptflow/how-to-guides/manage-connections.html#create-a-connection)
+ ### 1. Endpoint Connections

- The keys to set are:
+ Once associated with an AzureML or Azure AI Studio workspace, the Open Source LLM tool can use the endpoints on that workspace.

- 1. **endpoint_url**
-    - This value can be found at the previously created Inferencing endpoint.
- 2. **endpoint_api_key**
-    - Ensure to set this as a secret value.
-    - This value can be found at the previously created Inferencing endpoint.
- 3. **model_family**
-    - Supported values: LLAMA, DOLLY, GPT2, or FALCON
-    - This value is dependent on the type of deployment you are targetting.
+ 1. **Using AzureML or Azure AI Studio workspaces**: If you are using prompt flow in one of the browser-based workspaces, the online endpoints available in that workspace will automatically show up.

+ 2. **Using VS Code or Code First**: If you are using prompt flow in VS Code or one of the Code First offerings, you will need to connect to the workspace. The Open Source LLM tool uses the azure.identity DefaultAzureCredential client for authorization. One way is through [setting environment credential values](https://learn.microsoft.com/en-us/python/api/azure-identity/azure.identity.environmentcredential?view=azure-python).
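
As an example of the environment-variable route, `DefaultAzureCredential` tries `EnvironmentCredential` first, which reads a service principal from standard variables. A minimal sketch, assuming a service principal already exists (all values are placeholders):

```python
import os

from azure.identity import DefaultAzureCredential

# EnvironmentCredential (checked by DefaultAzureCredential) reads these
# standard variables; the values below are placeholders.
os.environ["AZURE_TENANT_ID"] = "<tenant-id>"
os.environ["AZURE_CLIENT_ID"] = "<service-principal-client-id>"
os.environ["AZURE_CLIENT_SECRET"] = "<service-principal-secret>"

credential = DefaultAzureCredential()

# Verify the credential by requesting an Azure Resource Manager token.
token = credential.get_token("https://management.azure.com/.default")
print("Token acquired; expires at:", token.expires_on)
```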

+ ### 2. Custom Connections

+ The Open Source LLM tool uses the CustomConnection. Prompt flow supports two types of connections:

+ 1. **Workspace Connections** - These are connections which are stored as secrets on an Azure Machine Learning workspace. While these can be used in many places, they are commonly created and maintained in the Studio UI.

+ 2. **Local Connections** - These are connections which are stored locally on your machine. These connections are not available in the Studio UI, but can be used with the VS Code extension.

+ Instructions on how to create a workspace or local Custom Connection [can be found here](../../how-to-guides/manage-connections.md#create-a-connection).

+ The required keys to set are:

+ 1. **endpoint_url**
+    - This value can be found at the previously created Inferencing endpoint.
+ 2. **endpoint_api_key**
+    - Ensure to set this as a secret value.
+    - This value can be found at the previously created Inferencing endpoint.
+ 3. **model_family**
+    - Supported values: LLAMA, DOLLY, GPT2, or FALCON
+    - This value is dependent on the type of deployment you are targeting.
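
For illustration, a Custom Connection carrying these keys can also be created from Python rather than the UI. A sketch using the promptflow SDK with placeholder values; the connection name is hypothetical:

```python
from promptflow import PFClient
from promptflow.entities import CustomConnection

pf = PFClient()

# Placeholder values: copy endpoint_url and endpoint_api_key from the
# "Consume" tab of your deployed Online Inferencing endpoint.
connection = CustomConnection(
    name="my_llama_connection",  # hypothetical connection name
    secrets={"endpoint_api_key": "<endpoint-api-key>"},
    configs={
        "endpoint_url": "https://<endpoint-name>.<region>.inference.ml.azure.com/score",
        "model_family": "LLAMA",
    },
)
pf.connections.create_or_update(connection)
```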

## Running the Tool: Inputs

@@ -53,7 +70,6 @@ The Open Source LLM tool has a number of parameters, some of which are required.

| Name | Type | Description | Required |
|------|------|-------------|----------|
| api | string | This is the API mode and will depend on the model used and the scenario selected. *Supported values: (Completion \| Chat)* | Yes |
| endpoint_name | string | Name of an Online Inferencing Endpoint with a supported model deployed on it. Takes priority over connection. | No |
- | connection | CustomConnection | This is the name of the connection which points to the Online Inferencing endpoint. | No |
| temperature | float | The randomness of the generated text. Default is 1. | No |
| max_new_tokens | integer | The maximum number of tokens to generate in the completion. Default is 500. | No |
| top_p | float | The probability of using the top choice from the generated tokens. Default is 1. | No |
