(chore) Update Quick Start for System Recommendations (#722)
* Update Quick Start guide to include system requirements
* Add clarification notes to quick start guide
jalling97 authored Jul 8, 2024
1 parent 4e8fb18 commit 80adda2
Showing 1 changed file with 29 additions and 3 deletions: website/content/en/docs/local deploy guide/quick_start.md
@@ -8,6 +8,18 @@ weight: 2

The fastest and easiest way to get started with a deployment of LeapfrogAI is by using [UDS](https://github.com/defenseunicorns/uds-core). These quick start instructions show how to deploy LeapfrogAI in either a CPU or GPU-enabled environment.

## System Requirements

Please review the following table to ensure your system meets the minimum requirements. LFAI can run with or without GPU access, but GPU-enabled systems are recommended for their performance gains. The table below assumes a single personal device:

| | Minimum | Recommended (Performance) |
|-----|-------------------|---------------------------|
| RAM | 32 GB | 128 GB |
| CPU | 8 Cores @ 3.0 GHz | 32 Cores @ 3.0 GHz |
| GPU | N/A | 2x NVIDIA RTX 4090 GPUs |

Additionally, please check the list of tested [operating systems](https://docs.leapfrog.ai/docs/local-deploy-guide/requirements/#operating-systems) for compatibility.
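On a Linux host, a quick sketch for comparing your machine against the table above (this is not part of the official tooling; it assumes `nproc` and `/proc/meminfo` are available, as on most distributions, and only checks the CPU/RAM minimums):

```bash
#!/bin/sh
# Sketch: compare this machine against the minimum CPU/RAM requirements.
# Assumes a Linux host with /proc/meminfo and nproc available.

cpu_cores=$(nproc)
ram_gb=$(( $(grep MemTotal /proc/meminfo | awk '{print $2}') / 1024 / 1024 ))

echo "CPU cores: ${cpu_cores} (minimum: 8)"
echo "RAM:       ${ram_gb} GB (minimum: 32)"

if [ "$cpu_cores" -ge 8 ] && [ "$ram_gb" -ge 32 ]; then
  echo "Meets minimum CPU/RAM requirements"
else
  echo "Below minimum CPU/RAM requirements"
fi
```

GPU presence is covered separately under the prerequisites below, since it depends on driver and container-toolkit installation rather than raw hardware counts.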

## Prerequisites

- [Docker](https://docs.docker.com/engine/install/)
@@ -21,6 +33,18 @@ GPU considerations (NVIDIA GPUs only):
- NVIDIA GPU drivers compatible with CUDA (>=12.2).
- NVIDIA Container Toolkit is available via internet access, pre-installed, or on a mirrored package repository in the air gap.
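A hedged sketch for spot-checking the two NVIDIA prerequisites above on the host (assumes `nvidia-smi` ships with the driver and `nvidia-ctk` with the Container Toolkit; on a CPU-only host it simply reports that the GPU prerequisites are absent):

```bash
#!/bin/sh
# Sketch: verify the NVIDIA driver and Container Toolkit are present.
# Only meaningful on a GPU-enabled host; prints a note otherwise.

gpu_ready=yes

if command -v nvidia-smi >/dev/null 2>&1; then
  # The installed driver version; CUDA compatibility appears in nvidia-smi's header.
  nvidia-smi --query-gpu=driver_version --format=csv,noheader
else
  echo "nvidia-smi not found: NVIDIA drivers are not installed"
  gpu_ready=no
fi

if command -v nvidia-ctk >/dev/null 2>&1; then
  nvidia-ctk --version
else
  echo "nvidia-ctk not found: NVIDIA Container Toolkit is not installed"
  gpu_ready=no
fi

echo "GPU prerequisites satisfied: ${gpu_ready}"
```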

## Default Models
LeapfrogAI deploys with certain default models. The following models were selected to balance portability and performance for a base deployment:

| Backend | CPU/GPU Support | Default Model |
|------------------|-----------------|------------------------------------------------------------------------------|
| llama-cpp-python | CPU | [SynthIA-7B-v2.0-GGUF](https://huggingface.co/TheBloke/SynthIA-7B-v2.0-GGUF) |
| vllm | GPU | [Synthia-7B-v2.0-GPTQ](https://huggingface.co/TheBloke/SynthIA-7B-v2.0-GPTQ) |
| text-embeddings | CPU/GPU | [Instructor-XL](https://huggingface.co/hkunlp/instructor-xl) |
| whisper | CPU/GPU | [OpenAI whisper-base](https://huggingface.co/openai/whisper-base) |

**NOTE:** If a system's specifications exceed the minimum requirements, advanced users can swap the default models for larger or fine-tuned alternatives.

## Disclaimers

GPU workloads **_WILL NOT_** run if GPU resources are unavailable to the pod(s). You must provide sufficient NVIDIA GPU scheduling or else the pod(s) will go into a crash loop.
@@ -67,18 +91,20 @@ In order to test the GPU deployment locally on K3d, use the following command wh
```

## Checking Deployment

Once the cluster and LFAI have deployed, the cluster and pods can be inspected using UDS:

```bash
uds zarf tools monitor
```
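For a non-interactive check, the kubectl vendored by Zarf can list pod status directly; a sketch (assumes the `uds` CLI is on the PATH and pointed at the deployed cluster, and skips gracefully when it is not):

```bash
#!/bin/sh
# Sketch: list pod status across all namespaces using the kubectl
# vendored by zarf, instead of the interactive monitor view.

if command -v uds >/dev/null 2>&1; then
  uds zarf tools kubectl get pods -A
  status="checked"
else
  echo "uds CLI not found on PATH"
  status="skipped"
fi
```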

The following URLs should now also be available to view LFAI resources:

**DISCLAIMER**: These URLs will only be available *after* both K3D-core and LFAI have been deployed. They will also only be accessible from the host system that deployed the cluster.

| Tool | URL |
| ---------- | ------------------------------------- |
| UI | <https://ai.uds.dev> |
| API | <https://leapfrogai-api.uds.dev/docs> |
| RAG Server | <https://leapfrogai-rag.uds.dev/docs> |
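A sketch for probing all three URLs from the deployment host (assumes `curl` is installed; since the `uds.dev` hostnames only resolve on the host that deployed the cluster, failures from any other machine are expected):

```bash
#!/bin/sh
# Sketch: probe each LFAI URL and report its HTTP status.
# Failures are expected when run from a machine other than the
# deployment host, where these hostnames do not resolve.

reachable=0
for url in https://ai.uds.dev \
           https://leapfrogai-api.uds.dev/docs \
           https://leapfrogai-rag.uds.dev/docs; do
  code=$(curl -k -s -o /dev/null --max-time 5 -w '%{http_code}' "$url" || echo 000)
  echo "${url} -> HTTP ${code}"
  if [ "$code" = "200" ]; then
    reachable=$((reachable + 1))
  fi
done
echo "${reachable} of 3 URLs reachable"
```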

## Accessing the UI

