Skip to content

Latest commit

 

History

History
18 lines (12 loc) · 967 Bytes

ReadMe.md

File metadata and controls

18 lines (12 loc) · 967 Bytes

An LLM Web Application

A simple flask application that runs a local copy of the AMD OLMo language model.

Instructions

Warnings

  • The model is roughly 5GB and has to be downloaded the first time the application is run. Please account for your device storage and download speed as appropriate.
  • A test run I did had the model load in 16 seconds and give a response in 1 minute 17 seconds to get a response on my laptop as I used my CPU only (an Intel Core i5, 8GB laptop). Model response times would vary based on your device specifications e.g RAM size, CPU speed etc.

Sample

A running sample is given below

Screenshot of the application homepage

The response is

Screenshot of the response to the question "What is the Nicene creed?"