Skip to content

Latest commit

 

History

History
 
 

text-generation-webui

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 

text-generation-webui by oobabooga on akash network

A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.

Its goal is to become the AUTOMATIC1111/stable-diffusion-webui of text generation.

The Tags of the Docker Image is the same as the release on the official repository. I.e. "v1.3.1" or "v1.4"

Image1 Image2
Image3 Image4

Features

  • 3 interface modes: default, notebook, and chat
  • Multiple model backends: tranformers, llama.cpp, AutoGPTQ, GPTQ-for-LLaMa, ExLlama, RWKV, FlexGen
  • Dropdown menu for quickly switching between different models
  • LoRA: load and unload LoRAs on the fly, load multiple LoRAs at the same time, train a new LoRA
  • Precise instruction templates for chat mode, including Alpaca, Vicuna, Open Assistant, Dolly, Koala, ChatGLM, MOSS, RWKV-Raven, Galactica, StableLM, WizardLM, Baize, Ziya, Chinese-Vicuna, MPT, INCITE, Wizard Mega, KoAlpaca, Vigogne, Bactrian, h2o, and OpenBuddy
  • Multimodal pipelines, including LLaVA and MiniGPT-4
  • 8-bit and 4-bit inference through bitsandbytes
  • CPU mode for transformers models
  • DeepSpeed ZeRO-3 inference
  • Extensions
  • Custom chat characters
  • Very efficient text streaming
  • Markdown output with LaTeX rendering, to use for instance with GALACTICA
  • Nice HTML output for GPT-4chan
  • API, including endpoints for websocket streaming (see the examples)

To learn how to use the various features, check out the Documentation: https://github.com/oobabooga/text-generation-webui/tree/main/docs

Acknowledgements