
LLAMARA - Large Language Assistant for Model-Augmented Retrieval and Analysis

LLAMARA is an LLM-based assistant for information retrieval from a provided knowledge base. It aims to support researchers working with scientific papers, whitepapers, and documentation, and may also serve research findings to the public in an accessible way.

NOTE: This repository contains the LLAMARA backend only.

Features

  • Retrieval-Augmented Generation (RAG) chat functionality based on the provided knowledge
  • Support for uploading the following file types as knowledge sources:
    • PDF
    • Microsoft Word DOCX
    • Markdown
    • TXT
    • and everything else supported by Apache Tika
  • References to the knowledge sources used in generated responses, with the ability to download file-based sources
  • Interchangeable LLM (chat model) for each request
  • Extensive knowledge management (including the ability to set labels and tags)
  • Multi-User support with Single-Sign-On (SSO) support through external OIDC providers (we recommend Keycloak):
    • Admins can manage all knowledge added to LLAMARA.
    • Users can add individual knowledge and share it with other users of LLAMARA through fine-grained permissions.
    • Anonymous access can be enabled to allow anyone to make use of the publicly shared knowledge.
  • Multiple sessions per user with server-side chat history
  • Integration with multiple LLM (chat model) providers (e.g. OpenAI)
  • Integration with multiple embedding model providers (e.g. OpenAI, Ollama)
  • Integration with multiple embedding stores (e.g. Qdrant)
  • Built with Quarkus, the Supersonic Subatomic Java Framework
  • Uses LangChain4j, the versatile LLM integration library
  • Relies on battle-tested open-source software such as PostgreSQL, MinIO & Redis

Configuration

LLAMARA comes with a default configuration that can be overridden by providing an application.yaml file in the config directory. Refer to config/README.md for more information.
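
As a minimal sketch, such an override file could adjust standard Quarkus settings (the key shown is a generic Quarkus option, not a LLAMARA-specific one; those are documented in config/README.md):

  # config/application.yaml -- overrides LLAMARA's default configuration
  quarkus:
    log:
      level: INFO   # example override using a standard Quarkus key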

Running LLAMARA

The easiest way to run LLAMARA is to use the provided Docker container with Docker Compose.

Please refer to llamara-docker for more information.

Dependencies

For development, you only need to set up the AI Model Provider dependency, as the other dependencies are provided by Quarkus Dev Services. See DEVELOPMENT.md for more information.

Authentication

This application requires an OIDC authentication provider to be set up. The OIDC provider's auth-server-url and client-id must be set in the application.yaml file, and the client secret must be provided through the QUARKUS_OIDC_CREDENTIALS_SECRET environment variable. For Keycloak, you need to add the microprofile-jwt and profile scopes to the Quarkus client, see the Keycloak Server Documentation.
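
A minimal application.yaml snippet could look like this (the realm URL and client ID are placeholders for your own OIDC provider setup):

  # config/application.yaml -- OIDC settings (placeholder values)
  quarkus:
    oidc:
      auth-server-url: https://keycloak.example.com/realms/llamara  # placeholder Keycloak realm URL
      client-id: llamara-backend                                    # placeholder client ID

The client secret is then passed separately through the QUARKUS_OIDC_CREDENTIALS_SECRET environment variable.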

AI Model Provider

With the default configuration, LLAMARA relies on GPT-4o mini and text-embedding-3-large from OpenAI. You therefore need to provide an OpenAI API key through the OPENAI_API_KEY environment variable, e.g. through an .env file.
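
For example, a minimal .env file could look like this (placeholder value):

  # .env -- provides the OpenAI API key to LLAMARA
  OPENAI_API_KEY=sk-your-api-key   # placeholder, use your own key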

Databases & Object Storage

PostgreSQL

The application requires a PostgreSQL database on localhost:5432 (default). If needed, specify a password through the QUARKUS_DATASOURCE_PASSWORD environment variable.

The application requires its tables to be available in the configured JDBC database.
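
For a non-default setup, the connection can be adjusted through standard Quarkus datasource environment variables, e.g. (placeholder values; the JDBC URL variable is the environment form of quarkus.datasource.jdbc.url):

  # .env -- PostgreSQL settings (placeholder values)
  QUARKUS_DATASOURCE_PASSWORD=changeme
  QUARKUS_DATASOURCE_JDBC_URL=jdbc:postgresql://localhost:5432/llamara   # placeholder database name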

MinIO

The application requires a MinIO object storage on localhost:9000 (default). You need to set up access and secret key through the MinIO web interface and provide them through the QUARKUS_MINIO_ACCESS_KEY and QUARKUS_MINIO_SECRET_KEY environment variables.
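
Both keys can then be provided through the environment, e.g. (placeholder values):

  # .env -- MinIO credentials created through the MinIO web interface
  QUARKUS_MINIO_ACCESS_KEY=your-access-key   # placeholder
  QUARKUS_MINIO_SECRET_KEY=your-secret-key   # placeholder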

Redis

This application requires a Redis server on localhost:6379 (default). It uses database 1 (default) for chat memory and database 2 (default) for chat history. If needed, specify passwords through the QUARKUS_REDIS_CHAT_MEMORY_PASSWORD and QUARKUS_REDIS_CHAT_HISTORY_PASSWORD environment variables.
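
For example (placeholder values; only needed if your Redis server requires authentication):

  # .env -- Redis passwords for chat memory and chat history
  QUARKUS_REDIS_CHAT_MEMORY_PASSWORD=changeme
  QUARKUS_REDIS_CHAT_HISTORY_PASSWORD=changeme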

Qdrant

This application requires a Qdrant Vector Database on localhost:6334 (gRPC) (default). If needed, specify an API key through the QDRANT_API_KEY environment variable.

LLAMARA will create the required collection according to the configured collection name and vector size, and enable a payload index for the knowledge_id payload key.

How to do that manually

Before using Qdrant, you need to create the required collection:

  1. Visit http://localhost:6333/dashboard#/tutorial/quickstart
  2. Create a collection ${COLLECTION_NAME} with a vector size matching the embedding model in use (see the example below)
  3. Enable payload index for the knowledge_id payload key by executing the following in http://localhost:6333/dashboard#/console:
    PUT /collections/${COLLECTION_NAME}/index
      {
        "field_name": "knowledge_id",
        "field_schema": "uuid"
      }
    

${COLLECTION_NAME} is the configured collection name.
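
For step 2, the collection can be created in the same console, e.g. for text-embedding-3-large with vector size 3072 and dot-product distance (see the table below; the vector size must match your embedding model):

  PUT /collections/${COLLECTION_NAME}
    {
      "vectors": {
        "size": 3072,
        "distance": "Dot"
      }
    }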

Common embedding models with their vector sizes and recommended distance calculations:

  Provider  Embedding Model         Vector Size  Distance Calculation  Refs
  OpenAI    text-embedding-3-small  1536         Dot Product           Docs
  OpenAI    text-embedding-3-large  3072         Dot Product           Docs
  Ollama    nomic-embed-text        768          Dot Product           Ollama

Endpoints

REST API

LLAMARA backend provides a REST API on the /rest path to be consumed by a user interface. You can explore it through the Swagger UI at the /q/swagger-ui endpoint, or use the OpenAPI YAML or JSON schema definitions available from CI artifacts.

Info Endpoint

The /q/info endpoint provides information about LLAMARA backend, including Git, Java, OS and build details.
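
It can be queried directly, e.g. assuming the default Quarkus HTTP port 8080:

  curl http://localhost:8080/q/info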

Health Endpoint

LLAMARA backend exposes four REST endpoints following the Eclipse MicroProfile Health specification:

  • /q/health/live: The application is up and running.
  • /q/health/ready: The application is ready to serve requests.
  • /q/health/started: The application is started.
  • /q/health: Accumulates all health check procedures of the application.
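
For example, a liveness probe can query the endpoint directly (assuming the default Quarkus HTTP port 8080); the response follows the MicroProfile Health JSON format, with the checks array holding the individual check results:

  curl http://localhost:8080/q/health/live

  {
    "status": "UP",
    "checks": [ ... ]
  }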

Known Issues

  • Filtering embeddings by permissions in the retrieval step only works if knowledge has only a single permission set.