RAG Demo

This project implements a minimal Retrieval-Augmented Generation (RAG) pipeline using Mistral AI, PostgreSQL with pgvector, and Flask.

Architecture

The system follows a simple RAG pipeline:

1. Markdown documents are loaded and chunked.
2. Chunks are embedded using Mistral embeddings.
3. Embeddings are stored in PostgreSQL with pgvector.
4. At query time:
   - The query is embedded.
   - The top-k most similar chunks are retrieved.
   - The LLM generates an answer using the retrieved context.
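
The steps above can be sketched end to end in plain Python. This is a minimal illustration under stated assumptions, not the project's code: `toy_embed` is a hypothetical bag-of-words stand-in for the Mistral embedding call, an in-memory list stands in for the pgvector table, and the fixed-word-count chunker is an assumed strategy.

```python
import math
from collections import Counter

def chunk_text(text, max_words=8):
    """Split text into chunks of at most max_words words (assumed strategy)."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]

def toy_embed(text):
    """Hypothetical stand-in for an embedding model: a word-count vector."""
    return Counter(w.strip(".,?!").lower() for w in text.split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve(query, store, k=2):
    """Return the k stored chunks most similar to the query."""
    q = toy_embed(query)
    ranked = sorted(store, key=lambda item: cosine(q, item[1]), reverse=True)
    return [chunk for chunk, _ in ranked[:k]]

def build_prompt(query, context_chunks):
    """Assemble the prompt that would be sent to the LLM."""
    context = "\n".join(context_chunks)
    return f"Context:\n{context}\n\nQuestion: {query}"

# Index two tiny documents, then retrieve context for one query.
docs = ["pgvector stores embeddings in PostgreSQL.", "Flask serves the HTTP API."]
store = [(c, toy_embed(c)) for d in docs for c in chunk_text(d)]
top = retrieve("Where are embeddings stored?", store, k=1)
prompt = build_prompt("Where are embeddings stored?", top)
```

In the real pipeline the similarity search would run inside PostgreSQL and the prompt would be passed to the LLM; only the overall data flow is meant to carry over.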

Environment Variables

Create a .env file in the project root:

API_KEY=your_api_key
DOCS_PATH=/path/to/your/markdown/files
POSTGRES_DB=your_db
POSTGRES_HOST=localhost
POSTGRES_PORT=5432
POSTGRES_USER=user
POSTGRES_PASSWORD=password
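
As an illustration of how these variables might be consumed, the sketch below builds a libpq-style keyword/value connection string from them. `build_dsn` is a hypothetical helper name, and the project itself may load the `.env` file and connect differently.

```python
import os

REQUIRED = ("POSTGRES_DB", "POSTGRES_HOST", "POSTGRES_PORT",
            "POSTGRES_USER", "POSTGRES_PASSWORD")

def build_dsn(env=os.environ):
    """Build a keyword/value connection string from the POSTGRES_* variables.

    Assumes the values contain no spaces or quotes; such values would need quoting.
    """
    missing = [name for name in REQUIRED if not env.get(name)]
    if missing:
        raise KeyError(f"Missing environment variables: {', '.join(missing)}")
    return (
        f"host={env['POSTGRES_HOST']} port={env['POSTGRES_PORT']} "
        f"dbname={env['POSTGRES_DB']} user={env['POSTGRES_USER']} "
        f"password={env['POSTGRES_PASSWORD']}"
    )

# Example using the placeholder values from the .env template above.
example_env = {
    "POSTGRES_DB": "your_db",
    "POSTGRES_HOST": "localhost",
    "POSTGRES_PORT": "5432",
    "POSTGRES_USER": "user",
    "POSTGRES_PASSWORD": "password",
}
dsn = build_dsn(example_env)
```

Failing fast on a missing variable tends to give a clearer error than a connection failure later on.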

Virtual Environment Setup

Create a virtual environment and install the project's dependencies:

python -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Database Initialization

Run the following SQL commands in order after connecting to your PostgreSQL database.

1. Enable the pgvector Extension

This must be done once per database before using vector types.

CREATE EXTENSION IF NOT EXISTS vector;

2. Create the documents table

The embedding dimension must match your embedding model output. In this project, embeddings have 1024 dimensions.

CREATE TABLE IF NOT EXISTS documents (
    id BIGSERIAL PRIMARY KEY,
    content TEXT NOT NULL,
    embedding VECTOR(1024) NOT NULL
);
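
Because the column is declared as VECTOR(1024), an embedding of any other length is rejected on insert. The following sketch checks the length on the client side and formats the embedding in pgvector's text representation (`[x1,x2,...]`). `to_pgvector_literal` is a hypothetical helper, not part of this project or of pgvector.

```python
EMBEDDING_DIMS = 1024  # must match VECTOR(1024) in the table definition

def to_pgvector_literal(embedding, dims=EMBEDDING_DIMS):
    """Format a sequence of numbers as a pgvector text literal, e.g. '[0.1,0.2]'."""
    if len(embedding) != dims:
        raise ValueError(f"expected {dims} dimensions, got {len(embedding)}")
    return "[" + ",".join(str(float(x)) for x in embedding) + "]"

# A zero vector of the right size, just to show the shape of the output.
literal = to_pgvector_literal([0.0] * EMBEDDING_DIMS)
```

A string in this form can typically be bound as a query parameter and cast with `::vector` in an INSERT statement, assuming the database driver passes it through as text.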

3. Verify Setup

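Confirm that the table exists and that the vector dimension is correct. The vector_dims query returns a row only once the table contains at least one document.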

\d documents;

SELECT vector_dims(embedding) FROM documents LIMIT 1;

Index Documents

Before running queries, you must index your documents:

python scripts/index_documents.py

Run the API Server

Start the API server from the project root:

python run.py

A/B Testing Evaluation

This project includes an A/B evaluation interface to compare different RAG configurations (e.g. chunk size, temperature, top-k retrieval).

The system:

- Generates answers for multiple configurations
- Creates all unique pairwise comparisons per question
- Stores comparisons in PostgreSQL
- Allows human evaluators to vote
- Stores votes for later Bradley–Terry / Gaussian Process analysis
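
To give an idea of what the later Bradley-Terry analysis could look like, here is a minimal sketch that fits strengths with the classic minorization-maximization iteration. It is not part of this project: `bradley_terry` is a hypothetical function, it assumes each vote is a `(config_a, config_b, winner)` tuple whose winner is one of the two config names, and the votes below are made-up illustration data.

```python
from collections import defaultdict

def bradley_terry(votes, iterations=200):
    """Estimate normalized Bradley-Terry strengths from pairwise votes."""
    wins = defaultdict(float)    # total wins per config
    games = defaultdict(float)   # number of comparisons per unordered pair
    configs = set()
    for a, b, winner in votes:
        configs.update((a, b))
        wins[winner] += 1
        games[frozenset((a, b))] += 1
    strength = {c: 1.0 for c in configs}
    for _ in range(iterations):
        updated = {}
        for c in configs:
            # Sum n_ij / (p_i + p_j) over every opponent j that c has faced.
            denom = sum(
                n / (strength[c] + strength[o])
                for pair, n in games.items() if c in pair
                for o in pair - {c}
            )
            updated[c] = wins[c] / denom if denom else strength[c]
        total = sum(updated.values())
        strength = {c: s / total for c, s in updated.items()}
    return strength

# Made-up votes for illustration only.
toy_votes = [
    ("config_1", "config_2", "config_1"),
    ("config_1", "config_2", "config_1"),
    ("config_1", "config_2", "config_2"),
    ("config_2", "config_3", "config_2"),
    ("config_2", "config_3", "config_3"),
    ("config_1", "config_3", "config_1"),
    ("config_1", "config_3", "config_3"),
]
strengths = bradley_terry(toy_votes)
```

The returned strengths sum to 1, and a higher value means the configuration is estimated to win more often. A config that never wins ends up with strength 0 in this simple version, so a regularized variant may be preferable on sparse data.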

Database Setup for A/B Testing

Run the following commands after connecting to PostgreSQL.

CREATE TABLE IF NOT EXISTS ab_pairs (
    id SERIAL PRIMARY KEY,
    question TEXT NOT NULL,
    config_a TEXT NOT NULL,
    config_b TEXT NOT NULL,
    answered BOOLEAN DEFAULT FALSE
);

CREATE TABLE IF NOT EXISTS ab_results (
    id SERIAL PRIMARY KEY,
    question TEXT NOT NULL,
    config_a TEXT NOT NULL,
    config_b TEXT NOT NULL,
    winner TEXT NOT NULL,
    created_at TIMESTAMP DEFAULT NOW()
);

Generate Evaluation Data

You must first generate responses for all configurations. This can be done with the generate_data.ipynb notebook.

Example expected format (data.json):

[
  {
    "config": "config_1",
    "results": [
      {
        "question": "How can I search for license risks?",
        "answer": "..."
      }
    ]
  }
]
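
To make the structure concrete, the sketch below regroups data in this format by question, which is the shape a pairwise comparison needs. `answers_by_question` is a hypothetical helper, and the `config_2` entry is added purely for illustration.

```python
import json
from collections import defaultdict

def answers_by_question(data):
    """Map each question to {config name: answer} from the data.json structure."""
    grouped = defaultdict(dict)
    for entry in data:
        for result in entry["results"]:
            grouped[result["question"]][entry["config"]] = result["answer"]
    return dict(grouped)

# Inline sample in the same format as data.json (config_2 is illustrative).
sample = json.loads("""
[
  {"config": "config_1",
   "results": [{"question": "How can I search for license risks?", "answer": "..."}]},
  {"config": "config_2",
   "results": [{"question": "How can I search for license risks?", "answer": "..."}]}
]
""")
grouped = answers_by_question(sample)
```

With a real file, `sample` would instead come from `json.load` on the opened data.json.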

Place this file in the project root.

Start the A/B Testing Server

Run:

python3 -m app.api.testing_server

Then open:

http://127.0.0.1:5000/

How the A/B System works

- All unique config pairs are generated per question
- Each comparison is shown exactly once
- Votes are stored in ab_results
- Pairs are marked as answered in ab_pairs
- When all comparisons are complete, the interface stops

For 5 configs and 5 questions:

- 10 comparisons per question
- 50 total comparisons
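
These counts follow from choosing 2 configs out of 5, which gives 10 unordered pairs, repeated for each of the 5 questions. A minimal sketch of the pair generation, using a hypothetical `build_pairs` helper and placeholder names rather than the project's own code:

```python
from itertools import combinations

def build_pairs(questions, configs):
    """Return one (question, config_a, config_b) tuple per unordered config pair."""
    return [(q, a, b) for q in questions for a, b in combinations(configs, 2)]

# Placeholder names for 5 configs and 5 questions.
configs = [f"config_{i}" for i in range(1, 6)]
questions = [f"question_{i}" for i in range(1, 6)]
pairs = build_pairs(questions, configs)

print(len(pairs) // len(questions), len(pairs))  # prints "10 50"
```

Each tuple corresponds to one candidate row in ab_pairs. In general, n configs and q questions yield q * n * (n - 1) / 2 comparisons.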
