Local RAG MVP

This repository is a local-first, production-shaped RAG backend for enterprise use. It mirrors the larger AWS design with Docker services:

FastAPI API for ingestion, retrieval, answer generation, feedback, and debug traces
PostgreSQL with pgvector for vector retrieval and PostgreSQL full-text search
MinIO for raw source artifacts and future extracted assets
Ollama for optional local embeddings and answer generation
Deterministic hash embeddings as the default so the system works before any model downloads

Start the stack

docker compose up --build

API:

http://localhost:8000

Frontend:

http://localhost:5173

UI snapshots

These snapshots mirror the main frontend workflows: project setup and ingestion, document review, and grounded Q&A.

Project setup and ingestion

Create or switch projects, ingest the mounted sample folder, and upload files with ACL groups.

Document library

See indexed sources, chunk counts, and open the raw citation file for any revision.

Ask and answer

Ask a question against the selected project, tune Top K, and inspect the returned citations and diagnostics.

Swagger docs:

http://localhost:8000/docs

MinIO console:

http://localhost:9001
user: minioadmin
password: minioadmin

Ingest sample documents

Invoke-RestMethod -Method Post `
  -Uri http://localhost:8000/ingest/local `
  -ContentType application/json `
  -Body '{"acl_groups":[]}'

The API ingests files mounted from sample_docs/ into /data/input.

The frontend can also ingest the same sample folder into the selected project, upload additional files from your browser, list indexed documents, open raw citation files, and ask questions against the selected project.

Projects

List projects:

Invoke-RestMethod http://localhost:8000/projects

Create a project:

Invoke-RestMethod -Method Post `
  -Uri http://localhost:8000/projects `
  -ContentType application/json `
  -Body '{"name":"Networking"}'

Ingest the mounted sample folder into a specific project:

Invoke-RestMethod -Method Post `
  -Uri http://localhost:8000/projects/<project-id>/ingest/local `
  -ContentType application/json `
  -Body '{"acl_groups":[]}'

Upload files into a project from the frontend at http://localhost:5173.

Search

Invoke-RestMethod -Method Post `
  -Uri http://localhost:8000/search `
  -ContentType application/json `
  -Body '{"query":"How do I troubleshoot VPN error 809?","top_k":3,"groups":[]}'

Ask for an answer

Invoke-RestMethod -Method Post `
  -Uri http://localhost:8000/answer `
  -ContentType application/json `
  -Body '{"query":"How do I troubleshoot VPN error 809?","top_k":3,"groups":[]}'

If Ollama does not have the configured chat model yet, the API returns an extractive fallback from the highest-ranked chunk.

Optional: use Ollama models

Pull models into the Ollama container:

docker exec -it rag-poc-ollama ollama pull qwen2.5:0.5b
docker exec -it rag-poc-ollama ollama pull nomic-embed-text

To use Ollama embeddings, edit .env:

EMBEDDING_PROVIDER=ollama
EMBEDDING_DIM=768

Then recreate the database volume because pgvector columns are dimensioned:

docker compose down -v
docker compose up --build

MVP behavior

Ingestion does the following:

Reads supported files from the local source directory.
Stores raw bytes in MinIO.
Extracts text from Markdown, text, HTML, PDF, and DOCX.
Computes source, normalized text, and chunking fingerprints.
Creates a new document revision only when content or chunking changed.
Chunks with document title and section path context.
Embeds each chunk.
Atomically publishes the new revision and supersedes the old revision.

Retrieval does the following:

Embeds the query.
Runs pgvector cosine search.
Runs PostgreSQL full-text search.
Merges both result sets with reciprocal rank fusion.
Applies ACL group filtering before returning chunks.
Stores a retrieval trace for debugging.

Important local limitations

The default hash embedding provider is for learning and plumbing tests, not final retrieval quality.
Reranking is not implemented yet.
Image extraction and image captioning are not implemented yet.
Connectors for Document360 and SharePoint are not implemented yet.
Auth is represented by request-supplied groups for now. A real deployment should validate tokens and derive groups server-side.

Next milestones

Add a real embedding model by default.
Add a reranker stage.
Add async ingestion workers with Redis.
Add extracted image/table assets.
Add SharePoint and Document360 connectors.
Add an eval harness for retrieval quality, stale content prevention, and ACL safety.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
api		api
docs/readme-screenshots		docs/readme-screenshots
frontend		frontend
sample_docs		sample_docs
.env		.env
.env.example		.env.example
README.md		README.md
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Local RAG MVP

Start the stack

UI snapshots

Project setup and ingestion

Document library

Ask and answer

Ingest sample documents

Projects

Search

Ask for an answer

Optional: use Ollama models

MVP behavior

Important local limitations

Next milestones

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Local RAG MVP

Start the stack

UI snapshots

Project setup and ingestion

Document library

Ask and answer

Ingest sample documents

Projects

Search

Ask for an answer

Optional: use Ollama models

MVP behavior

Important local limitations

Next milestones

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages