Prometheus is a frontier-grade, multi-agent Retrieval-Augmented Generation (RAG) architecture built to autonomously execute, evaluate, and synthesize deep academic research.
Designed to overcome the limitations of standard monolithic RAG systemsβwhich often struggle with complex reasoning and hallucinate when faced with contradictory evidenceβPrometheus leverages parallel subagents, hybrid retrieval, and an LLM-powered Contradiction Engine mapped via a Neo4j Citation Graph.
A Lead Agent breaks down complex queries into mutually exclusive sub-tasks, deploying asynchronous Parallel Subagents to investigate each angle independently.
A dedicated Evidence Scorer evaluates every claim made by the subagents. If confidence falls below the threshold, it autonomously refines search keywords and triggers a retrieval retry loop.
Combines dense vector similarity (ChromaDB + sentence-transformers) with sparse keyword matching (BM25Okapi) and Reciprocal Rank Fusion (RRF) for superior document recall.
Maps claims and paper citations to a Neo4j Graph Database. An LLM Contradiction Engine cross-examines nodes to identify, flag, and resolve conflicting scientific evidence before final synthesis.
Includes an LLM-as-a-Judge benchmarking suite (run_eval.py) that mathematically evaluates the performance of the multi-agent architecture against a standard single-agent RAG baseline.
A fully responsive Streamlit dashboard visualizes the agentic workflow, confidence scores, retrieval evidence, contradiction analysis, and streams the final synthesized report in real time.
User Query
β
βΌ
Lead Agent (Task Decomposition)
β
βββ Research Angle 1
βββ Research Angle 2
βββ Research Angle 3
βββ Research Angle 4+
β
βΌ
Parallel Subagents
β
βΌ
Hybrid Retrieval Engine
(Dense + Sparse + RRF)
β
βΌ
Evidence Scorer (CRAG)
β
ββββββββ΄βββββββ
β β
High Confidence Low Confidence
β β
β Retrieval Retry Loop
β β
ββββββββ¬βββββββ
βΌ
Citation Graph
(Neo4j)
β
βΌ
Contradiction Engine
β
βΌ
Report Synthesis Agent
β
βΌ
Final Research Report
The Lead Agent analyzes the user query and creates a structured JSON research plan containing multiple independent investigation angles.
Multiple subagents simultaneously retrieve, analyze, and summarize evidence from the local corpus.
The Evidence Scorer evaluates evidence quality, relevance, and citation support.
If evidence confidence is insufficient, the system automatically reformulates queries and performs additional retrieval.
Claims and source papers are mapped into a Neo4j knowledge graph.
The Contradiction Engine identifies conflicting claims, evaluates severity levels, and generates resolution summaries.
The synthesis module compiles validated findings into a structured, citation-backed research report.
- Python 3.10+
- Docker
- NVIDIA API Key
- Neo4j Database
git clone https://github.com/NukaNarendra/Prometheus.git
cd prometheuspip install -r requirements.txtCreate a .env file:
NVIDIA_API_KEY=nvapi-your-key-here
NEO4J_URI=bolt://localhost:7687
NEO4J_USERNAME=neo4j
NEO4J_PASSWORD=passworddocker run \
--name neo4j-prometheus \
-p 7474:7474 \
-p 7687:7687 \
-e NEO4J_AUTH=neo4j/password \
-d neo4j:latestpython scripts/seed_corpus.pystreamlit run app/streamlit_app.pyPrometheus includes a comprehensive benchmarking suite that compares:
- Multi-Agent Prometheus
- Standard Single-Agent RAG
Evaluation categories:
- Evidence Coverage
- Citation Accuracy
- Contradiction Detection
- Hallucination Reduction
- Report Completeness
- Overall Research Quality
Run evaluation:
python eval/run_eval.pyResults are stored in:
eval/results/evaluation_metrics.json
Example output:
{
"multi_agent_score": 9.2,
"single_agent_score": 7.1,
"coverage_gain": "29.5%",
"hallucination_reduction": "41.3%"
}prometheus/
β
βββ app/
β βββ streamlit_app.py
β
βββ data/
β βββ corpus/
β βββ memory/
β βββ chroma_db/
β
βββ eval/
β βββ run_eval.py
β
βββ scripts/
β βββ seed_corpus.py
β
βββ src/
β β
β βββ agents/
β β βββ lead_agent.py
β β βββ subagent.py
β β βββ orchestrator.py
β β
β βββ retrieval/
β β βββ hybrid_search.py
β β βββ vector_store.py
β β βββ keyword_store.py
β β
β βββ correction/
β β βββ evidence_scorer.py
β β
β βββ synthesis/
β β βββ contradiction_engine.py
β β βββ neo4j_graph.py
β β βββ report_builder.py
β β
β βββ connectors/
β β βββ data_normalizer.py
β β
β βββ config.py
β
βββ requirements.txt
βββ .env
βββ .gitignore
βββ README.md
| Layer | Technology |
|---|---|
| LLM | NVIDIA Nemotron |
| Framework | LangChain |
| Vector DB | ChromaDB |
| Embeddings | Sentence Transformers |
| Keyword Search | BM25Okapi |
| Graph Database | Neo4j |
| UI | Streamlit |
| Evaluation | LLM-as-a-Judge |
| Retrieval Fusion | Reciprocal Rank Fusion |
- Multi-modal research (PDFs, images, figures)
- Agent memory persistence
- Cross-paper reasoning chains
- Automated hypothesis generation
- Scientific knowledge graph expansion
- Federated document retrieval
- Research paper drafting assistant
- Autonomous literature review generation
Prometheus is inspired by cutting-edge developments in:
- Multi-Agent Systems
- Retrieval-Augmented Generation (RAG)
- Corrective RAG (CRAG)
- Self-RAG
- Knowledge Graph Reasoning
- Agentic AI Research Workflows
Built using:
- LangChain
- NVIDIA AI Endpoints
- Neo4j
- ChromaDB
- Sentence Transformers
- Streamlit
MIT License
Copyright (c) 2026
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files to deal in the Software without restriction.