This repo shares:
- the code behind the project (Coming Soon)
- a simplified version of the prompts used to implement the RAG pipeline
- the benchmark used to evaluate the pipeline
Repository contents:
- `Code/` – The main code (Coming Soon)
- `benchmark.json` – The benchmark dataset in JSON format.
- `benchmark_reader.py` – Python code for reading and validating the benchmark.
- `APPENDIX D -- prompts.py` – Reference implementation of the prompt templates used in the study.
This benchmark structure is designed to be extensible: you can add Q&A datasets for any XR platform and toolkit. However, this repository currently includes only one dataset, with Unity as the platform and XRI version 2 as the toolkit.
The repository also includes a Python utility script, `benchmark_reader.py`, for loading, validating, and querying the dataset.
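As a rough illustration of what loading the dataset looks like (the exact API of `benchmark_reader.py` is not reproduced here, so the `load_benchmark` helper below is a hypothetical sketch using only the standard library):

```python
import json

def load_benchmark(path="benchmark.json"):
    """Load the benchmark JSON file into a plain dictionary."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)

# Read the top-level metadata described below.
benchmark = load_benchmark()
info = benchmark["benchmark_info"]
print(f'{info["name"]} v{info["version"]} ({info["date"]})')
```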
The benchmark is organized as a hierarchy:
- `benchmark_info` – General metadata.
- `platforms[]` – E.g., Unity, Web (Mock).
  - `toolkits[]` – E.g., XRIv2, MRTK3 (Mock), A-Frame (Mock).
    - `dataset` – List of Q&A pairs, with optional metadata.

For example:
```json
{
  "benchmark_info": {
    "name": "XRI-benchmark",
    "description": "Text-based, Q&A Benchmark for Virtual Reality applications...",
    "version": "0.1",
    "date": "2024-09-15",
    "author": "CG3HCI (https://cg3hci.dmi.unica.it/lab/)",
    "email": "jacopo.mereu@unica.it"
  },
  "platforms": [
    {
      "name": "Unity",
      "toolkits": [
        {
          "name": "XRIv2",
          "dataset": [
            {
              "question": "What is ... ?",
              "answer": "... is a ...",
              "metadata1": "A value",
              ...
              "metadataN": "Another value"
            }
          ]
        }
      ]
    }
  ]
}
```
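To make the traversal of this hierarchy concrete, here is a hedged sketch that reuses the `benchmark` dictionary from the loading example above; `iter_qa_pairs` and the appended mock toolkit are illustrative, not part of the repository:

```python
def iter_qa_pairs(benchmark):
    """Yield (platform, toolkit, question, answer) for every Q&A entry."""
    for platform in benchmark["platforms"]:
        for toolkit in platform["toolkits"]:
            for entry in toolkit["dataset"]:
                yield (platform["name"], toolkit["name"],
                       entry["question"], entry["answer"])

for platform, toolkit, question, answer in iter_qa_pairs(benchmark):
    print(f"[{platform}/{toolkit}] Q: {question}")

# Extending the benchmark is just appending to the arrays: a hypothetical
# mock MRTK3 dataset for Unity would be one more entry in "toolkits".
unity = next(p for p in benchmark["platforms"] if p["name"] == "Unity")
unity["toolkits"].append({
    "name": "MRTK3(Mock)",
    "dataset": [{"question": "What is ... ?", "answer": "... is a ..."}],
})
```

Because every platform and toolkit follows the same three-level shape, the loop above needs no changes when new datasets are added.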