Skip to content

Commit cfd7467

Browse files
authored
Merge pull request #1391 from asimurka/disable_rag_annotations
LCORE-1498: Disable annotation in root config
2 parents 068411f + 53a93f1 commit cfd7467

File tree

2 files changed

+11
-0
lines changed

2 files changed

+11
-0
lines changed

docs/rag_guide.md

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -20,6 +20,7 @@ This document explains how to configure and customize your RAG pipeline using th
2020
* [Add an Inference Model (LLM)](#add-an-inference-model-llm)
2121
* [Complete Configuration Reference](#complete-configuration-reference)
2222
* [System Prompt Guidance for RAG (as a tool)](#system-prompt-guidance-for-rag-as-a-tool)
23+
* [Llama Stack RAG annotations](#llama-stack-rag-annotations)
2324
* [References](#references)
2425

2526

@@ -493,6 +494,12 @@ When using RAG, the `knowledge_search` tool must be explicitly referenced in you
493494
```text
494495
You are a helpful assistant with access to a 'knowledge_search' tool. When users ask questions, ALWAYS use the knowledge_search tool first to find accurate information from the documentation before answering.
495496
```
497+
498+
---
499+
# Llama Stack RAG annotations
500+
501+
The top-level `vector_stores` block in Llama Stack configuration may include `annotation_prompt_params` to control whether Llama Stack injects extra RAG annotation instructions into the model prompt (for example, citation-style markers). The [`run.yaml`](../run.yaml) in this repository sets `enable_annotations: false` under that block to avoid unwanted annotations. For a configuration that enables annotations and customizes the instruction template, see [`examples/run.yaml`](../examples/run.yaml).
502+
496503
---
497504

498505
# References

run.yaml

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -142,6 +142,10 @@ registered_resources:
142142
# Without it, llama-stack's rag-runtime silently fails all file_search operations
143143
# with no error logged.
144144
vector_stores:
145+
# LCORE-1498: Disables Llama Stack RAG annotation generation
146+
# causing unwanted citation/file markers in model output.
147+
annotation_prompt_params:
148+
enable_annotations: false
145149
default_provider_id: faiss
146150
default_embedding_model: # Define the default embedding model for RAG
147151
provider_id: sentence-transformers

0 commit comments

Comments
 (0)