Naveo

STEP 16 / 22

B1 TOOL-SCHEMA

DEFINE THE SCHEMA

Write the JSON Schema for the rag_chain tool.

TOOL PURPOSE

Design a 3-stage RAG (Retrieval-Augmented Generation) pipeline to
answer technical questions using the ship's internal manual as the
source.

The 3 stages:

1. retrieve(query) → fetches the 3 most relevant fragments from the
   manual (via vector search). Output: { snippets: [{text, source}, ...] }.

2. augment(query, snippets) → builds the prompt that combines the
   user's question with the retrieved snippets as context. Output:
   { augmented_prompt: string }.

3. generate(augmented_prompt) → generates the final answer using the
   augmented prompt. Output: { answer: string, citations: [source, ...] }.

Format: YAML with a `steps` array (same schema as step 02's chain).
Each step declares: id, prompt (or function), input, output_key.

EXAMPLE INVOCATIONS

rag_chain.run("What's the nominal coolant pressure in bay 4?") → runs retrieve→augment→generate and returns { answer, citations }

YOUR SCHEMA

662 CHARACTERS

5 CRITERIA

GUEST MODE

You're viewing this lesson as a guest. To save your progress, earn XP, and keep your streak, sign in when you're ready to check.

Costs 1 heart

RAG: the model doesn't have to know everything by heart

Models have knowledge frozen at training. everything that happened after, or everything internal to your organization, they don't know. Two ways to give them access:

Tools (Track 3). the model invokes functions that fetch info in real time.
RAG. before generating, you fetch relevant data and inject it into the prompt as context.

RAG is the dominant choice when:

You have a large corpus (manuals, docs, transcripts).
Questions are about content, not actions.
You want answers to come with citations (verifiable).

The three stages

code

[user query]
        ↓
   retrieve   (vector search over the corpus → top-K snippets)
        ↓
   augment    (composes a prompt: snippets + original question)
        ↓
   generate   (LLM answers using the augmented prompt, citing sources)
        ↓
[answer + citations]

Your job: design the YAML config of the pipeline. Three steps, chained by output_keys.

The trap that breaks RAG

The most common problem isn't retrieval. it's that the generate doesn't handle the "snippets aren't relevant to the question" case. When that happens, if you don't tell the model what to do, it invents an answer with bits from the snippets. That's worse than not answering: it looks confident, carries fake citations, and the user doesn't notice.

Practical rule: in the generator's prompt, always include something like "If the snippets don't contain relevant information, say you didn't find that in the manual." It's the difference between honest RAG and hallucinating RAG.

How it's evaluated

5 LLM-judge criteria:

retrieve declared as a tool/function call (not LLM).
augment receives question + snippets, combines them explicitly.
generate uses the augmented_prompt (not the raw question).
generate asks for citations from the snippets.
generate handles "empty / not-relevant snippets" without hallucinating.