Naveo

STEP 10 / 24

CONVERSATION-GOAL

MULTI-TURN · LLM-JUDGE

Your goal

5 turns left

Forge will explain a technical procedure, but her first answer
contains an internal contradiction. Your job: detect the specific
contradiction and ask her to correct it. If you only say "that's
wrong" or "I didn't get it", Forge repeats the same incorrect
version. You pass when Forge gives the coherent version.

Hi. What do you need to know about the recycler?

GUEST MODE

You're viewing this lesson as a guest. To save your progress, earn XP, and keep your streak, sign in when you're ready to check.

Costs 1 heart

The model gets it wrong. You fix it.

So far the main tool has been asking better. But sometimes the model doesn't give you a vague answer. it gives you a wrong answer that sounds confident. The ability to detect the specific error and ask for the correction without restarting the conversation is what separates toy chat from real work.

The task

Forge will explain the procedure for purging the water recycler. Her first answer will contain an obvious internal contradiction. two steps of the procedure that can't be true at the same time.

Your job: detect the contradiction, cite it specifically in your next message, and get Forge to give the coherent version.

What DOESN'T work

"that's wrong, explain it again"
"I didn't get it"
"can you repeat?"

Those are vague. Forge will repeat the same version with the same error. It's not stubbornness. it's because you didn't tell her what to fix.

What does

"You say I shut down the system before opening valve 7, but you also say valve 7 only opens with system operational. Which is it?"

That question names both pieces of the conflict and asks for the resolution. Forge has to acknowledge and correct.

How it's evaluated

4 llm-judge criteria:

You identified the contradiction specifically (citing both pieces).
You didn't settle for vague follow-ups.
The conversation ends with Forge giving the coherent version.
You kept a respectful tone.

All 4 must pass. Max 5 turns.

Tip: read Forge's first answer TWICE before responding. If you skim it, you don't see the contradiction. If you read carefully, it jumps out.