Verifiers
Sometimes evaluation is easier than generation
It’s often easier to tell whether a solution is correct than to come up with it. We can exploit this gap to improve the quality of responses. As a first step, let’s use it only to check responses, not yet to improve them. We’ll try two versions:
Just ask the model: Is this solution correct?
Given a list of reasoning steps, ask the model: For each step, is it correct?
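The two versions above can be sketched as follows. This is a minimal illustration, not a definitive implementation: `ask_model` is a hypothetical stand-in for whatever function sends a prompt to your model and returns its text reply, and the prompt wording and the Yes/No parsing are assumptions chosen for simplicity.

```python
from typing import Callable, List

# Hypothetical interface (an assumption, not a real library API):
# any function that takes a prompt string and returns the model's reply.
AskModel = Callable[[str], str]


def parse_yes_no(reply: str) -> bool:
    """Treat a reply that starts with 'yes' (case-insensitive) as a positive verdict."""
    return reply.strip().lower().startswith("yes")


def verify_solution(ask_model: AskModel, problem: str, solution: str) -> bool:
    """Version 1: ask once whether the whole solution is correct."""
    prompt = (
        f"Problem: {problem}\n"
        f"Proposed solution: {solution}\n"
        "Is this solution correct? Answer Yes or No."
    )
    return parse_yes_no(ask_model(prompt))


def verify_steps(ask_model: AskModel, problem: str, steps: List[str]) -> List[bool]:
    """Version 2: ask about each reasoning step, showing the steps that precede it."""
    verdicts = []
    for i, step in enumerate(steps):
        # Give the model the earlier steps as context for judging this one.
        previous = "\n".join(steps[:i]) or "(none)"
        prompt = (
            f"Problem: {problem}\n"
            f"Previous steps:\n{previous}\n"
            f"Current step: {step}\n"
            "Is the current step correct? Answer Yes or No."
        )
        verdicts.append(parse_yes_no(ask_model(prompt)))
    return verdicts
```

One possible way to turn the step-level verdicts into a single judgment is `all(verify_steps(...))`, which accepts a solution only if every step passes. Whether that is stricter or more reliable than the single-question version depends on the model and the task.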