keetcode/wrong_questions.json at main · smith-source/keetcode · GitHub

1
2
3
4
5
6
7
8
9
10
11
12
13
14
{
  "choice": {},
  "judge": {},
  "fill": {
    "1": {
      "ask": "在 ALFWorld 任务中，ReAct 代理通过______和_______来解决任务，而 BUTLER 代理则通过______来解决任务。",
      "answer": "少样本提示 (few-shot prompting)，推理与行动 (reasoning and acting)，模仿学习 (imitation learning)",
      "analyze": "论文 Section 4 'ALFWorld' 部分提到：'To prompt ReAct, we randomly annotate three trajectories from the training set for each task type... For baselines, we use BUTLER (Shridhar et al., 2020b), an imitation learning agent trained on 10^5 expert trajectories for each task type.' 这直接对比了 ReAct 和 BUTLER 的训练/学习方式。",
      "value": 5,
      "user_answer": "bu bu"
    }
  },
  "problem": {}
}