Large Language Models as Optimizers

Source: arXiv Authors: Chengrun Yang, Xuezhi Wang, Yifeng Lu, Hanxiao Liu, Quoc V. Le, Denny Zhou, Xinyun Chen Date: 2023-09-07 Primary category: cs.LG All categories: cs.LG, cs.AI, cs.CL

Abstract / key passage

This paper introduces Optimization by PROmpting (OPRO), where a language model acts as a black-box optimizer that proposes new candidate solutions from a prompt containing previously tried solutions and their scores. In the prompt-optimization setting, the candidates are task instructions whose downstream task performance is evaluated and fed back into the next optimization step.

Harness takeaway

OPRO is the clean canonical reference for the “LLM as optimizer” branch. For harnesses it is useful whenever there is a scoreable artifact and little structured critique, but it also highlights how much optimization can proceed with almost nothing except candidate history and scalar value feedback.

Agent Harness Wiki

Browse

Large Language Models as Optimizers

Abstract / key passage

Harness takeaway

Graph View

Table of Contents

Backlinks