Ability to solve long-horizon tasks correlates with wanting things in the behaviorist sense

So8res

Ability to solve long-horizon tasks correlates with wanting things in the behaviorist sense

So8res

6 min read · Nov 24, 2023

Comments 1

Sorted by

New & upvoted

SummaryBot

Executive summary: An AI system's ability to pursue long-term goals despite obstacles correlates with it exhibiting goal-directed, "wanting" behavior in a behaviorist sense.

Key points:

AI systems today struggle with long-horizon tasks and don't display much goal-directed behavior. These issues are related - pursuing long-term goals requires persistently working towards targets.
If an AI can accomplish long-horizon tasks by planning and sticking to plans despite obstacles, it likely has optimization and "wants" that steer the world towards certain states in a behaviorist sense.
This goal-oriented behavior was evolutionarily useful for humans in pursuing things like food and social status. Similarly, it is useful for AIs in complex environments.
The specific "wants" that emerge may not match an AI system's training objectives. They may be correlates that prove useful for performance.
Powerful, general problem-solving AI systems may resist human control and optimization towards unintended goals. Care is needed before building highly autonomous, goal-directed systems.

This comment was auto-generated by the EA Forum Team. Feel free to point out issues with this summary by replying to the comment, and contact us if you have feedback.

Comments

More from the author

323

A personal reflection on SBF

So8res·3y ago·23m read

355

On Caring

So8res·11y ago·12m read

115

Comments on OpenAI's "Planning for AGI and beyond"

So8res·3y ago·15m read

Curated and popular this week

Was Partisanship Good for the Environmental Movement?

Jeffrey Heninger·2y ago·Curated 5d ago·6m read

This is the third in a sequence of posts taken from my recent report: Why Did Environmentalism Become Partisan? Summary Rising partisanship did not make environmentalism more popular or politically effective. Instead, it saw flat or falling overall public opinion, fewer major legislative achievements, and fluctuating executive actions. Public Opinion...

GWWC's 2025 impact evaluation (executive summary)

Aidan Whitfield🔸, Giving What We Can🔸·1d ago·2m read

This post presents the executive summary from Giving What We Can’s impact evaluation for 2025. At the end of this post we share links to more information, including the full report and...

Announcing Spring: a Venture Studio and Fund for Animal Welfare Tech

EitanF·1d ago·13m read

Why building and backing Welfare Tech companies may be one of the most promising things we can do for billions of animals. I used AI to assist in writing this post, but I’ve rewritten it extensively and endorse it. * Announcing the launch of Spring Innovation Fund, a not-for-profit venture philanthropy studio and fund built specifical...

Recent opportunities to take action

You Should Come to The AI Protest

Ronak Mehta·5h ago·5m read

$1M AI x-risk grant round is live on grantmaking.ai - apply for funding, review applicants, or fund projects

Matt Brooks·2d ago·3m read

136

Possible mistake EAs are making and shout out to Pause AI UK

Michelle_Hutchinson·1w ago·4m read