Sentient Welfare Across Three Futures

MichaelDickens

Sentient Welfare Across Three Futures

MichaelDickens

2 min readMay 25

Comments 2

Sorted by

New & upvoted

James Faville

1mo

Some objections:

If timelines are long...
...we can prioritize work that takes a long time to complete

In short timelines, we might have a small amount of calendar time with which to work on more difficult tasks, but a large amount of thinking/research time courtesy of automated AI researchers / research assistants. This might be important to the value of things like foundational research under short timelines.

If we're on track to solving AI alignment...
...the shape of the future will be determined by an aligned ASI

Solving the alignment problem does not necessarily mean "the shape of the future will be determined by an aligned ASI". For example, an aligned earth-originating ASI might run into a more powerful alien AI; and the shape of the future would be determined by the alien AI. In that case, we care about the long-term effects of earth-originating AI only insofar as they influence the decisions of the alien one. More prosaically, if we solve intent-alignment but someone uses this to launch a coup the resulting ASI might not be aligned to the interests of all humanity / good values.

If we're not on track to solving AI alignment...
...none of those other types of work listed above will pay off

This isn't always true. For example, decision theory research could inform interventions on unaligned ASIs that make the future go better for sentient beings in expectation, without us having solved the full alignment problem.

Benton 🔸

1mo

Another thing I think we should focus on if we are on track to solve alignment is concentration of power. Aligned ASI would make this problem more important.

Comments

More from the author

A frontier AI company should shut down

MichaelDickens·1mo ago·3m read

Worlds where we solve AI alignment on purpose don't look like the world we live in

MichaelDickens·4mo ago·6m read

The Future Will Be Weirder Than That

MichaelDickens·3mo ago·8m read

Curated and popular this week

Counting animals: Stable population size is not equivalent to priority level

abrahamrowe, mal_graham🔸·1w ago·Curated 6d ago·16m read

AI Use Note: Main body text entirely human written. Claude (Opus 4.8) helped develop models of animal life histories in the appendix. Cross-posted from Good Structures. Executive Summary * Animal advocates sometimes make claims like “there are X of this animal...

How (not) to fundraise from Anthropic staff

Jack Lewars·6d ago·7m read

Adapted from my Substack, Funding Anthropalypse. Short version: if you want a share of the coming Anthropic and OpenAI windfall - the $37bn+ that could be in play next year - the way in is to become 'legibly excellent', so the evaluators and donors that frontier lab staff already trust point them to yo...

If you're agentic, work in biosecurity

sharmaayushmaan🔸·4d ago·7m read

Disclaimer: Although I work on the Groups Team at CEA, I’m writing this in a personal capacity, and this post does not constitute an endorsement by CEA. Agency - the realisation that you really can just do things. TL;DR Biosecurity needs people (of any background) who are agentic and have a high execution velocity and track record....

Recent opportunities to take action

Marginal Victories: career advising and opportunities for U.S. democracy preservation & political work

Annika Burman 🔸·2d ago·2m read

Starting an EA group @ SUNY Binghamton

micahzarin·1d ago·1m read

I'm stepping down as Hive's Executive Director, and we're hiring my successor

SofiaBalderson, Hive·2d ago·3m read

Which future are you betting on?

Some plans make strong assumptions without making them explicit. When you pursue a strategy, you're making an implicit bet on which future you'll find yourself in. You're assuming that you live in the world where that strategy makes most sense.

It's worth taking the time to probe our beliefs:

What do we expect the future to look like, and what strategies make sense given those expectations?

What are we currently working on? In which futures does that work pay off?

At the community level, we shouldn't bet everything on one future. (For individuals, it's often better to specialize.^[1]) Some people should pursue long-timelines work; others should prioritize optimistic short-timelines work; still others should focus on pessimistic short timelines. It's worth considering what this balance ought to look like, and how we might get closer to the right balance.

A natural next question: What plausible futures are we neglecting? That's a question I want to spend more time thinking about.

Individuals benefit from developing expertise over time. In most fields, it takes more than 80,000 person-hours for diminishing marginal utility of effort to kick in. The gains of increasing expertise outweigh the diminishing utility of marginal work. ↩︎