I tend to disagree with most EAs about existential risk from AI. Unfortunately, my disagreements are all over the place. It's not that I disagree with one or two key points: there are many elements of the standard argument that I diverge from, and depending on the audience, I don't know which points of disagreement people think are most important.
I want to write a post highlighting all the important areas where I disagree, and offering my own counterarguments as an alternative. This post would benefit from responding to an existing piece, along the same lines as Quintin Pope's article "My Objections to "We're All Gonna Die with Eliezer Yudkowsky"". Unlike that piece, however, mine would be intended to address the EA community as a whole, since I'm aware many EAs already disagree with Yudkowsky even if they buy the basic arguments for AI x-risk.
My question is: what is the current best single article (or set of articles) that provides a well-reasoned and comprehensive case for believing there is a substantial (>10%) probability of an AI catastrophe this century?
I was considering replying to Joseph Carlsmith's article, "Is Power-Seeking AI an Existential Risk?", since it seemed reasonably comprehensive and representative of the concerns EAs have about AI x-risk. However, I'm a bit worried that the article is not very representative of EAs who assign substantial probabilities to doom, since he originally estimated the total risk of catastrophe at only 5% before 2070. In May 2022, Carlsmith changed his mind and reported a higher probability, but I am not sure whether this is because he was exposed to new arguments, or because he simply came to think the original arguments were stronger than he had initially judged.
I suspect I have both significant moral disagreements and significant empirical disagreements with EAs, and I want to include both in such an article, while mainly focusing on the empirical points. For example, I have the feeling that I disagree with most EAs about:
- How bad human disempowerment would likely be from a utilitarian perspective, and what "human disempowerment" even means in the first place
- Whether there will be a treacherous turn event, during which AIs violently take over the world after previously having been behaviorally aligned with humans
- How likely AIs are to coordinate near-perfectly with each other as a unified front, leaving humans out of their coalition
- Whether we should expect AI values to be "alien" (like paperclip maximizers) in the absence of extraordinary efforts to align them with humans
- Whether the AIs themselves will be significant moral patients, on par with humans
- Whether there will be a qualitative moment when "the AGI" is created, rather than systems incrementally getting more advanced, with no clear finish line
- Whether we get only "one critical try" to align AGI
- Whether "AI lab leaks" are an important source of AI risk
- How likely AIs are to kill every single human if they are unaligned with humans
- Whether there will be a "value lock-in" event soon after we create powerful AI that causes values to cease their evolution over the coming billions of years
- How bad problems related to "specification gaming" will be in the future
- How society is likely to respond to AI risks, and whether it will sleepwalk into a catastrophe
However, I also disagree with points made by many other EAs who have argued against the standard AI risk case. For example:
- I think AIs will eventually become vastly more powerful and smarter than humans, and so will eventually be able to "defeat all of us combined"
- I think a benign "AI takeover" event is very likely even if we align AIs successfully
- I think AIs will likely be goal-directed in the future. I don't think, for instance, that we can just "not give the AIs goals" and then everything will be OK
- I think it's highly plausible that AIs will end up with substantially different values from humans (although I don't think this will necessarily cause a catastrophe)
- I don't think we currently have strong evidence that deceptive alignment will be an easy problem to solve
- I think it's plausible that AI takeoff will be relatively fast, with the world dramatically transformed over a period of several months or a few years
- I think short timelines, meaning a dramatic transformation of the world within the next 10 years, are pretty plausible
I'd like to elaborate on as many of these points as possible, preferably by responding to direct quotes from the representative article arguing for the alternative, more standard EA perspective.
I think you misunderstood the points I was making. Sorry for writing an insufficiently clear comment.
Agreed; that's why I wrote "0.1% to 0.01% reduction in p(doom) per year". I wasn't talking about the absolute level of doom here. I edited my comment to say "0.1% to 0.01% reduction in p(doom) per year of delay", which is hopefully clearer. The expected absolute level of doom is probably notably higher than 0.1% to 0.01%.
I don't. That's why I said "Similarly, I would potentially be happier to turn over the universe to aliens instead of AIs."
Also, note that I think AI takeover is unlikely to lead to extinction.
ETA: I'm pretty low confidence about a bunch of these tricky moral questions.
I would be reasonably happy (e.g. 50-90% of the value relative to human control) to turn the universe over to aliens. The main reduction in value is due to complicated questions about the likely distribution of values among aliens. (E.g., how likely is it that aliens are very sadistic or lack empathy? This is probably still not exactly the right question.) I'd also be pretty happy with (e.g.) uplifted dogs (dogs which are made to be as intelligent as humans while keeping the core of "dog", whatever that means) so long as the uplifting process was reasonable.
I think the exact same questions apply to AIs; I just have empirical beliefs that AIs which end up taking over are likely to do predictably worse things with the cosmic endowment (e.g. realizing 10-30% of the value). This doesn't have to be true; I can imagine learning facts about AIs which would make me feel a lot better about AI takeover. Note that conditioning on the AI taking over is important here. I expect to feel systematically better about smart AIs with long-horizon goals which are either not quite smart enough to take over or don't take over (for various complicated reasons).
More generally, I think I basically endorse the views here, which discuss the question of when you should cede power, etc.
Note that in my ideal future it seems really unlikely that we end up spending a non-trivial fraction of future resources running literal humans instead of finding better things to spend computational resources on (e.g. beings with experiences that are wildly better than our experiences, or beings which are vastly cheaper to run).
(That said, we can and should let all humans live for as long as they want and dedicate some fraction of resources to basic continuity of human civilization insofar as people want this. 1/10^12 of the resources would easily suffice from my perspective, but I'm sympathetic to making this more like 1/10^3 or 1/10^6.)
I think "identify" is the wrong word from my perspective. The key question is "what would the smart behavioral clone do with the vast amount of future resources". That said, I'm somewhat sympathetic to the claim that this behavioral clone would do basically reasonable things with future resources. I also feel reasonably optimistic about pure imitation LLM alignment for somewhat similar reasons.
Am I ignoring this case? I just think we should treat "what do I terminally value?"[1] and "what is the best route to achieving that?" as mostly separate questions. So, we should talk about whether "high discount rates due to epistemic uncertainty" is a good reasoning heuristic for achieving my terminal values separately from what my terminal values are.
Separately, I think a high per-year discount rate due to epistemic uncertainty seems pretty clearly wrong. I'm pretty confident that I can influence, at least to a small degree (e.g. I can affect the probability by >10^-10, probably much more), whether or not the moral equivalent of 10^30 people are tortured in 10^6 years. It seems like a very bad idea from my perspective to put literally zero weight on this because of a 1% annual discount rate.
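(To make the arithmetic concrete, as a rough illustration using the numbers above: a 1% annual discount compounded over 10^6 years leaves a weight of

$$0.99^{10^6} = e^{10^6 \ln 0.99} \approx e^{-10050} \approx 10^{-4365},$$

so even an expected stake on the order of 10^-10 × 10^30 = 10^20 person-equivalents would be multiplied down to effectively nothing under that discount.)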
For less specific things like "does a civilization descended from and basically endorsed by humans exist in 10^6 years", I think I have considerable influence. E.g., I can affect the probability by >10^-6 (in expectation). (This influence is distinct from the question of how valuable this is to influence, but we were talking about epistemic uncertainty here.)
My guess is that we end up with basically a moderate fixed discount on very-long-run future influence due to uncertainty over how the future will go, but this is more like 10% or 1% than 10^-30. And, because the long-run future still dominates in my views, this just multiplies through all calculations and ends up not mattering much for decision making. (I think acausal trade considerations implicitly mean that I would be willing to trade off long-run considerations in favor of things which look good as weighted by current power structures (e.g. helping homeless children in the US) if I had a 1,000x-10,000x opportunity to do this. E.g., if I could stop 10,000 US children from being homeless with a day of work and couldn't do direct trade, I would still do this.)
[1] More precisely, what would my CEV (Coherent Extrapolated Volition) want, and how do I handle uncertainty about what my CEV would want?