Context: A slew of slightly disorganized thoughts I’ve had in mind for some time, but I’m not sure how obvious the ideas are to others. Relatively unpolished but posting anyway.

Years ago, I did a research project on value drift in EA. One of the things I asked participants was what would make them leave the movement. A couple of them mentioned the possibility of the EA movement’s values drifting.

At the time, this seemed kind of weird to me. Where could EA values drift to anyway? The only two values I could think of that define EA are effectiveness and altruism, and it seemed weird that we wouldn’t notice if one or both fell through.

As I’ve watched the movement change and grow over the past six years, I’ve realized it’s a little more complicated than that. There are lots of different ways people’s values affect how they pursue Effective Altruism. And I can’t help but notice that the values of the core of the EA movement are shifting. I’m hardly the first person to point out that things are changing, but most of what I’ve seen focuses on the increased funding. I see a few more differences:

  • Longtermism — and perhaps implicitly, x-risk avoidance — as a dominant value, if not the only value. (This one’s been elaborated on at length already.)
  • Less risk aversion. EA’s drastically increased involvement in policy and politics seems like the most obvious example, but in my opinion the poor optics of highly-paid junior roles at “altruistic” nonprofits are an underrated risk too.
  • Similarly, a smaller proportion of funding going towards evidence-backed ideas, and more funding for long-shot projects, where much or most of the expected value comes from a small chance of big success.
  • Less impartiality. This is the one that concerns me the most. As more funding goes to community-building, a lot of money seems to be going to luxuries EAs don’t need, which feels very far from EA’s origins.

I’m not saying these changes are all bad, but they certainly do not reflect the values of the movement as a whole, and I want things to stay that way. But I worry that as the movement moves more and more firmly toward these values, others might be ostracized for staying where the movement was ten years, five years, or even one year ago. 

My main point is: I think EAs think that leaving EA is value drift, and value drift is bad, and therefore leaving EA is a bad thing. But with the changes we’re seeing, people might be leaving EA as a means of staying within their values. And that’s okay.

By no means do I think that old EA values are superior to new ones. I’m grateful for the people challenging the status quo in EA and leading to progress. But I also want to make sure we still value the community members who don’t change their minds because of what’s trending in EA, and who still hold the values that we thought were good values 10 years ago and are likely still good values today.

Comments

I'm in agreement with most of your post, except for one thing: calling these things changes to our values.

The following is the beginnings of a chain of thinking that isn't fully fleshed out, but I hope is useful. All word choices are probably suboptimal. I don't hold the implications of these views very strongly or at all, I'm mostly trying to puzzle things out and provide arguments I think are somewhat strong or compelling.

All the things you mention don't seem like values to me - they seem more like strategies or approaches to doing good.

"Core" values are things like truth-seekingness, epistemic humility or maximizing impact or something, whereas for example "cause neutrality" and by extension "longtermism" are downstream of that. 

But we also have "secondary values" (terrible wording), which are influenced by our core values and our worldview and specific (cognitive) beliefs about how the world works (these influence each other but are somewhat independent).

I can see a version of EA where the core values -> longtermism chain becomes replaced with just longtermism as a default (just as in current EA the core values -> helping people in developing countries chain is something of a default - I don't think people very often come into EA strongly opposing this value set - this isn't a bad thing; these are the low-hanging fruit).

Why are core & secondary values important to distinguish?

  1. People who are on board with the changes do not see the shifted values as conflicting with the core values; they see them as a natural progression of the core values. Just as we thought that "everyone matters" leads to "donate to help improve the lives of poor people in developing countries", so too does "everyone matters" lead to "future people should be our priority".
    1. Implication: people reading this post may say "this isn't value drift"
  2. I think our core values are really important and the real glue of our community, a glue that will withstand the test of time and ideally let us adapt, change, and grow as we get new information and update our beliefs.
    1. Maybe this is too idealistic, and in practice saying "but we share the same core values", even if true, is simply not enough.
    2. In practice, the level of secondary values can be more useful: maybe technical AI safety researchers and farmed animal welfare advocates just don't have that much in common, or the inferential distance is a bit too much in terms of their models of the world, impact, risk aversion, etc.

Maybe related is that even for ideal expected utility maximizers, values and subjective probabilities are impossible to disentangle by observing behavior. So it's not always easy to tell what changes are value drift vs epistemic updates.

While I understand the point you're making, the comment you linked is (to my non-STEM mind) pretty hard to parse. Would you be able to give a less technical, more ELI5 explanation?

Sure, here's the ELI12:

Suppose that there are two billionaires, April and Autumn. Originally they were funding AMF because they thought working on AI alignment would be 0.01% likely to work and solving alignment would be as good as saving 10 billion lives, which is an expected value of 1 million lives, lower than you could get by funding AMF.

After being in the EA community a while they switched to funding alignment research for different reasons.

  • April updated upwards on tractability. She thinks research on AI alignment is 10% likely to work, and solving alignment is as good as saving 10 billion lives.
  • Autumn now buys longtermist moral arguments. Autumn thinks research on AI alignment is 0.01% likely to work, and solving alignment is as good as saving 10 trillion lives.

Both of them assign the same expected utility to alignment: 1 billion lives. As such they will make the same decisions. So even though April made an epistemic update and Autumn a moral update, we cannot distinguish them from behavior alone.
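To make the arithmetic concrete, here is a minimal Python sketch of the April/Autumn numbers above (the function and variable names are my own, purely for illustration):

```python
# Expected value of funding alignment research, in lives saved.
# Numbers are taken from the April/Autumn example above.

def expected_lives_saved(p_success: float, lives_if_success: float) -> float:
    return p_success * lives_if_success

# April: epistemic update, i.e. higher tractability, same moral weight.
april = expected_lives_saved(p_success=0.10, lives_if_success=10e9)      # 10% of 10 billion

# Autumn: moral update, i.e. same tractability, much larger moral weight.
autumn = expected_lives_saved(p_success=0.0001, lives_if_success=10e12)  # 0.01% of 10 trillion

print(april, autumn)  # both 1e9 (1 billion lives): same expected utility, so same funding decision
```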

This extends to a general principle: actions are driven by a combination of your values and subjective probabilities, and any given action is consistent with many different combinations of utility function and probability distribution.

As a second example, suppose Bart is an investor who makes risk-averse decisions (say, invests in bonds rather than stocks). He might do this for two reasons:

  1. He would get a lot of disutility from losing money (maybe it's his retirement fund)
  2. He irrationally believes the probability of losing money is higher than it actually is (maybe he is biased because he grew up during a financial crash).

These different combinations of probability and utility inform the same risk-averse behavior. In fact, probability and utility are so interchangeable that professional traders - just about the most calibrated, rational people with regard to the probability of losing money, and who are only risk-averse for reason (1) - often model financial products as if losing money is more likely than it actually is, because it makes the math easier.
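And a similarly hedged sketch of the Bart example: the gamble, the 100,000 starting wealth, and the two utility functions are my own toy assumptions, chosen only to show that a concave utility with accurate beliefs and a linear utility with an inflated loss probability can produce the same bonds-over-stocks choice.

```python
import math

# Toy gamble: stocks lose 60,000 with some probability, otherwise gain 10,000.
# Bonds return a sure 2,000. All figures are changes to Bart's 100,000 savings.
WEALTH = 100_000

def expected_utility(p_loss, loss, gain, utility):
    """Expected utility of the risky option under a given belief and utility function."""
    return p_loss * utility(-loss) + (1 - p_loss) * utility(gain)

# Reason 1: accurate belief (10% chance of loss) but concave utility over wealth,
# i.e. losses genuinely hurt more than gains help.
def log_utility(delta):
    return math.log(WEALTH + delta)

stocks_1 = expected_utility(0.10, 60_000, 10_000, log_utility)
bonds_1 = log_utility(2_000)

# Reason 2: risk-neutral (linear) utility but an exaggerated belief about losing (25%).
def linear_utility(delta):
    return float(delta)

stocks_2 = expected_utility(0.25, 60_000, 10_000, linear_utility)
bonds_2 = linear_utility(2_000)

# With accurate beliefs AND linear utility, stocks would win (EV 3,000 > 2,000);
# either concave utility or an inflated loss probability is enough to flip the choice.
print(bonds_1 > stocks_1, bonds_2 > stocks_2)  # True True: same behavior, different reasons
```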

Thanks, this is helpful, and potentially a useful top-level post.

Very valid! I guess I'm thinking of this as what EA "values" [verb] rather than EA's "values" [noun]. I think most if not all of the most abstract values EA holds are still in place, but the distinction between core and secondary values is important.

This was mainly a linguistic comment because I find that sometimes people disagree with a post if the terminology used is wrong, so I wanted to get ahead of that. I think I probably could have been more clear that I think you've identified something important and true here, and I am somewhat concerned about how memes spread and wouldn't want people who haven't updated along those lines to feel less like they are part of the EA community.  

I agree with this goal hierarchy framework - it's super super useful to appreciate that many of one's personal goals are just extrapolations and mental shortcuts of more distilled upstream goals.

While I agree with Vaidehi's comments on whether "value drift" is the right descriptor, I think it's true that the mix of in-practice priorities has probably shifted.

As someone who endorses the overall shift towards longtermist priorities, I still do agree with this post. I think it's important people be thinking for themselves and not getting tugged along with social consensus.

I think the challenge is that the recent changes can be described in a number of different ways:

  • Object-level changes to the fields, disciplines, or industries we focus on, which is a priorities shift
  • Changes in attitudes and behaviors regarding spending, which could maybe be described as a lifestyle shift (and, relatedly, the increasing importance ascribed to EAs' time, which could be a bit of a values shift)
  • A more ambitious and less risk-averse attitude, which is maybe a culture shift

I'm not quite sure how I'd summarise these changes with one phrase or word, but these things in combination do create a certain... "aesthetic" that feels coherent - I could create a "2022 EA starter pack" meme that would probably capture the above pretty accurately.

Similarly, less funding towards evidence-backed ideas, and more funding for long shot projects, of which much or most of the expected value comes from a small chance of big success.


Suggestion to change to "Less proportion of funding towards", since the total amount of funding to e.g. GiveWell-backed charities has increased, and overall there is still more funding (at least as of now, though that may change in the near future) going towards that.

Good catch, thank you!

Personally, I dislike the title framing on at least two counts:

  1. As Vaidehi mentioned, I don’t think that “values” have shifted, but rather implementation/interpretation of fairly steady values (e.g., truth-seeking/open-mindedness, impact-maximization) have shifted or at least expanded.
  2. “Drift” has a connotation (if not outright denotation) of “drifting [away] [[from the ideal/target]]” whereas I think that it’s mainly just fair to say “interpretations are shifting."

One person's Value Drift is another person's Bayesian Updating

Relatively unpolished but posting anyway.
 

Finally: +1 for posting anyway, I appreciate it. I find alternative framings of ideas I've heard before and things I don't fully agree with really useful (more so than ideas I agree with, actually) for teasing out what I actually think and clarifying my thoughts on complicated topics.

I also want to make sure we still value the community members who don’t change their minds because of what’s trending in EA, and who still hold the values that we thought were good values 10 years ago and are likely still good values today.

I strongly agree with this!

It's ok to leave EA, but I really hope these community members feel welcome to stay here. Just donating to GiveWell charities is still effective altruism. "Big tent" effective altruism is very important (particularly right now).
