Tyler Johnston

AI Safety Advocacy
967 karma · Joined Jul 2022 · Working (0-5 years) · Tulsa, OK, USA

Bio


Book a 1:1 with me: https://cal.com/tylerjohnston/book

Share anonymous feedback with me: https://www.admonymous.co/tylerjohnston

Comments (59)

I broadly want to +1 this. A lot of the evidence you are asking for probably just doesn’t exist, and in light of that, most people should have a lot of uncertainty about the true effects of any Overton-window-pushing behavior.

That being said, I think there’s some non-anecdotal social science research that might make us more likely to support it. In the case of policy work:

  • Anchoring effects, one of the classic Kahneman/Tversky biases, have been studied quite a bit, and at least one article calls anchoring “the best-replicated finding in social psychology.” To the extent there’s controversy about it, it’s often related to “incidental” or “subliminal” anchoring, which isn’t relevant here. The market also seems to favor a lot of anchoring strategies (like how basically everything on Amazon is “on sale” from an inflated MSRP), which should be a point of evidence that this genuinely just works.
  • In cases where there is widespread “preference falsification,” Overton-shifting behavior might increase people’s willingness to publicly adopt views that were previously outside the window. Cass Sunstein has a good argument that being a “norm entrepreneur,” that is, someone who proposes something controversial, might create chain-reaction social cascades. A lot of the evidence for this is historical, but there are also polling techniques that can reveal preference falsification, and a lot of experimental research showing a (sometimes comically strong) bias toward social conformity, so I suspect something like this is true. Could there be preference falsification among lawmakers surrounding AI issues? Seems possible.

Also, in the case of public advocacy, there's some empirical research (summarized here) that suggests a "radical flank effect," whereby Overton-window-shifting activism increases popular support for moderate demands. There's also some evidence pointing the other direction. Still, I think the supporting evidence is stronger right now.

P.S. Matt Yglesias (as usual) has a good piece that touches on your point. His takeaway is something like: don’t engage in sloppy Overton-window-pushing for its own sake — especially not in place of rigorously argued, robustly good ideas.

For what it's worth, I have no affiliation with CE, yet I disagree with some of the empirical claims you make — I've never gotten the sense that CE has a bad reputation among animal advocacy researchers, nor is it clear to me that the charities you mentioned were bad ideas prior to launching.

Then again, I might just not be in the know. But that's why I really wish this post were pointing at specific reasoning for these claims rather than just saying it's what other people think. If it's true that other people think it, I'd love to know why they think it! If there are factual errors in CE's research, it seems really important to flag them publicly. You even mention that the status quo for giving in the animal space (CE excepted) is "very bad already," which is huge if true given the amount of money at stake, and definitely worth sharing examples of what exactly has gone wrong.

I've been thinking about Emre's comment since I read it — and given this event on the Forum, I eventually decided to go and read Marcus Rediker's biography of Lay. I recommend it for anyone interested in learning more about him as a historical figure.

To share some thoughts on the questions you posed, my feeling is that his extreme protests weren't based on any strategic thinking about social change, and I definitely don't think he'd be an incrementalist if he were alive today. Rather, I think his actions were driven by his extremely firm, passionately felt, and often spiritually-derived moral convictions — the same ones that convinced him to live in a cave and practice radical self-sufficiency. Actually, it seems like he had what we might describe as an excessive degree of "soldier mindset." From the Rediker text:

He was loving to his friends, but he could be a holy terror to those who did not agree with him. He was aggressive and disruptive. He was stubborn, never inclined to admit a mistake. His direct antinomian connection to God made him self-righteous and at times intolerant. The more resistance he encountered, or, as he understood it, the more God tested his faith, the more certain he was that he was right. He had reasons both sacred and self-serving for being the way he was. He was sure that these traits were essential to defeat the profound evil of slavery.

I don't know if the EA community would be wrong to exclude him today. He turned out to be ahead of his time in so many ways, and probably did meaningfully influence the eventual abolition of slavery, but this is so much easier to celebrate ex post. What does it actually feel like from the inside, to have extreme personal convictions that society doesn't share, and how do you know (1) that history will prove you right; and (2) that you are actually making a difference? I really worry that what it feels like to be Benjamin Lay, from the inside, isn't so dissimilar from what it feels like to be a Westboro Baptist Church member today. 

I do think the role of radicalism in driving social change is underrated in this community, and I think it played a big role in not only the slavery abolition movement but also the women's suffrage movement, the civil rights movement, the gay rights movement, etc. It's worth looking into the radical flank effect or Cass Sunstein's writing on social change if you are curious about this. Maybe one thing I'd like to believe is that the world is antifragile and can tolerate radicals from across the moral spectrum, and those who are on the right side of history will eventually win out, making radicalism a sort of asymmetric weapon that's stronger when you are ahead of your time on the moral arc of history. But that's a very convenient theory, and I think it's hard to know with any confidence; the success of so many fascist and hateful ideologies in relatively recent history probably suggests otherwise.

In any case, I really admire Lay for his conviction and his empathy and his total dedication to living a principled life. But I also really admire communities like this one for their commitment to open debate and the scout mindset and earnest attempts to hear each other out and question our own assumptions. So I expect, and hope, that the EA community would ban Benjamin Lay from our events. But I also hope we wouldn't laugh at him like so many Quakers did. I hope we would look at him, scowling at us through the glass, and ask ourselves with total sincerity, "What if he has a point?"

I'm not affiliated with 80k, but I would be surprised if the average reader who encounters their work comes away from it with higher regard for AI labs than they came in with — let alone with the impression that there's something like a brand partnership going on. Most of the content I've seen from them has (in my reading) dealt pretty frankly with the massive negative externalities that AI labs could be generating. In fact, my reading of their article "Should you work at a leading AI lab?" is that they don't broadly recommend it at all. Here's their 1-sentence summary verbatim:

 Recommendation: it's complicated

We think there are people in our audience for whom this is their highest impact option — but some of these roles might also be very harmful for some people. This means it's important to take real care figuring out whether you're in a harmful role, and, if not, whether the role is a good fit for you.

Hopefully this is helpful. It also sounds like these questions could be rhetorical / you have suspicions about their recommendation, so it could be worth writing up the affirmative case against working at labs if you have ideas about that. I know there was a post last week about this, so that thread could be a good place to do it.

I'll admit this only came to mind because of the dubious anthropomorphizing in the piece ("mental imagery" and "dreams"), but I've really enjoyed Stephen Wolfram's writings on AI, including Generative AI Space and the Mental Imagery of Alien Minds. I'm guessing your students would enjoy it. 

His write-up on ChatGPT is also a very good intro to LLMs and neural networks, touching on some of what's going on behind the scenes while remaining approachable for non-technical readers.

Yeah, agreed that it's an odd suggestion. The idea of putting it on a business card feels so counterintuitive to me that I wonder how literally it's meant to be taken, or if the sentence is really just a rhetorical device the authors are using to encourage the reader.

The mention of "Pareto Productivity Pro" rang a bell, so I double-checked my copy of How to Launch a High-Impact Nonprofit — and sure enough, towards the end of the chapter on productivity, the book actually encourages the reader to add that title to their LinkedIn verbatim. Not explicitly as a certification, nor with CE as the certifier, but just in general. I still agree that it could be misleading, but I imagine it was done in fairly good faith given the book suggests it.

However, I do think this sort of resume padding is basically the norm rather than the exception. Somewhat related anecdote from outside EA: Harvard College has given out a named award for many decades to the "top 5% of students of the year by GPA." Lots of people — including myself — put this award on their resume hoping it will help them stand out among other graduates.

The catch is that grade inflation has gotten so bad that something like 30-40% of students will get a 4.0 in any given year, and they all get the award on account of having tied for it (despite it no longer signifying anything like "top 5%"). But the university still describes it as such, and therefore students still describe it that way on resumes and social media (you can actually search "john harvard scholar" in quotes on LinkedIn and see the flexing yourself). Which just illustrates how even large, reputable institutions support this practice through fluffy, misleading awards and certifications.

This post actually spurred me to go and remove the award from my LinkedIn, but I still think it's very easy and normal to accidentally do things that make yourself look better on a resume — especially when there is a "technically true" justification for it (like "the school told me I'm in the top 5%" or "the book told me I could add this to my resume!"), whether or not this is really all that informative for future employers. Also, in the back of my mind, I wonder whether choosing not to do this sort of resume padding creates bad selection effects that lead to people with more integrity being hired less, meaning even high-integrity people should be partaking in resume padding so long as everyone else is (Moloch everywhere!). Maybe the best answer is just making sure hiring committees have good bullshit detectors and lean more on work trials/demonstrated aptitude over fancy certifications/job titles.

I think the lesson we can draw from climate and animal rights that you mention - the radical flank effect - shows that extreme actions concerning an issue in general might make incremental change more palatable to the public. But I don’t think it shows that extreme action attacking incremental change makes that particular incremental change more likely.

If I had to guess, the analogue to this in the animal activist world would be groups like PETA raising awareness about the “scam” that is cage-free. I don’t think there’s any reason to believe this increased the likelihood of cage-free reforms taking place — in fact, my experience advocating for cage-free tells me it just reinforced the social myth that the reform was meaningless, despite evidence showing it reduced total hours spent suffering by nearly 50%.

So, I would like to see an activist ecosystem where there are different groups with different tactics - and some who maybe never offer carrots. But directing the stick to incremental improvements seems to have gone badly in past movements, and I wouldn’t want to see the same mistake made here.

I think just letting the public know about AI lab leaders’ p(doom)s makes sense - in fact, I think most AI researchers are on board with that too (they wouldn’t say these things on podcasts or live on stage if not).

It seems to me this campaign isn’t just meant to raise awareness of X-risk though — it’s meant to punish a particular AI lab for releasing what they see as an inadequate safety policy, and to generate public/legislative opposition to that policy.

I think the public should know about X-risk, but I worry using soundbites of it to generate reputational harms and counter labs’ safety agendas might make it less likely they speak about it in the future. It’s kind of like a repeated game: if the behavior you want in the coming years is safety-oriented, you should cooperate when your opponent exhibits that behavior. Only when they don’t should you defect.
