Linch

I've been trying to keep the "meta" and the main posts mostly separate so hopefully the discussions for the metas and the main posts aren't as close together.

Linch's Quick takes

Linch3d2

I've now written it here, thanks for all the feedback! :) https://linch.substack.com/p/simplest-case-ai-catastrophe

Linch's Quick takes

Linch14d11

The bets I've seen you post seem rather disadvantageous to the other side, and I believed so at the time. Which is fine/good business from your perspective given that you managed to find takers. But it means I'm more pessimistic on finding good deals by both of our lights.

Linch's Quick takes

Linch18d2

Hmm right now this seems wrong to me, and also not worth going into in an introductory post. Do you have a sense that your view is commonplace? (eg from talking to many people not involved in AI)

Linch's Quick takes

Linch19d22

Here's my current four-point argument for AI risk/danger from misaligned AIs.

We are on the path of creating intelligences capable of being better than humans at almost all economically and militarily relevant tasks.
There are strong selection pressures and trends to make these intelligences into goal-seeking minds acting in the real world, rather than disembodied high-IQ pattern-matchers.
Unlike traditional software, we have little ability to know or control what these goal-seeking minds will do, only directional input.
Minds much better than humans at seeking their goals, with goals different enough from our own, may end us all, either as a preventative measure or side effect.

Request for feedback: I'm curious whether there are points that people think I'm critically missing, and/or ways that these arguments would not be convincing to "normal people." Original goal.

Linch's Quick takes

Linch20d21

What are people's favorite arguments/articles/essays trying to lay out the simplest possible case for AI risk/danger?

Every single argument for AI danger/risk/safety I’ve seen seems to overcomplicate things. Either they have too many extraneous details, or they appeal to overly complex analogies, or they seem to spend much of their time responding to insider debates.

I might want to try my hand at writing the simplest possible argument that is still rigorous and clear, without being trapped by common pitfalls. To do that, I want to quickly survey the field so I can learn from the best existing work as well as avoid the mistakes they make.

Linch's Quick takes

Linch1mo3

I often see people advocate others sacrifice their souls. People often justify lying, political violence, coverups of “your side’s” crimes and misdeeds, or professional misconduct of government officials and journalists, because their cause is sufficiently True and Just. I’m overall skeptical of this entire class of arguments.

This is not because I intrinsically value “clean hands” or seeming good over actual good outcomes. Nor is it because I have a sort of magical thinking common in movies, where things miraculously work out well if you just ignore tradeoffs.

Rather, it’s because I think the empirical consequences of deception, violence, criminal activity, and other norm violations are often (not always) quite bad, and people aren’t smart or wise enough to tell the exceptions apart from the general case, especially when they’re ideologically and emotionally compromised, as is often the case.

Instead, I think it often helps to be interpersonally nice, conduct yourself with honor, and overall be true to your internal and/or society-wide notions of ethics and integrity.

I’m especially skeptical of galaxy-brained positions where to be a hard-nosed consequentialist or whatever, you are supposed to do a specific and concrete Hard Thing (usually involving harming innocents) to achieve some large, underspecified, and far-off positive outcome.

I think it's like those thought experiments about torturing a terrorist (or a terrorist's child) to find the location of the a ticking nuclear bomb under Manhattan where somehow you know the torture would do it.

I mean, sure, if presented that way I'd think it's a good idea but has anybody here checked the literature on the reliability of evidence extracted under torture? Is that really the most effective interrogation technique?

So many people seem eager to rush to sell their souls, without first checking to see if the Devil’s willing to fulfill his end of the bargain.

(x-posted from Substack)

Unknown Knowns: Five Ideas You Can't Unsee

Linch1mo3

Thanks! I agree the math isn't exactly right. The point about x^2 on the rationals is especially sharp.

The problem with calling it "the paradox of the heap" is to make it sound like an actual paradox, instead of a trivially easy connection re:tipping points. I wish I had a better terminology/phrase for the connection I want to make.

Unknown Knowns: Five Ideas You Can't Unsee

Linch1mo4

Thanks, the feeling is mutual.

Unknown Knowns: Five Ideas You Can't Unsee

Linch1mo10

Happy holidays to you too.

I think your comment largely addresses a version of the post that doesn't exist.

In brief:

I don't think I claimed novelty; the post is explicitly about existing concepts that seem obvious once you have them. I even used specific commonly known terms for them.

Theory of mind, mentalization, cognitive empathy, and perspective taking are, of course, not actually "rare" but are what almost all people are doing almost all the time. The interesting question is what kinds of failures you think are common. The more opinionated you are about this, and the more you diverge from consensus opinions of experts such as psychologists and researchers in social work, the more likely you are to be wrong.

The post gave specific examples of people with the capacity for ToM nonetheless failing to consistently apply it to political outgroups, foreign adversaries, story characters etc. Also the specific wording I wrote was:

The core idea is very simple: treat other agents as real. It sounds banal, until you realize how rare it can be, and how frequently people mess up."

You harp on the word "rare" but miss the surrounding context. You consistently make technically true but irrelevant points.

so if the point is to understand world hunger or global poverty, it would be a better idea to just read an introductory text on international development than to think further about how the concept of net present value might or might not shed new light on global poverty.

Are you seriously implying that it takes less effort to read an entire textbook on developmental economics than it is to write a paragraph on a related question? Besides, that wasn't the point of the post anyway, which was more like "here's a specific conceptual error people make, NPV dissolves it."

I don't think anybody disagrees that ideas matter. I would say everyone agrees with that.

This blog post initially grew out of a conversation with a popular blogger about whether ideas actually matter. It's also commonly believed in Silicon Valley that ideas are almost irrelevant compared to execution.

I personally don't find any value in Grice's maxims.

Clearly.

Linch

Posts 85

Comments2910

Posts
85

Comments
2910