I like this comment; the events analogy, in particular, shifted some of my beliefs and gave me a useful frame I had overlooked before. Apologies if you've already covered this in other comments.
My impression is that, if asked, many people would say that EA is significantly worse than other spaces. There's some amount of trying to figure out which other spaces are reasonable to compare to, which allows people to agree with some versions and disagree with others, but on a vibes level, I feel like people think EA is significantly worse. [1] Idk if this is useful, but unlike (most?) other commenters, I do think Nathan is responding to a real sentiment.
Re "why do we even care about a baseline?" Ime, when harassment gets brought up (e.g. on the forum), people say that there were basic/common/cheap/expected mitigations that would have prevented the incident. If it turns out that most other orgs/communities, in fact, don't have these measures in place or are similarly bad at mitigating problems, then imo it's harder for proponents of change to argue that "EA" "should"[2] solve the problem. [3]
I liked the events analogy - one reason why I don't care that much about baselines for events (particiularly ones I have little connection to) is that most events are just ... pretty bad - or at least bad at achieving goals that are analogous to mine, and if I expected a similar level of success to them, I'd spend my time doing other stuff, but I do care a lot about the performance of events that seem particularly great and where my events stack up compared to those (maybe a baseline for "competent according to me" events) - idk if there are transparent groups out there that are doing a really great job in this area and would meet the standards of forum commenters but if there are I would be curious to hear about them.
As an aside, talking about this topic does seem pretty cursed, and I don't think there's much upside socially to being on the "EA is no worse than others" side or even the "idk man, it's pretty confusing how bad EA is on this dimension" side.
To be clear, I mean "should" in more of a deontological sense than a consequentialist one. I do care about outcomes in the world, but I just don't think it's possible to have an online discussion about the consequentialist case that couldn't ~only engage with one side.
It might even point to the problem being unusually difficult to solve (otherwise, why haven't the other well-intentioned communities solved it?).
Hmm, I think if smart EA/Rat types get "corrupted" in general, they'll present as thoughtful people with reasons that are hard to dismiss quickly when questioned by EAs. I get the vague sense that your evidence bar for "corruption" is going to be too high to be useful in most worlds where there's a lot of corruption.
(that's not to say that EAs/Rats/etc. who join labs/start wildly profitable companies speeding up AI progress have been "corrupted" - I just think if they were, it would present pretty similarly to how it has done and it's hard to get lots of easy to share evidence)
Interesting, is sports betting plausibly as bad as tobacco/alcohol in low-income countries?
Like, I think sports betting is plausibly one of the "worst businesses" for the US, comparable to alcohol/tobacco - but my impression is that the EAs that care about tobacco/alcohol don't care very much about interventions in high-income countries relative to low-income countries.
I think this post significantly overstates its conclusion and is plausibly poorly calibrated on the relative value of forecasting.
My main "directional" issues with the post as it's currently written:
I agree with some of the post's vibes and think it's pointing at real cultural traits of rationalist communities. Though tbh, I think OP is too bearish on the usefulness of betting/making falsifiable predictions for people in EA-spaces. I suspect that OP seen lots of people getting very distracted by futarchy/manifold etc. (and I do think this is a risk), but culturally I think EA should be pretty into "betting/making falsifiable predictions" and that cluster of epistemic traits AND I think forecasting infrastructure has a meaningful effect on this. E.g In two office spaces (out of three that I've spent substantial time in), I think Manifold/prediction markets have very clearly made the communities more forecasting-y, and this has had tangible effects on people's research/choice of projects - this is probably the most explicit example, though most changes are harder to hyperlink.
Given that you are criticising the epistemics of EAs taking AGI very seriously, I think it's reasonable to hold this post to a higher epistemic standard than a typical EA forum post. Apologies if this comes across as combative - I spent some time trying to tone it down with Claude and struggled to get something that wasn't just hedged/weak sauce. I am excited about more discussion of the capabilities of AI systems on the EA forum and would like more people to write up their takes on the current situation.
......
I think you are applying more rigour to the bullish case than the bearish one. For example, you say:
[Mythos not providing a substantive improvement in cybersec capabilities] is further highlighted by the fact that an independent analysis was able to find many of the same vulnerabilities using much smaller open-source models.
I think this is misleading for a few reasons:
On the claim that Anthropic talks about risks from their own models primarily to create hype: I find this hard to square with the evidence. Talking about how your B2B product might be extremely dangerous, or publishing lengthy documents critically assessing your own product and admitting to errors that would be difficult to identify independently (e.g. accidentally training against the CoT), is not a common marketing tactic. It feels like your model implies that companies should only release materials optimised for short-term interests, which doesn't predict the real differences in how AI companies approach releases.
Benchmarks are interpreted uncritically
The benchmark contamination arguments are worth engaging with in principle, but I'm not sure they're doing much work in practice - I don't think many people in EA are actually updating heavily on raw benchmark scores right now. METR, arguably EA's favourite benchmarking org, has been pretty vocal about their own benchmarks being saturated, so I think the community is reasonably aware of these limitations already.
Negative results are ignored
I'm genuinely uncertain what you want Anthropic and other AI companies to do here. Do you think "genuine intelligence" is easy to measure and well-defined? The more concrete concepts being used as proxies - coding ability, economic value generated, uplift - seem defensible on their own terms rather than as misleading substitutes for something more fundamental.
On "fundamental limits of LLMs" more broadly: these arguments have been made confidently by prominent researchers since the advent of LLMs and have not had a great track record. That doesn't make them wrong, but it's worth noting.
.....
I think this post would be much stronger if it applied its standards more symmetrically. It would also help to have a more concrete conclusion. The current takeaway is essentially "further research is needed", which is a claim you can make about most areas of research (so much so that it's been banned from multiple journals), but I don't have a great sense of what research would actually convince you that the "AI hype" is reasonable.
Having an AI that doesn't willingly participate in coups doesn't imply that you need to specify all of the AI's values in advance, or that it will be incorrigible in a broad (and x-risk increasing sense).
I think that the people preventing AI-assisted coups are imagining pretty corrigible AIs (in the sense that Claude right now is very corrigible); they just won't want to do coups (in a similar sense to Claude not wanting to help with bioweapons research), and this just seems pretty workable.
A separate cluster of threat models that is worth disentangling is creating more surface area for anti-human-user coordination within the economy, particularly if it's much easier for smart, misaligned AI systems to coordinate with relatively stupid, corrigible AI systems (e.g., Opus 4.7). The arguments for AI <> AI coordination advantage (over AI <> human) are quite intuitive to me, but I don't think you actually need an asymmetry here to put society in a more vulnerable state than the current one. I don't have a great sense of how this washes out, but it feels like a crux for evaluating the net benefit of coordination tech.
Similar to how traditional -> digital banking probably creates more surface area for exploitation by computer hackers, it's probably very good to have primitive computers touching nukes rather than more modern ones.
I thought that part of the core thesis was that as we go through the intelligence explosion, coordination tech becomes increasingly valuable (maybe critical). Are you saying that it's plausible that we'll get "good enough" coordination tech out of agents that are much less powerful that than the frontier during the IE? E.g. coordination tech generally uses Opus 4.7, even in the Opus 6-8 era, where coordination tech seems most (?) valuable, but we also have much more legitimate concerns about scheming capabilities?
Ah yeah, that all makes sense.
I'm sorry that you're much happier for distancing yourself from EA - but I'm glad that you're much happier now!
> If I didn't have a pretty thorough view formed over years, base rate might also be the first thing I want to understand. I guess I'm just personally well past that
I relate to feeling this way about other topics. Normally, I just roll my eyes and don't really engage with online discussions, but for fwiw I appreciate you giving your thoughts here - it was, at the very least, helpful for me.