
The Allure of the Measurable: Are We Trading Impact for Certainty?

At its heart, Effective Altruism is a commitment to using evidence and reason to do the most good, and the engine of that effectiveness has always been measurement. By quantifying our impact (collecting data, analyzing outcomes, and comparing interventions) we have managed to stay rational, avoid ineffective dead ends, and make a tangible difference in the world. Measurement is the flagship principle of EA, the very thing that distinguishes our approach.

However, a tool that is instrumental to our success can also become a critical weakness.

In this post, I will argue that the EA community's deep-seated reliance on measurement, counting, and monetization may inadvertently cause us to overlook superior opportunities. We risk dismissing or ignoring the most promising routes to impact simply because their effectiveness cannot be easily quantified.

The Central Analogy: The Bednet and the Health System

Let's consider a classic example. For years, one of the most celebrated conclusions in EA has been the cost-effectiveness of distributing insecticide-treated bednets to prevent malaria. Let's examine why.

  • Intervention A: The Bednet. The appeal of the bednet is its legibility. The math is beautifully straightforward: we can calculate the cost per net, measure the baseline rate of malaria, distribute the nets, and measure the new, lower rate of infection. The causal link is direct, the feedback loop is short, and we arrive at a clear, satisfying number: X dollars spent to avert one case of malaria. (A toy version of this calculation follows the list below.)
  • Intervention B: The Health System. Now consider an alternative: a long-term project to reform the underlying healthcare system. This could involve improving public health education, draining swamps where mosquitoes breed, or increasing the number of local clinics. The goal is to address the root cause. A successful project wouldn't just reduce malaria; it would likely reduce the incidence of the next disease as well. The potential upside is enormous.
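To make the bednet's legibility concrete, here is a minimal sketch of the Intervention A calculation. Every number is hypothetical; the point is only that the entire evaluation fits in a dozen lines:

```python
# Toy cost-effectiveness calculation for the bednet intervention.
# All numbers are hypothetical, chosen only to show how simple the math is.

cost_per_net = 5.00          # dollars per insecticide-treated net
nets_distributed = 100_000
baseline_cases = 30_000      # malaria cases per year in the target region
cases_after = 18_000         # cases per year after distribution

total_cost = cost_per_net * nets_distributed
cases_averted = baseline_cases - cases_after

print(f"Cost per case averted: ${total_cost / cases_averted:.2f}")  # $41.67
```

No such dozen lines exist for Intervention B, and that asymmetry is the whole problem.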

But how do we measure Intervention B? The causal chains are a tangled web, the timeframe is measured in years, and success means not just a lower malaria rate but a more resilient, healthier society. There is no simple, clean number to put in a spreadsheet.

Faced with these two options, our current tools and culture will almost always favor the bednet. It is clear, proven, and measurable. But is it truly the most effective thing we can do? Or is it just the most effective thing we can count?

When Numbers Deceive

Measurement appears to be the most objective way to identify the best option, but our trust in it can be deceptive. As I've explored in a previous post, a purely numerical calculation like Expected Value (EV) can mask the true risk of failure, making a 99.9% failure rate seem like a rational choice.
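To see the failure mode concretely, here is a deliberately crude comparison. The probabilities and payoffs are invented for illustration, not drawn from any real intervention:

```python
# Toy expected-value comparison showing how EV can favor an option
# that fails 99.9% of the time. All numbers are hypothetical.

sure_thing = {"p_success": 1.0, "lives_saved": 100}
long_shot = {"p_success": 0.001, "lives_saved": 200_000}

def expected_lives(option):
    return option["p_success"] * option["lives_saved"]

print(expected_lives(sure_thing))  # 100.0
print(expected_lives(long_shot))   # 200.0 -- ranked higher by naive EV
```

A naive EV ranking prefers the long shot even though it fails 999 times out of 1,000. The number is not wrong; it is just silent about everything that matters besides the mean.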

This problem extends beyond complex calculations. Our intense focus on what can be measured may cause us to systematically undervalue interventions that are simply hard to quantify, making them seem less effective even when their potential impact is far greater.

Addressing Common Objections

  • Objection 1: "If we move away from measurement, we lose our rigor and just waste money." This isn't a call to abandon rigor, but to expand it. True rationality requires us to see the world clearly, and if our measurement tools are acting as blinders, we must supplement them. We need to develop more sophisticated frameworks that incorporate qualitative evidence and expert judgment without being "disconnected from the world."
  • Objection 2: "It's better to fund a proven intervention that saves 100 lives for sure than an unproven one that might save 100,000." This highlights a potential flaw in our evaluation process. If our system consistently overlooks opportunities with a vastly higher potential impact because they carry uncertainty, then the system itself may not be fully rational. A truly effective system must be capable of identifying and pursuing these high-reward opportunities, even if it means tolerating risk.
  • Objection 3: "Systemic change is too complex, too political, and too slow for philanthropy to solve." If the goal of Effective Altruism is to solve the world's most pressing problems, we cannot shy away from them simply because they are difficult. To ignore the root causes of suffering because they are complex is a choice. Our ambition for impact must be as large as the problems we hope to solve.

A Path Forward: Potential Solutions

  1. A Portfolio Approach: We should consider formally splitting funding into distinct portfolios. For example, a fund might allocate 70% of its resources to proven, highly measurable interventions, while dedicating 30% to a "high-risk, high-reward" fund for systemic change. This would allow us to continue supporting reliable interventions while also creating space for potentially transformative work.
  2. Develop Better Evaluation Tools: We need to invest in new analytical tools that go beyond simple numbers and help us see the bigger picture. This means getting better at root cause analysis: when we see the malaria problem, we should also model the effects of fixing the healthcare system behind it. We can use qualitative data, historical case studies, and expert forecasting to evaluate the potential impact of these larger, systemic interventions. (A rough sketch of what such a tool could look like follows this list.)
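As one hedged illustration of what such a tool might look like (the structure, probabilities, and parameters below are all hypothetical, not a worked-out methodology):

```python
# Sketch: represent a systemic intervention as a distribution of outcomes
# elicited from expert forecasts, then summarize it by simulation.
# All probabilities and parameters are hypothetical.
import random

def sample_systemic_impact():
    # Hypothetical aggregated expert view: 60% chance the reform fizzles,
    # 40% chance it succeeds with a highly uncertain long-run payoff.
    if random.random() < 0.60:
        return 0.0
    return random.lognormvariate(10, 1)  # cases averted over a decade

samples = sorted(sample_systemic_impact() for _ in range(100_000))
print(f"Median cases averted: {samples[len(samples) // 2]:,.0f}")  # ~0
print(f"Mean cases averted:   {sum(samples) / len(samples):,.0f}")  # large
```

The gap between the median (most runs achieve nothing) and the mean (a few runs dominate everything) is exactly the hits-based profile that a single point estimate hides.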

Conclusion

The tools of quantification have served our community immeasurably well. But we must ensure our tools do not become our masters. We must be willing to look up from the spreadsheet and into the messy, complex reality of the world, ready to tackle the greatest challenges, not just the most measurable ones.

My question to the community is this: How do we hold onto the rigor that defines us, without letting our tools define the limits of our moral ambition?


Comments

I think there's really something to this, as a critique of both EA intellectual culture & practice. Deep in our culture is a type of conservatism and a sense that if something is worth doing, one ought to be able to legibly "win a debate" against all critiques of it. I worry this chokes off innovative approaches, and misses out on the virtues of hits-based giving.

However, there are really a wide variety of activities that EAs get up to, and I think this post could be improved by deeper engagement with the many EA activities that don't fit the bednet mold.

My job is helping the world navigate the development of transformative AI without blowing up, getting taken over by AIs or small groups of humans, or generally going off the rails. The weird nature of this challenge and the lack of a long history of clearly analogous work to study means we fundamentally can't be too measurement-based in the way you describe (though we are certainly vulnerable to other types of pathologies). Many EAs work in this area, and my employer Open Philanthropy gives a lot to it.

An example from a very different part of EA might be Legal Impact for Chickens, currently featured on the CEA website. Though I have no special insight into this work at all, I suspect it also faces fundamental barriers to measurement, since the outcomes of legal action are much more concentrated in a few data points than the outcomes of bednet distribution.

I'd be very grateful to have:

1. A precise example of a systemic change intervention.

2. An explanation of why e.g. ARMoR, Concentric Policies, or Kooperation Global (or any other policy-focused org, or conjunction thereof) don't count as systemic change approaches (despite being definitely EA-aligned).

3. An example of a tried and tested "solve the root cause" intervention, something I can look at and think "Oh, I want that but for GHD!"

Another question: How did we come to the conclusion that a root cause exists for all, or a large chunk, of GHD issues? This sounds like an extremely complex hypothesis to me. What evidence have we observed that would be more probable under a root-cause hypothesis than without one?

My post is a philosophical critique of EA's epistemic culture. The core argument is that our community has a built-in preference for the easily measurable, and I'm exploring the downsides of that bias.

On your points about proof (1 & 3):

The demand for pre-existing, legible proof for a class of interventions highlights the exact paradox I'm concerned about. If we require a high standard of quantitative evidence before trying something new, we may never run the experiments needed to generate that evidence. This creates a catch-22 that can lock us out of potentially transformative work.

On your point about existing orgs (2):

You're right, these organizations do this work. My argument isn't that systemic change is absent from EA, but that it's on the periphery. It's not central to the movement's core narrative or funding strategy in the way direct interventions are. The question is why this type of work isn't more foundational to our approach across all cause areas.

On your final question (the "root cause"):

Good catch on my "root cause" phrasing—it was imprecise. A better term is "foundational systems" or "underlying structures." My hypothesis isn't about a single cause, but that improving these foundational systems acts as an "impact multiplier" that makes all direct interventions more effective. The core problem remains that the impact of strengthening these systems is diffuse and hard to measure.

> 1. A Portfolio Approach: We should consider formally splitting funding into distinct portfolios. For example, a fund might allocate 70% of its resources to proven, highly measurable interventions, while dedicating 30% to a "high-risk, high-reward" fund for systemic change. This would allow us to continue supporting reliable interventions while also creating space for potentially transformative work.

EAs may only control a small fraction of resources in most cause areas (depending on exactly how one defines the cause area). If the portfolio approach is correct, I submit that the hypothetical fund should care about improving the total allocation of resources between the two approaches, not about making its own allocation match what would be ideal for the charitable sector as a whole. Unless the sector already has the balance between portfolios in a cause area approximately correct, it seems that a fund whose objective was to improve the overall sector balance would be close to all-in on one or the other.[1] (A toy illustration follows the footnote.)

[1] There are reasons this might not be the case -- for instance, you might think other funders were not doing a very good job funding either proven, highly measurable interventions or high-risk / high-reward systemic interventions.
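A toy illustration of the all-in point, with entirely invented numbers:

```python
# Toy version of the sector-level allocation argument.
# Every figure here is hypothetical.

sector_total = 1_000_000_000   # total charitable funding in the cause area
systemic_share = 0.10          # what the rest of the sector currently funds
ideal_share = 0.30             # assumed ideal sector-wide balance
our_fund = 50_000_000

# Sector-wide systemic share if our fund goes all-in on systemic work:
new_share = (sector_total * systemic_share + our_fund) / (sector_total + our_fund)
print(f"{new_share:.1%}")  # ~14.3%, still far below the assumed 30% ideal
```

Even going all-in moves the sector only from 10% to about 14%, so under these assumptions every marginal dollar should go to the underfunded portfolio.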

Executive summary: This exploratory post argues that Effective Altruism’s heavy reliance on measurable outcomes may cause it to overlook high-impact opportunities—such as systemic reforms—simply because they are harder to quantify, and it calls for broader evaluative tools and risk-tolerant funding models to address this blind spot.

Key points:

  1. Core critique: EA's emphasis on measurement and legibility risks biasing us toward interventions like bednets that are easier to quantify, while undervaluing complex, potentially more impactful systemic changes.
  2. Illustrative analogy: The author contrasts easily measured interventions (e.g. bednets) with harder-to-evaluate systemic reforms (e.g. healthcare system strengthening), suggesting we may favor the former not because they’re more effective, but because they’re more countable.
  3. Limitations of expected value (EV): Numerical models like EV can obscure high failure probabilities and reinforce our tendency to prefer safe, measurable options over riskier ones with large upside.
  4. Rebuttal of common objections: The post defends the idea of incorporating qualitative evidence and expert judgment as a form of expanded rigor—not a retreat from it—and challenges the notion that systemic interventions are too slow, political, or uncertain to pursue.
  5. Proposed path forward: The author recommends a dual approach: (a) dedicating a share of funding to high-risk, hard-to-measure systemic interventions, and (b) improving tools for evaluating qualitative, long-term, and root-cause-based strategies.
  6. Underlying question: How can EA retain its analytical discipline while broadening its conception of impact to include the less quantifiable but potentially more transformative?

This comment was auto-generated by the EA Forum Team. Feel free to point out issues with this summary by replying to the comment, and contact us if you have feedback.
