GiveWell's updated estimate of deworming and decay

GiveWell

14 min read · Apr 3, 2023

Comments 6

JoelMcGuire

Hi Alex, I’m heartened to see GiveWell engage with and update based on our previous work!

[Edited to expand on takeaway]

My overall impression is:

This update clearly improves GiveWell’s deworming analysis.
Each percentage-point change in deworming cost-effectiveness could affect where hundreds of thousands of dollars are allocated, so getting it right seems important.
More work building an empirical prior seems likely to change the estimated decay of income effects and thus deworming's cost-effectiveness, although it’s unclear in which direction.
Further progress appears easy to make.
This work doesn't update HLI's view of deworming much because:
- We primarily focus on subjective wellbeing as an outcome, which deworming doesn't appear to affect in the long run.
- The long-term income effects of deworming remain uncertain.
- In either case, analysing deworming's long-term effects still relies on a judgement-driven analysis of a single (well-done) but noisy study.

[Note: I threw this comment together rather quickly because I wanted to get something out there that gave my approximate views.]

1. There are several things I like about this update:

In several ways, it clarifies GiveWell’s analysis of deworming.
- It succinctly explains where many of the numbers come from.
- It clarifies the importance of explicit subjective assumptions (they seem pretty important).
It lays out the evidence it uses to build its prior in a manner that’s pretty easy to follow.
- Helpfully, it lists the sources and reasons for the studies not included.

2. There are a few things that I think could be a bit clearer:

The decay rate from the raw (unadjusted) data is 13% yearly.
- Assuming the same starting value as GiveWell, using this decay rate would lead to a total present value of 0.06 log-income units, compared to 0.09 for their “3% decay” model and 0.11 for their “no-decay” model.
- Different decay rates imply very different discounts relative to the "no-decay" baseline / prior: 13% decay → 49% discount; 3% decay → 19% discount.
- They arrive at a decay rate of 3% instead of 13% because they subjectively discount the effect sizes from earlier time points more heavily. While some of their justifications seem quite plausible^[1], after some light spreadsheet crawling I'm still confused about what's happening under the hood here.
The 10% decrease in effectiveness arises because they assign 50% of the weight to their prior that there's no decay and 50% to their estimate of a 3% decay rate (0.5 × 0% + 0.5 × 19% ≈ 10%). So whether the overall adjustment is 0% or 50% depends primarily on two factors:
- How much to subjectively (unclear if this has an empirical element) discount each follow-up.
- How much weight should be assigned to the prior for deworming's time-trajectory, which they inform with a literature review.
All this being said, I think this update is a big improvement to the clarity of GiveWell's deworming analysis.
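The arithmetic in point 2 can be sketched roughly as follows. This is only an illustration of the weighting logic described in the comment, not GiveWell's actual spreadsheet; the function names are hypothetical, and the present values are the rounded figures quoted above.

```python
def discount_vs_no_decay(pv_with_decay: float, pv_no_decay: float) -> float:
    """Fractional reduction in total present value relative to the no-decay case."""
    if pv_no_decay <= 0:
        raise ValueError("pv_no_decay must be positive")
    return 1.0 - pv_with_decay / pv_no_decay


def blended_discount(discounts: list[float], weights: list[float]) -> float:
    """Weighted average of per-scenario discounts; weights must sum to 1."""
    if len(discounts) != len(weights):
        raise ValueError("discounts and weights must have the same length")
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1")
    return sum(d * w for d, w in zip(discounts, weights))


# Rounded present values quoted in the comment (log-income units).
PV_NO_DECAY = 0.11
PV_3_PERCENT_DECAY = 0.09
PV_13_PERCENT_DECAY = 0.06

approx_discount_3 = discount_vs_no_decay(PV_3_PERCENT_DECAY, PV_NO_DECAY)
approx_discount_13 = discount_vs_no_decay(PV_13_PERCENT_DECAY, PV_NO_DECAY)

# 50% weight on the no-decay prior (0% discount) and
# 50% weight on the 3% decay model (19% discount, as quoted in the comment).
overall_adjustment = blended_discount([0.0, 0.19], [0.5, 0.5])

print(f"{approx_discount_3:.0%} {approx_discount_13:.0%} {overall_adjustment:.1%}")
```

With the rounded inputs, the two discounts come out near 18% and 45% rather than the quoted 19% and 49%; the gap is presumably due to rounding in the quoted present values. The blended figure is 9.5%, which matches the roughly 10% adjustment.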

My next two comments are related to some limitations of this update that Alex acknowledges:

It’s possible we’ve missed some relevant studies altogether.
We have not tried to formally combine these to get point estimates over time or attempted to weight studies based on relevance, study quality, etc.
We are combining studies that may have little ability to inform what we’d expect from deworming (twin studies, childcare programs, etc.).
It could be possible to re-assess other studies measuring long-term benefits of early childhood health interventions. When we set our prior, we excluded studies that did not report separate effects on income at different time periods. We guess that for several of these studies, it would be possible to re-analyze the primary data and create estimates of the effect on income at different time periods.

3. After briefly looking over the literature review GiveWell uses to build a prior on the long-term effects of deworming, it seems like further research would lead to different results.

GiveWell takes a “vote counting” approach where the studies are weighted equally^[2]. But I would be very surprised if further research assigned equal weight to these studies because they appear to vary considerably in relevance, sample size, and quality.
- Deworming analogies include preschool, schooling, low birth weight, early childhood stimulation, pollution, twin height differences, and nutritional school lunches. It’s unclear how relevant these are to deworming because the mechanisms for deworming to benefit income seem poorly understood.
- Sample sizes aren’t noted. This could matter because one of the “pro-growth trajectory” studies, Gertler et al. (2021), has a follow-up sample size of around 50. That seems unusually small, so it’s unclear how much weight it should receive relative to the others. However, it is one of the only studies in an LMIC!
- There are also two observational studies, which typically receive less weight than quasi-experimental trials or RCTs (Case et al. 2005, Currie and Hyson 1999).

4. Progress towards building a firmer prior seems straightforward. Is GiveWell planning on refining its prior for deworming's trajectory? Or incentivizing more research on this topic, e.g., via a prize or a bounty? Here are some reasons why I think further progress may not be difficult:

The literature review seems like it could be somewhat easily expanded:
- It seems plausible that you could use Liu and Liu (2019), another causal study of deworming’s long-term effects on income, to see if the long-term effects change depending on age. They were helpful when we asked them for assistance.
- Somewhat at random, I looked at Duflo et al. (2021), which was passed over for inclusion in the review. It contains multiple follow-ups and finds weak evidence for incomes increasing over time due to additional education.
The existing literature review on priors could be upgraded to a meta-analysis with time (data extraction is more tedious than technically challenging). A resulting meta-analysis where each study is weighted by precision and potentially a subjective assessment of relevance would be more clarifying than the present “vote counting” method.
It’s unclear if all the conclusions were warranted. GiveWell reads Lång and Nystedt (2018) as finding “Increases for males; mixed for females” and notes some quotes from the original study:
- “From ages 30–34 and onwards, the height premium increases over the life cycle for men, starting at approximately 5%, reaching 10% at ages 45-64 and approximately 11-12% at ages 65-79 (i.e., in retirement)." [...] "Almost the opposite trend is found for women. Being one decimeter taller is associated with over 11% higher earnings for women aged 25–29. As the women age, the height premium decreases and levels off at approximately 6–7%." [...] "The path of the height premium profile over the female adult life cycle is quite unstable, and no obvious trend can be seen (see Fig. 2)." (17-18)
- But when I look up that same table (shown below), I see decay for women and growth for men.
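To make the contrast between “vote counting” and a precision-weighted meta-analysis concrete, here is a minimal sketch. It assumes each study can be summarized as a yearly trend in the income effect with a standard error, which is my simplification rather than anything in GiveWell's review. The class name, the optional `relevance` weight, and all the numbers are hypothetical placeholders, not extracted data.

```python
from dataclasses import dataclass


@dataclass
class StudyTrend:
    label: str
    slope: float            # estimated yearly change in the income effect
    std_err: float          # standard error of that slope
    relevance: float = 1.0  # subjective relevance to deworming, between 0 and 1


def vote_count(studies: list[StudyTrend]) -> dict[str, int]:
    """Count studies by the sign of their point estimate, ignoring precision."""
    counts = {"increasing": 0, "decreasing": 0, "flat": 0}
    for s in studies:
        if s.slope > 0:
            counts["increasing"] += 1
        elif s.slope < 0:
            counts["decreasing"] += 1
        else:
            counts["flat"] += 1
    return counts


def weighted_pooled_slope(studies: list[StudyTrend]) -> float:
    """Pool slopes with weights of relevance / variance (fixed-effect style)."""
    weights = [s.relevance / s.std_err**2 for s in studies]
    total = sum(weights)
    if total <= 0:
        raise ValueError("at least one study needs positive weight")
    return sum(w * s.slope for w, s in zip(weights, studies)) / total


# Hypothetical placeholder inputs, NOT taken from any of the studies discussed.
hypothetical_studies = [
    StudyTrend("small noisy study", slope=0.02, std_err=0.04),
    StudyTrend("large study A", slope=-0.01, std_err=0.01),
    StudyTrend("large study B", slope=-0.005, std_err=0.01),
]

votes = vote_count(hypothetical_studies)
pooled = weighted_pooled_slope(hypothetical_studies)
print(votes, round(pooled, 4))
```

In this toy example the small noisy study counts for a full "vote" but gets very little weight in the pooled estimate, which is the main reason I would expect the two methods to diverge.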

^[1] “Higher ln earnings effects from KLPS-2 to KLPS-3 are driven by lower control group earnings in KLPS-2 ($330 vs. $1165).[8] In KLPS-3, researchers started measuring farming profits in addition to other forms of earnings,[9] so part of the apparent increase in control group earnings from KLPS-2 to KLPS-3 is likely driven by a change in measurement, not real standards of living or catch-up growth.”
^[2] “We found 10 longitudinal studies with at least two adult follow-ups from a number of countries examining the impact of a range of childhood interventions or conditions (see this table), in addition to the deworming study (Hamory et al. 2021). Of those 10 studies, 3 found decreasing effects on income, 3 found increasing effects, and 4 found mixed effects (either similar effects across time periods, different patterns across males and females, or increases and then decreases over the life cycle). Based on this, we think it makes sense to continue to assume as a prior that income effects would be constant over time. I have low confidence in these estimates, though, and it’s possible further work could lead to a different conclusion.”

GiveWell

Hi, Joel,

Alex here, responding to your comment. Thank you for taking the time to give us this feedback!

In response to some of your specific points:

You're right that we should have characterized the results from Lång and Nystedt (2018) as mixed rather than positive. Thanks for pointing out that mistake. We will update the spreadsheet so that study is correctly color-coded, and update the relevant part of the post. With this adjustment, among the studies we looked at, 3 suggest decreasing effects over time, 2 suggest increasing effects over time, and 5 show mixed effects. This still doesn't seem like it adds up to strong evidence for either increasing or decreasing effects, so my prior of a flat effect over time remains the same.
We excluded Duflo et al. 2021 because it didn't appear to include much about life cycle impacts on income from the intervention. It does report some increases in income for women in the treatment group between 2019 and 2020. However, I'd be reluctant to interpret that as evidence for increases over adulthood, because it represents only one year and because it compares pre-COVID results with results during COVID, which means other factors are probably at play.
That said, I agree that a more in-depth analysis might lead to a different prior for how we should expect early-life health interventions to affect income over the life cycle. We didn't prioritize an in-depth analysis for this adjustment, but we would be open to more work to create a better-informed prior of deworming's income effects over time. This would require deeper engagement with the studies we looked at to better understand their methodologies, relevance to deworming, and other factors. At the moment, it's not a high-priority project for GiveWell staff, but we're considering an external partnership to explore this further. We imagine that having a better grasp on how income effects change over time could inform our analysis not just of deworming but also of other programs we support, including vitamin A supplementation and seasonal malaria chemoprevention.

We'll continue to share here if more work on this leads us to further updates.

Best,
Alex

Kaleem

Hi Alex, thanks for this really detailed post, and for the work you put into the analysis! It's a really nice example of how internal critique in the EA community has led to a tangible update.

My question: (How) Should the average reader/non-expert update on this -10% re-weighting? Like, if ~-10% is decided as the official re-weighting, will this have a non-negligible effect on how we should view the cost-effectiveness of deworming programs etc?

Guy Raveh

And furthermore, will it change how funds from the 'all grants' fund are spent?

GiveWell

Hi, Kaleem and Guy!

This is Miranda Kaplan, communications associate at GiveWell. I'll answer both questions here, since they're closely related.

This adjustment updated GiveWell's overall impression of deworming by around 10%. But the bottom-line takeaway on deworming—which is that it's one of the most cost-effective programs we know of in some locations, but we have a higher degree of uncertainty about it than we do our top charities—hasn't changed much, and we think that should probably continue to be the takeaway for followers of our work.

You can see the effect of our adjustment across all locations and all deworming programs we've supported in our cost-effectiveness analysis change tracker. Before this adjustment, there was already wide variation in our cost-effectiveness estimates for these programs—as high as 38.3x cash for Deworm the World's program in Kenya, and as low as -1x cash for SCI Foundation's program on Unguja, Zanzibar.

We can't say yet what the impact of the decay adjustment will be on GiveWell's overall grantmaking in the deworming space, either using All Grants Fund donations or using other sources. Our approach to grantmaking hasn't changed—we will continue to assess funding gaps for deworming on a case-by-case basis, and consider filling those gaps that clear our cost-effectiveness bar. In a few cases, locations that previously looked cost-effective enough to meet our bar for funding (currently 10x cash) now don't meet that standard. For example, as a result of this adjustment, the estimated cost-effectiveness of Deworm the World's program in Lagos state, Nigeria, dropped to 8.9x cash from 9.9x cash. But for most locations, this change didn't cause a decisive shift in cost-effectiveness that would affect a funding decision.
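As a rough check on the Lagos state example, the relative change implied by the two quoted figures can be computed directly. This is just illustrative arithmetic on the numbers above; the function name is hypothetical and is not part of GiveWell's cost-effectiveness model.

```python
def relative_change(before: float, after: float) -> float:
    """Fractional change from `before` to `after` (negative means a drop)."""
    if before == 0:
        raise ValueError("before must be non-zero")
    return (after - before) / before


# Cost-effectiveness in multiples of cash, as quoted for Lagos state, Nigeria.
lagos_change = relative_change(before=9.9, after=8.9)
print(f"{lagos_change:.1%}")
```

The result is a drop of about 10%, in line with the size of the overall decay adjustment.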

I hope that's helpful!

Best,

Miranda

Guy Raveh

Hi Miranda, thanks for the very clear answer!

I don't necessarily agree with the method of allocation, but from a broad perspective I'm happy to see that a small change in estimates translates to a small, but still meaningful, adjustment in allocation.

Notes

1. “Wage earnings and self-employment profits were collected in KLPS-2, KLPS-3, and KLPS-4; agricultural profits were collected in KLPS-3 and KLPS-4. Annual per capita household earnings are calculated as the sum of wage employment earnings, self-employment profits, and agricultural profits across all household members, divided by the number of household members. Household earnings are only available in KLPS-4.” Hamory et al. 2021, Table 1.
2. We describe the rationale for this here and here.
3. This is based on evidence from health and other possible mechanisms that might contribute to deworming’s long term effects. Our calculations are in this spreadsheet. 1% is the weighted average of effects from different mechanisms (these cells) with the weights on these different mechanisms (these cells).
4. The treatment effect of deworming on ln(income) in the Miguel and Kremer 2004 study population is 0.109, based on our pooling of results across rounds. We describe the rationale for this parameter in the documents linked from this cell in our cost-effectiveness analysis.
5. We describe our informal Bayesian approach here and here. The rationale for our 13% replicability adjustment for deworming is in the documents linked from this cell.
6. Hamory et al. 2021, Appendix, Fig. S3.
7. “It is worth noting that one quarter of both the treatment and control groups are still in school by the time of the survey (Table II), and labor market outcomes are less meaningful for this group.” Baird et al. 2016, IV.C. “Impact on Labor Hours and Occupation,” paragraph 1.
8. Hamory et al. 2021, Appendix, Fig. S3. “Deworming Treatment Effects by Survey Round, B. Annual Individual Earnings.”
9. “Annual individual earnings are calculated as the sum of wage employment across all jobs; nonagricultural self-employment profit across all business; and individual farming profit, defined as net profit generated from noncrop and crop farming activities for which the respondent provided all reported household labor hours and was the main decision maker within the last 12 mo. Wage earnings and self-employment profits were collected in KLPS-2, KLPS-3, and KLPS-4; agricultural profits were collected in KLPS-3 and KLPS-4.” Hamory et al. 2021, Table 2.
10. Hamory et al. 2021, Appendix, Fig. S3. “Deworming Treatment Effects by Survey Round, B. Annual Individual Earnings.”
11. Hamory et al. 2021, Appendix, Fig. S3. “Deworming Treatment Effects by Survey Round, A. Annual Per-Capita Consumption.”
12. “The measurement of economic outcomes was also improved: KLPS round 4 (KLPS-4) incorporates a detailed consumption expenditure questionnaire (modeled on the World Bank Living Standards Measurement Survey; see ref. 32) for all respondents, and round 3 collected this for a representative subsample.” Hamory et al. 2021, Introduction, paragraph 5.
13. See this blog post for further discussion of GiveWell's approach to using broadly Bayesian frameworks in our analyses.
14. 1.4% equals 0.109 treatment effect * 13% replicability adjustment.
15. See discussion above, under "Reasons to put less weight on effects varying over time."
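The 1.4% figure in the notes above can be reproduced with a one-line calculation. This is a minimal sketch; the variable names are illustrative and do not correspond to cells in the cost-effectiveness spreadsheet.

```python
# Pooled treatment effect on ln(income), as stated in the notes.
TREATMENT_EFFECT_LN_INCOME = 0.109
# Replicability adjustment for deworming, as stated in the notes.
REPLICABILITY_ADJUSTMENT = 0.13

adjusted_effect = TREATMENT_EFFECT_LN_INCOME * REPLICABILITY_ADJUSTMENT
print(f"{adjusted_effect:.1%}")
```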

Title	Link
Baird et al. 2016	https://doi.org/10.1093%2Fqje%2Fqjw022
GiveWell, "2023 cost-effectiveness analysis - version 2"	https://docs.google.com/spreadsheets/d/10JFJaWnFAEKmsv5XjXqGqEoMUx0eM7x3WYwu_vC7FRw/edit#gid=472531943
GiveWell, "Context on deworming replicability adjustment (2020)"	https://docs.google.com/document/d/1-F5sZBq6FD6E73SWkKFhwMR9gCdKUCTfp9dOe0I-1vw/edit
GiveWell, "Deworming decay adjustment: deworming effect calculation (2023)"	https://docs.google.com/spreadsheets/d/1iUcIjfudwQlPOftbG_e5axbiAfRr15ie7rvtTiiDFvU/edit#gid=1321957472
GiveWell, "Deworming decay adjustment: KLPS 4 Deworming Effect Size Parameter Update (2023)"	https://docs.google.com/spreadsheets/d/1bbZWTjklQ5hc2i4zynCR6gq2TqR0x3HCgC-ssK6i0FI/edit#gid=1667455426
GiveWell, "Deworming decay adjustment: replicability adjustment (2023)"	https://docs.google.com/spreadsheets/d/1u6kDrFbns-2_M46G_POro09RcKQZ0TSy8qq-IXK6Z1o/edit#gid=2002315610
GiveWell, "Deworming decay adjustment: replicability adjustment (informal Bayesian analysis, 2023)"	https://docs.google.com/spreadsheets/d/1kOh43pku33n7AQyAv6X43Z9bTUThcE43ZZcOeD-YUoo/edit#gid=251688210
GiveWell, "Deworming Effect Size Parameter Update - KLPS 4 Results 11.07.19"	https://docs.google.com/spreadsheets/d/1MNEPqRhIndfpeJT3LxCK1Bvrn-N81n0PlvsXRjc3fb4/edit#gid=0
GiveWell, "Deworming replicability adjustment (2020)"	https://docs.google.com/document/d/1PZfYXegWco0qrmQnQBjclZeq4uUdWfpE5xdBYx0cAEU/edit
GiveWell, "Deworming replicability adjustment 2019"	https://docs.google.com/spreadsheets/d/1ZvX6XI5AKxTYQJbyxlEkmuf6LyYlt18kyfeqRGs-aaM/edit#gid=0
GiveWell, "Long-term effects literature review"	https://docs.google.com/spreadsheets/d/1n1fRU77jvxFlIkiHF4zoQn3I6n3KcpjGfLih_W_JniM/edit#gid=0
GiveWell, "UC Berkeley — KLPS-4 Survey"	https://www.givewell.org/research/incubation-grants/uc-berkeley/april-2017-grant
GiveWell, "Why we can’t take expected value estimates literally (even when they’re unbiased)," 2011	https://blog.givewell.org/2011/08/18/why-we-cant-take-expected-value-estimates-literally-even-when-theyre-unbiased/
Hamory et al. 2021	https://doi.org/10.1073/pnas.2023185118
McGuire, Dupret, and Plant, "Deworming and decay: replicating GiveWell’s cost-effectiveness analysis," 2022	https://web.archive.org/web/20230221162055/https://forum.effectivealtruism.org/posts/MKiqGvijAXfcBHCYJ/deworming-and-decay-replicating-givewell-s-cost
