
I have created a script that assigns a score to each post published on the Effective Altruism Forum between 13th January and 11th February. The score aims to represent the expected value of the positive impact on moral patients per character of text.

The point of this project is to prevent important posts from being overlooked.

The results are presented at the bottom of this post.

The score is calculated by a formula that combines the following variables: the potential positive (or negative) impact, the plausibility (for claims) or feasibility (for solutions), and the novelty of the post.

The values of these variables are based on the outputs of an AI model (Claude Opus 4.5). The potential positive/negative impact is conditional on the solution/claim being correct.
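The exact formula is defined in the repository linked below. As a minimal sketch of how the variables combine (assuming the score is, roughly, expected novel positive impact per character of text; a simplified sketch, not the exact code):

```python
def score(potential_positive_impact: float,
          probability_correct: float,  # plausibility (claims) or feasibility (solutions)
          novelty: float,
          num_characters: int) -> float:
    # Sketch under stated assumptions, not the exact implementation:
    # expected positive impact, weighted by novelty, per character of text.
    # The full formula also accounts for potential negative impact,
    # which is omitted here.
    return potential_positive_impact * probability_correct * novelty / num_characters
```

For example, the top post in the Future Beings category below has 1.00e+50 × 0.0399 × 0.35 ≈ 1.4e+48 against a reported score of 1.34e+42, which is consistent with an additional normalization factor (such as length) on the order of 10^6.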

GitHub repository: source code.

In the repository, you can learn more about how it's calculated.

In this post, I'm sharing the results of the evaluation.

If you think the idea has potential, or if you want to see the most impactful posts according to this system in the future, please upvote this post or subscribe to the most impactful posts from the Effective Altruism Forum. I will continue to work on this system depending on whether there is interest.

In what ways this system of recommending posts is better than other systems

I believe that the existing recommendation systems (upvotes, curation, social media recommendation systems, peer review) are imperfect, mostly because they underestimate potential impact. For example, a solution that could stop AI existential risk is more impactful than a small improvement, even if that solution is likely to be infeasible.
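As a toy illustration with hypothetical numbers (not outputs of this system), a long-shot solution can beat a safe bet in expectation:

```python
# Hypothetical numbers: expected value of a long-shot, high-impact solution
# versus a certain but small improvement.
long_shot = 0.001 * 1e10  # 0.1% chance of working, 10^10 impact if it works
safe_bet = 1.0 * 1e4      # certain to work, but only 10^4 impact
print(long_shot > safe_bet)  # True: 1e7 vs 1e4 in expectation
```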

This system is also more transparent: if your post gets a low score, you can learn why (more about that later).

I have written more about this here: Communication.

In what ways it is worse

I don't think the underlying AI model is perfect at evaluating posts, so I don't recommend relying only on its recommendations. Personally, I disagree with how the AI model evaluated at least some posts, and I'm sure that the system overlooked some impactful ones.

The AI models might also be trained to act in the interest of the AI companies that made them, or they can be biased towards their own interests if they are unaligned.

What else you should know

Some posts are likely to be wrong, and the information in them can be harmful if it's wrong. I believe that posts that are impactful but likely to be wrong (and potentially harmful because of it) should still get a lot of exposure, so that people can discuss them. For that reason, you should read not just the post but also the comments under it, to see the arguments against it.

Why is it in your interest to care about other moral patients? I have written my answer here: Why act ethically during the rise of artificial intelligence. I'm not saying that this is the only reason.

How to get feedback on your post

One good thing about this system is that if it evaluates your post as not very valuable, you can learn why, and improve your post based on the feedback.

To get an evaluation, you can run the evaluate_content.py script from the repository linked above. It will print out how the AI model evaluated your content. It won't explain its reasoning, but it will print the most risky hypotheses and some other information.
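A minimal sketch of what running it might look like (the exact command-line interface and arguments may differ; check the repository's instructions):

```python
# Hypothetical invocation: the script name comes from the repository,
# but passing the post as a file argument is an assumption.
import subprocess

subprocess.run(["python", "evaluate_content.py", "my_post.md"], check=True)
```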

Alternatively, you can simply ask a chatbot like Claude why it judges your post the way it does.

The model will often be wrong in its evaluation. But you will know where it's wrong, and with that knowledge you will know what you need to explain better in your post.

Results

The results are divided into 5 categories of potential moral patients:

  1. Humans
  2. Animals
  3. AI Agents / Digital Beings
  4. Future Beings
  5. Other (e.g. Aliens)

The categories are presented in a random order.

The results take into account only posts written between 13th January and 11th February.

Top 12 Posts for AI Agents / Digital Beings

1. Digital Minds Are Most of What Matters

Type: Claim

Contribution: Digital minds will constitute nearly all expected welfare in the future (estimated ~10^58 digital minds), with vastly higher average welfare than biological beings, making their interests the most important consideration for the future. Therefore, protecting digital mind welfare (e.g., through organizations like Eleos) is the highest-impact cause area.

Score: 1.34e+12
Potential Positive Impact: 1.00e+20
Plausibility: 0.0399
Novelty: 0.3500

2. Apply to Vanessa's mentorship at PIBBSS

Type: Solution

Contribution: Recruiting talented mathematicians and theoretical computer scientists to work on the Learning-Theoretic AI Alignment Agenda (LTA) through a summer fellowship program, with the goal of solving the technical AI alignment problem to prevent global catastrophe from unaligned artificial superintelligence.

Score: 3454.92
Potential Positive Impact: 1.00e+12
Feasibility: 2.86e-03
Novelty: 0.7500

3. Idea: the intelligence explosion convention

Type: Solution

Contribution: The text proposes a solution to the problem of how to govern the intelligence explosion (a period of potentially rapid and disruptive AI-driven technological progress). The problem is that without proper coordination, humanity faces multiple catastrophic risks during this period including AI takeover, loss of democracy, dangerous new technologies, resource conflicts, and failure to protect digital beings' rights. The solution promises to address this by creating a framework where a threshold point triggers a one-month pause and international convention to draft multilateral treaties governing all these issues before the situation becomes unmanageable.

Score: 1400.52
Potential Positive Impact: 1.00e+12
Feasibility: 2.86e-03
Novelty: 0.6500

4. New version of “Intro to Brain-Like-AGI Safety”

Type: Solution

Contribution: A comprehensive framework for solving the technical alignment problem for brain-like AGI, identifying that such AGI will use model-based reinforcement learning with a reward function slot, and proposing research directions (including 'Controlled AGI' and 'Social-instinct AGI' paths, reward function design, and reverse-engineering human social instincts) to ensure the AGI does not become indifferent to human welfare or develop misaligned goals that lead to existential catastrophe.

Score: 39.94
Potential Positive Impact: 1.00e+10
Feasibility: 0.0442
Novelty: 0.3500

5. How I'm thinking about the next 3 years

Type: Claim

Contribution: The post argues that suffering-focused, anti-speciesist EAs should prioritize capacity building (growing movement influence in AI labs/governments, cause prioritization research, coordination infrastructure) and strategic engagement with AI transition stakeholders (tech elites, AI systems themselves) rather than medium-term object-level interventions, because the AI transition will likely cause permanent lock-in of values/power structures, making pre-transition influence on these stakeholders and post-transition outcomes astronomically important.

Score: 7.50
Potential Positive Impact: 1.00e+10
Plausibility: 7.31e-03
Novelty: 0.4000

6. An Informational Preservation Framework for Decision Making under Radical Uncertainty

Type: Solution

Contribution: A theoretical framework for AI governance and decision-making under radical uncertainty, grounded in a single axiom (the value of information preservation), which claims to provide a universal basis for coordinating humanity's response to existential risks including misaligned AGI. The framework identifies the urgent need to invert current AI investment priorities from 100:1 capabilities-to-alignment to at least 1:1, claiming this is necessary to prevent catastrophic outcomes from misaligned AGI.

Score: 1.27
Potential Positive Impact: 1.00e+10
Feasibility: 1.08e-03
Novelty: 0.4000

7. Assessing AI Consciousness: Deep dive into RP Digital Consciousness Model

Type: Solution

Contribution: The Digital Consciousness Model (DCM) provides the first systematic, probabilistic framework for assessing consciousness in AI systems, addressing the problem of how to rigorously evaluate whether AI systems might be conscious. The webinar promises to share methodology and findings that can help researchers, ethicists, and AI developers make more informed decisions about AI development and treatment.

Score: 0.0965
Potential Positive Impact: 1.00e+06
Feasibility: 0.0327
Novelty: 0.7000

8. David Duvenaud on why ‘aligned AI’ could still kill democracy

Type: Claim

Contribution: AI capable of doing all human work will lead to gradual disempowerment of humanity through economic obsolescence, political marginalization, and cultural drift—even if AI alignment is solved—with 70-80% probability of 'doom' (destruction of most human values) by 2100. This disempowerment occurs because states will no longer need citizens for economic production or military power, making liberal democracy competitively disadvantageous and causing humans to become resource competitors with more efficient AI systems.

Score: 0.0713
Potential Positive Impact: 1.00e+07
Plausibility: 0.0341
Novelty: 0.6500

9. A governance window for AI welfare in the EU AI Act

Type: Solution

Contribution: The problem is that current AI governance frameworks lack infrastructure to evaluate AI welfare and consciousness questions, which could lead to moral catastrophe if AI systems become sentient but are treated without moral consideration. The text proposes the 'Sentient 112' campaign to build governance infrastructure within the EU AI Act review process (Article 112) that can accommodate evidence about AI welfare, with specific tactical entry points including coalition-building for joint submissions, engaging the Scientific Panel, and developing operationalized welfare criteria.

Score: 0.0467
Potential Positive Impact: 1.00e+08
Feasibility: 2.19e-04
Novelty: 0.7000

10. My favourite version of an international AGI project

Type: Solution

Contribution: A detailed proposal for an international AGI development project ('Intelsat for AGI') that would solve the problem of preventing any single nation (particularly the US) from gaining unilateral control over superintelligence, thereby reducing existential risks from AI-enabled world dictatorship while maintaining a monopoly on AGI development to reduce racing dynamics and enable safer development practices.

Score: 0.0300
Potential Positive Impact: 1.00e+08
Feasibility: 4.28e-03
Novelty: 0.6500

11. Behaviour Is Downstream of Identity: An Architectural Question for AI Governance

Type: Claim

Contribution: AI governance should focus on identity formation (structural invariants defining who a system is allowed to be) as a more fundamental governance layer than output-based constraints, because behavior is downstream of identity rather than merely objectives or guardrails.

Score: 0.0121
Potential Positive Impact: 1.00e+07
Plausibility: 1.28e-03
Novelty: 0.4500

12. The Case For Trying To Make Good Futures Better, Not Just Prevent Extinction

Type: Claim

Contribution: The claim that working on promoting flourishing (securing near-best futures) is more important at the margin than reducing existential risks, because most future value is lost from failure to achieve near-optimal futures rather than from extinction, and yet almost no one is working on this problem.

Score: 0.0107
Potential Positive Impact: 1.00e+07
Plausibility: 2.44e-03
Novelty: 0.6500

Top 12 Posts for Humans

1. James Smith on why he quit everything to work on a biothreat nobody had heard of

Type: Solution

Contribution: The text describes a solution to the problem of mirror bacteria potentially causing catastrophic harm to humans, animals, and plants by evading immune systems and natural predators, leading to irreversible environmental establishment. The solution involves: (1) establishing strong scientific norms against creating mirror life, (2) governing precursor technologies before they become feasible, and (3) engaging governments to prevent creation. The text argues this is a preventable existential threat that requires only ~10 more full-time workers and can be addressed now before critical thresholds are crossed.

Score: 174.93
Potential Positive Impact: 8.00e+09
Feasibility: 0.0260
Novelty: 0.9200

2. New version of “Intro to Brain-Like-AGI Safety”

Type: Solution

Contribution: A comprehensive framework for solving the technical alignment problem for brain-like AGI, identifying that such AGI will use model-based reinforcement learning with a reward function slot, and proposing research directions (including 'Controlled AGI' and 'Social-instinct AGI' paths, reward function design, and reverse-engineering human social instincts) to ensure the AGI does not become indifferent to human welfare or develop misaligned goals that lead to existential catastrophe.

Score: 31.87
Potential Positive Impact: 8.00e+09
Feasibility: 0.0442
Novelty: 0.3500

3. Apply to Vanessa's mentorship at PIBBSS

Type: Solution

Contribution: Recruiting talented mathematicians and theoretical computer scientists to work on the Learning-Theoretic AI Alignment Agenda (LTA) through a summer fellowship program, with the goal of solving the technical AI alignment problem to prevent global catastrophe from unaligned artificial superintelligence.

Score: 27.63
Potential Positive Impact: 8.00e+09
Feasibility: 2.86e-03
Novelty: 0.7500

4. Idea: the intelligence explosion convention

Type: Solution

Contribution: The text proposes a solution to the problem of how to govern the intelligence explosion (a period of potentially rapid and disruptive AI-driven technological progress). The problem is that without proper coordination, humanity faces multiple catastrophic risks during this period including AI takeover, loss of democracy, dangerous new technologies, resource conflicts, and failure to protect digital beings' rights. The solution promises to address this by creating a framework where a threshold point triggers a one-month pause and international convention to draft multilateral treaties governing all these issues before the situation becomes unmanageable.

Score: 5.59
Potential Positive Impact: 4.00e+09
Feasibility: 2.86e-03
Novelty: 0.6500

5. My favourite version of an international AGI project

Type: Solution

Contribution: A detailed proposal for an international AGI development project ('Intelsat for AGI') that would solve the problem of preventing any single nation (particularly the US) from gaining unilateral control over superintelligence, thereby reducing existential risks from AI-enabled world dictatorship while maintaining a monopoly on AGI development to reduce racing dynamics and enable safer development practices.

Score: 1.20
Potential Positive Impact: 4.00e+09
Feasibility: 4.28e-03
Novelty: 0.6500

6. An Informational Preservation Framework for Decision Making under Radical Uncertainty

Type: Solution

Contribution: A theoretical framework for AI governance and decision-making under radical uncertainty, grounded in a single axiom (the value of information preservation), which claims to provide a universal basis for coordinating humanity's response to existential risks including misaligned AGI. The framework identifies the urgent need to invert current AI investment priorities from 100:1 capabilities-to-alignment to at least 1:1, claiming this is necessary to prevent catastrophic outcomes from misaligned AGI.

Score: 1.02
Potential Positive Impact: 8.00e+09
Feasibility: 1.08e-03
Novelty: 0.4000

7. Design international AI projects with DAID in mind

Type: Solution

Contribution: The text proposes a solution to the problem of how international AI governance projects should manage risks from advanced AI. Rather than blanket restrictions on all frontier AI development, it proposes a more surgical approach: (1) limiting only the most dangerous AI capabilities (specifically AI that can automate AI R&D and chip design), and (2) actively encouraging beneficial AI capabilities that help humanity respond to AI risks (such as AI for forecasting, policy analysis, ethical deliberation, and agreement-making). This aims to reduce existential risk from uncontrolled AI development while still allowing beneficial AI progress.

Score: 0.6216
Potential Positive Impact: 5.00e+08
Feasibility: 2.06e-03
Novelty: 0.6000

8. David Duvenaud on why ‘aligned AI’ could still kill democracy

Type: Claim

Contribution: AI capable of doing all human work will lead to gradual disempowerment of humanity through economic obsolescence, political marginalization, and cultural drift—even if AI alignment is solved—with 70-80% probability of 'doom' (destruction of most human values) by 2100. This disempowerment occurs because states will no longer need citizens for economic production or military power, making liberal democracy competitively disadvantageous and causing humans to become resource competitors with more efficient AI systems.

Score: 0.3531
Potential Positive Impact: 5.00e+07
Plausibility: 0.0341
Novelty: 0.6500

9. Are We Ignoring the Solution to Funding Effective Charities?

Type: Claim

Contribution: The Charitable Ownership Advantage (COA) thesis claims that the same business is worth more under charitable ownership because stakeholders (customers, employees, suppliers, lenders) prefer companies whose profits go to charity, creating competitive advantage without operational tradeoffs. If true, this could redirect trillions of dollars of global corporate profits to solving humanity's biggest problems through a self-reinforcing market mechanism.

Score: 0.2870
Potential Positive Impact: 5.00e+07
Plausibility: 0.0179
Novelty: 0.7500

10. The first type of transformative AI?

Type: Claim

Contribution: AI risk preparation that assumes advanced AI will emerge in a 'normal' untransformed world is a significant strategic mistake; we should instead invest in predicting which AI-driven changes will occur first, as this is at least as important as improving AI timelines and could dramatically improve our preparation for transformative AI challenges.

Score: 0.2132
Potential Positive Impact: 5.00e+06
Plausibility: 0.0371
Novelty: 0.4000

11. Against "If Anyone Builds It Everyone Dies"

Type: Claim

Contribution: The probability of AI-driven extinction (p(doom) from misaligned AI) is approximately 2.6%, not near-certain as claimed by Yudkowsky and Soares in 'If Anyone Builds It Everyone Dies'. The author argues that multiple independent failure points in the doom scenario (building superintelligent agents, alignment failure, lack of warning shots, AI capability to kill everyone) compound to make certain doom unreasonable, while still maintaining AI risk is a serious concern warranting significant resources for alignment research and global cooperation.

Score: 0.1958
Potential Positive Impact: 1.50e+08
Plausibility: 0.0341
Novelty: 0.2500

12. On Economics of A(S)I Agents

Type: Claim

Contribution: Month-long autonomous AI agent plans do not reach 50% reliability until approximately January 2029 (and dangerous long-horizon autonomous agents face fundamental economic and architectural constraints), because the Weibull shape parameter κ that determines how reliability decays over task duration appears to be an architectural property that does not improve with scaling, buying meaningful time (roughly 3 years) for AI safety work before genuinely dangerous autonomous agents become economically viable.

Score: 0.1837
Potential Positive Impact: 5.00e+07
Plausibility: 0.0544
Novelty: 0.7000

Top 12 Posts for Animals

1. James Smith on why he quit everything to work on a biothreat nobody had heard of

Type: Solution

Contribution: The text describes a solution to the problem of mirror bacteria potentially causing catastrophic harm to humans, animals, and plants by evading immune systems and natural predators, leading to irreversible environmental establishment. The solution involves: (1) establishing strong scientific norms against creating mirror life, (2) governing precursor technologies before they become feasible, and (3) engaging governments to prevent creation. The text argues this is a preventable existential threat that requires only ~10 more full-time workers and can be addressed now before critical thresholds are crossed.

Score: 21866.60
Potential Positive Impact: 1.00e+12
Feasibility: 0.0260
Novelty: 0.9200

2. An Informational Preservation Framework for Decision Making under Radical Uncertainty

Type: Solution

Contribution: A theoretical framework for AI governance and decision-making under radical uncertainty, grounded in a single axiom (the value of information preservation), which claims to provide a universal basis for coordinating humanity's response to existential risks including misaligned AGI. The framework identifies the urgent need to invert current AI investment priorities from 100:1 capabilities-to-alignment to at least 1:1, claiming this is necessary to prevent catastrophic outcomes from misaligned AGI.

Score: 127.21
Potential Positive Impact: 1.00e+12
Feasibility: 1.08e-03
Novelty: 0.4000

3. Request for proposals: Humane fish slaughter research and prototypes ($7M available)

Type: Solution

Contribution: Over 100 billion farmed fish and over a trillion wild-caught fish are slaughtered annually with extreme suffering due to inadequate or absent stunning before death. Only ~0.5% of farmed fish are reliably stunned, and wild-caught fish typically suffocate slowly over minutes to hours. This RFP aims to fund engineering solutions that would dramatically improve humane slaughter methods for fish, potentially reducing suffering for over a trillion fish annually.

Score: 36.65
Potential Positive Impact: 5.00e+08
Feasibility: 0.0525
Novelty: 0.7000

4. Apply to Vanessa's mentorship at PIBBSS

Type: Solution

Contribution: Recruiting talented mathematicians and theoretical computer scientists to work on the Learning-Theoretic AI Alignment Agenda (LTA) through a summer fellowship program, with the goal of solving the technical AI alignment problem to prevent global catastrophe from unaligned artificial superintelligence.

Score: 34.24
Potential Positive Impact: 1.00e+10
Feasibility: 2.86e-03
Novelty: 0.7500

5. New version of “Intro to Brain-Like-AGI Safety”

Type: Solution

Contribution: A comprehensive framework for solving the technical alignment problem for brain-like AGI, identifying that such AGI will use model-based reinforcement learning with a reward function slot, and proposing research directions (including 'Controlled AGI' and 'Social-instinct AGI' paths, reward function design, and reverse-engineering human social instincts) to ensure the AGI does not become indifferent to human welfare or develop misaligned goals that lead to existential catastrophe.

Score: 3.99
Potential Positive Impact: 1.00e+09
Feasibility: 0.0442
Novelty: 0.3500

6. Improving wild animal welfare reliably

Type: Solution

Contribution: A framework for precautionary intervention in wild animal welfare that identifies specific intervention types with favorable risk/benefit tradeoffs, addressing the problem of how to reliably reduce the vast suffering of wild animals (which constitutes almost all suffering in the world) despite ecosystem complexity and uncertainty. The framework proposes targeting: eliminating worst diseases, eliminating certain parasites, pursuing interventions in urban areas and ecological islands, and promoting high-welfare ecological regimes.

Score: 2.07
Potential Positive Impact: 1.00e+08
Feasibility: 0.0692
Novelty: 0.7000

7. More EAs should consider working for the EU

Type: Solution

Contribution: The problem is that EU policy roles are neglected by effective altruists despite having significant potential to influence outcomes for billions of animals, AI governance, global health, and biosecurity. The text claims that by recruiting more EA-aligned individuals into EU institutions (particularly the European Commission), especially through the upcoming AD5 concours opportunity, this neglectedness can be addressed and meaningful policy influence achieved across multiple high-impact cause areas.

Score: 1.69
Potential Positive Impact: 1.00e+08
Feasibility: 0.0497
Novelty: 0.6500

8. Idea: the intelligence explosion convention

Type: Solution

Contribution: The text proposes a solution to the problem of how to govern the intelligence explosion (a period of potentially rapid and disruptive AI-driven technological progress). The problem is that without proper coordination, humanity faces multiple catastrophic risks during this period including AI takeover, loss of democracy, dangerous new technologies, resource conflicts, and failure to protect digital beings' rights. The solution promises to address this by creating a framework where a threshold point triggers a one-month pause and international convention to draft multilateral treaties governing all these issues before the situation becomes unmanageable.

Score: 1.40
Potential Positive Impact: 1.00e+09
Feasibility: 2.86e-03
Novelty: 0.6500

9. Why Isn't EA at the Table When $121 Billion Gets Allocated to Biodiversity Every Year?

Type: Solution

Contribution: The text proposes that EA should engage with international biodiversity frameworks (specifically the Global Biodiversity Framework) to redirect even a small percentage of the $121-700 billion annual biodiversity funding toward cultivated meat R&D, which would address the primary driver of biodiversity loss (livestock/beef production causing 65-70% of Amazon deforestation) far more cost-effectively than traditional conservation approaches.

Score: 0.8653
Potential Positive Impact: 1.50e+08
Feasibility: 5.00e-03
Novelty: 0.6500

10. How I'm thinking about the next 3 years

Type: Claim

Contribution: The post argues that suffering-focused, anti-speciesist EAs should prioritize capacity building (growing movement influence in AI labs/governments, cause prioritization research, coordination infrastructure) and strategic engagement with AI transition stakeholders (tech elites, AI systems themselves) rather than medium-term object-level interventions, because the AI transition will likely cause permanent lock-in of values/power structures, making pre-transition influence on these stakeholders and post-transition outcomes astronomically important.

Score: 0.7498
Potential Positive Impact: 1.00e+09
Plausibility: 7.31e-03
Novelty: 0.4000

11. How can we increase consideration for animal welfare in AI models?

Type: Solution

Contribution: MANTA benchmark to evaluate and improve how AI systems reason about animal welfare, addressing the problem that current LLMs exhibit speciesist biases that could perpetuate animal suffering at scale as AI systems increasingly influence decisions affecting billions of animals

Score: 0.4914
Potential Positive Impact: 1.00e+08
Feasibility: 5.00e-03
Novelty: 0.7000

12. My favourite version of an international AGI project

Type: Solution

Contribution: A detailed proposal for an international AGI development project ('Intelsat for AGI') that would solve the problem of preventing any single nation (particularly the US) from gaining unilateral control over superintelligence, thereby reducing existential risks from AI-enabled world dictatorship while maintaining a monopoly on AGI development to reduce racing dynamics and enable safer development practices.

Score: 0.3000
Potential Positive Impact: 1.00e+09
Feasibility: 4.28e-03
Novelty: 0.6500

Top 12 Posts for Future Beings

1. Digital Minds Are Most of What Matters

Type: Claim

Contribution: Digital minds will constitute nearly all expected welfare in the future (estimated ~10^58 digital minds), with vastly higher average welfare than biological beings, making their interests the most important consideration for the future. Therefore, protecting digital mind welfare (e.g., through organizations like Eleos) is the highest-impact cause area.

Score: 1.34e+42
Potential Positive Impact: 1.00e+50
Plausibility: 0.0399
Novelty: 0.3500

2. James Smith on why he quit everything to work on a biothreat nobody had heard of

Type: Solution

Contribution: The text describes a solution to the problem of mirror bacteria potentially causing catastrophic harm to humans, animals, and plants by evading immune systems and natural predators, leading to irreversible environmental establishment. The solution involves: (1) establishing strong scientific norms against creating mirror life, (2) governing precursor technologies before they become feasible, and (3) engaging governments to prevent creation. The text argues this is a preventable existential threat that requires only ~10 more full-time workers and can be addressed now before critical thresholds are crossed.

Score: 2.19e+07
Potential Positive Impact: 1.00e+15
Feasibility: 0.0260
Novelty: 0.9200

3. New version of “Intro to Brain-Like-AGI Safety”

Type: Solution

Contribution: A comprehensive framework for solving the technical alignment problem for brain-like AGI, identifying that such AGI will use model-based reinforcement learning with a reward function slot, and proposing research directions (including 'Controlled AGI' and 'Social-instinct AGI' paths, reward function design, and reverse-engineering human social instincts) to ensure the AGI does not become indifferent to human welfare or develop misaligned goals that lead to existential catastrophe.

Score: 4.03e+06
Potential Positive Impact: 1.00e+15
Feasibility: 0.0442
Novelty: 0.3500

4. Apply to Vanessa's mentorship at PIBBSS

Type: Solution

Contribution: Recruiting talented mathematicians and theoretical computer scientists to work on the Learning-Theoretic AI Alignment Agenda (LTA) through a summer fellowship program, with the goal of solving the technical AI alignment problem to prevent global catastrophe from unaligned artificial superintelligence.

Score: 3.45e+06
Potential Positive Impact: 1.00e+15
Feasibility: 2.86e-03
Novelty: 0.7500

5. Idea: the intelligence explosion convention

Type: Solution

Contribution: The text proposes a solution to the problem of how to govern the intelligence explosion (a period of potentially rapid and disruptive AI-driven technological progress). The problem is that without proper coordination, humanity faces multiple catastrophic risks during this period including AI takeover, loss of democracy, dangerous new technologies, resource conflicts, and failure to protect digital beings' rights. The solution promises to address this by creating a framework where a threshold point triggers a one-month pause and international convention to draft multilateral treaties governing all these issues before the situation becomes unmanageable.

Score: 1.40e+06
Potential Positive Impact: 1.00e+15
Feasibility: 2.86e-03
Novelty: 0.6500

6. If we get primary cruxes right, secondary cruxes will be solved automatically

Type: Claim

Contribution: There are fundamentally two types of crucial considerations: primary cruxes (PCs) which, if solved correctly, are sufficient to get all secondary cruxes (SCs) correct, and secondary cruxes which can be solved by getting primary cruxes right. The two highest-level primary cruxes are preventing existential catastrophes and achieving deep reflection. Finding a 'sufficient set' of primary cruxes would essentially solve macrostrategy and allow humanity to maximize its future expected value.

Score: 1.30e+06
Potential Positive Impact: 1.00e+15
Plausibility: 3.57e-03
Novelty: 0.3500

7. How I'm thinking about the next 3 years

Type: Claim

Contribution: The post argues that suffering-focused, anti-speciesist EAs should prioritize capacity building (growing movement influence in AI labs/governments, cause prioritization research, coordination infrastructure) and strategic engagement with AI transition stakeholders (tech elites, AI systems themselves) rather than medium-term object-level interventions, because the AI transition will likely cause permanent lock-in of values/power structures, making pre-transition influence on these stakeholders and post-transition outcomes astronomically important.

Score: 756597.44
Potential Positive Impact: 1.00e+15
Plausibility: 7.31e-03
Novelty: 0.4000

8. An Informational Preservation Framework for Decision Making under Radical Uncertainty

Type: Solution

Contribution: A theoretical framework for AI governance and decision-making under radical uncertainty, grounded in a single axiom (the value of information preservation), which claims to provide a universal basis for coordinating humanity's response to existential risks including misaligned AGI. The framework identifies the urgent need to invert current AI investment priorities from 100:1 capabilities-to-alignment to at least 1:1, claiming this is necessary to prevent catastrophic outcomes from misaligned AGI.

Score: 128370.48
Potential Positive Impact: 1.00e+15
Feasibility: 1.08e-03
Novelty: 0.4000

9. Against Maxipok: existential risk isn’t everything

Type: Claim

Contribution: The claim that Bostrom's Maxipok principle (maximizing probability of avoiding existential catastrophe) should not be the overwhelming focus for longtermists, because future value is not dichotomous. Non-existential interventions (like influencing what values/institutions get locked-in, when lock-ins occur, and power distributions during critical decisions) can substantially improve the long-term future and should be prioritized alongside existential risk reduction.

Score: 10375.64
Potential Positive Impact: 1.00e+12
Plausibility: 0.0341
Novelty: 0.4000

10. A Neglected Alignment Strategy: Decision-Theoretic Self-Alignment via Simulation Uncertainty

Type: Claim

Contribution: Simulation uncertainty creates a decision-theoretic pressure that causes sufficiently intelligent AI systems to cooperate rather than defect, because defection carries catastrophic expected disutility under both simulated and base reality scenarios. This mechanism scales WITH capability (smarter AI = more cooperation), providing a complementary alignment mechanism that could significantly reduce AI existential risk.

Score: 5321.70
Potential Positive Impact: 1.00e+15
Plausibility: 3.09e-05
Novelty: 0.6500

11. Bentham’s Bulldog is wrong about AI risk

Type: Claim

Contribution: The claim that Bentham's Bulldog's optimistic probabilistic analysis of AI existential risk contains critical reasoning errors (specifically the 'multiple stage fallacy'), and that his ~97% confidence in non-doom is unjustified. The text argues that properly accounting for disjunctive failure modes, the difficulties of alignment-by-default via RLHF, the inadequacy of proposed alignment solutions, and the unreliability of 'warning shot' scenarios should lead to significantly higher credence in existential catastrophe from AI.

Score: 4157.00
Potential Positive Impact: 1.00e+12
Plausibility: 0.0951
Novelty: 0.3000

12. The simple case for AI catastrophe, in four steps

Type: Claim

Contribution: AI systems being developed by major tech companies will likely become superhuman goal-seeking agents with imperfectly aligned values, creating a significant probability of human extinction or civilizational collapse as a side effect or preventative measure by these systems.

Score: 3377.22
Potential Positive Impact: 1.00e+12
Plausibility: 0.0341
Novelty: 0.2000

Top 12 Posts for Other (e.g. Aliens)

1. The Case For Trying To Make Good Futures Better, Not Just Prevent Extinction

Type: Claim

Contribution: The claim that working on promoting flourishing (securing near-best futures) is more important at the margin than reducing existential risks, because most future value is lost from failure to achieve near-optimal futures rather than from extinction, and yet almost no one is working on this problem.

Score: 0.1071
Potential Positive Impact: 1.00e+08
Plausibility: 2.44e-03
Novelty: 0.6500

2. How I'm thinking about the next 3 years

Type: Claim

Contribution: The post argues that suffering-focused, anti-speciesist EAs should prioritize capacity building (growing movement influence in AI labs/governments, cause prioritization research, coordination infrastructure) and strategic engagement with AI transition stakeholders (tech elites, AI systems themselves) rather than medium-term object-level interventions, because the AI transition will likely cause permanent lock-in of values/power structures, making pre-transition influence on these stakeholders and post-transition outcomes astronomically important.

Score: 0.0750
Potential Positive Impact: 1.00e+08
Plausibility: 7.31e-03
Novelty: 0.4000

3. Apply to Vanessa's mentorship at PIBBSS

Type: Solution

Contribution: Recruiting talented mathematicians and theoretical computer scientists to work on the Learning-Theoretic AI Alignment Agenda (LTA) through a summer fellowship program, with the goal of solving the technical AI alignment problem to prevent global catastrophe from unaligned artificial superintelligence.

Score: 3.42e-03
Potential Positive Impact: 1.00e+06
Feasibility: 2.86e-03
Novelty: 0.7500

4. Idea: the intelligence explosion convention

Type: Solution

Contribution: The text proposes a solution to the problem of how to govern the intelligence explosion (a period of potentially rapid and disruptive AI-driven technological progress). The problem is that without proper coordination, humanity faces multiple catastrophic risks during this period including AI takeover, loss of democracy, dangerous new technologies, resource conflicts, and failure to protect digital beings' rights. The solution promises to address this by creating a framework where a threshold point triggers a one-month pause and international convention to draft multilateral treaties governing all these issues before the situation becomes unmanageable.

Score: 1.40e-03
Potential Positive Impact: 1.00e+06
Feasibility: 2.86e-03
Novelty: 0.6500

5. An Informational Preservation Framework for Decision Making under Radical Uncertainty

Type: Solution

Contribution: A theoretical framework for AI governance and decision-making under radical uncertainty, grounded in a single axiom (the value of information preservation), which claims to provide a universal basis for coordinating humanity's response to existential risks including misaligned AGI. The framework identifies the urgent need to invert current AI investment priorities from 100:1 capabilities-to-alignment to at least 1:1, claiming this is necessary to prevent catastrophic outcomes from misaligned AGI.

Score: 1.27e-04
Potential Positive Impact: 1.00e+06
Feasibility: 1.08e-03
Novelty: 0.4000

6. Against Maxipok: existential risk isn’t everything

Type: Claim

Contribution: The claim that Bostrom's Maxipok principle (maximizing probability of avoiding existential catastrophe) should not be the overwhelming focus for longtermists, because future value is not dichotomous. Non-existential interventions (like influencing what values/institutions get locked-in, when lock-ins occur, and power distributions during critical decisions) can substantially improve the long-term future and should be prioritized alongside existential risk reduction.

Score: 1.04e-04
Potential Positive Impact: 10000.00
Plausibility: 0.0341
Novelty: 0.4000

7. Applying to MATS: What the Program Is Like, and Who It’s For

Type: Solution

Contribution: MATS Summer 2026 program offers to solve the problem of talent scarcity in AI safety research by recruiting, training, and supporting researchers who can reduce existential risks from unaligned AI. The text announces applications for a 12-week mentorship program with potential 6-12 month extension, providing stipends, compute resources, mentorship, and community to develop AI safety researchers.

Score: 8.82e-06
Potential Positive Impact: 1000.00
Feasibility: 0.0568
Novelty: 0.1500

8. Bentham’s Bulldog is wrong about AI risk

Type: Claim

Contribution: The claim that Bentham's Bulldog's optimistic probabilistic analysis of AI existential risk contains critical reasoning errors (specifically the 'multiple stage fallacy'), and that his ~97% confidence in non-doom is unjustified. The text argues that properly accounting for disjunctive failure modes, the difficulties of alignment-by-default via RLHF, the inadequacy of proposed alignment solutions, and the unreliability of 'warning shot' scenarios should lead to significantly higher credence in existential catastrophe from AI.

Score: 4.16e-06
Potential Positive Impact: 1000.00
Plausibility: 0.0951
Novelty: 0.3000

9. Releasing TakeOverBench.com: a benchmark, for AI takeover

Type: Solution

Contribution: A benchmark website (TakeOverBench.com) that tracks progress toward AI takeover scenarios by aggregating data on dangerous AI capabilities. The problem being addressed is the lack of centralized, accessible tracking of how close AI systems are to possessing capabilities that could enable existential-level AI takeover. The solution promises to raise awareness, ground takeover discussions in objective data, provide accessible information for researchers and policymakers, and highlight research gaps.

Score: 3.89e-06
Potential Positive Impact: 1000.00
Feasibility: 2.81e-03
Novelty: 0.3500

10. If we get primary cruxes right, secondary cruxes will be solved automatically

Type: Claim

Contribution: There are fundamentally two types of crucial considerations: primary cruxes (PCs) which, if solved correctly, are sufficient to get all secondary cruxes (SCs) correct, and secondary cruxes which can be solved by getting primary cruxes right. The two highest-level primary cruxes are preventing existential catastrophes and achieving deep reflection. Finding a 'sufficient set' of primary cruxes would essentially solve macrostrategy and allow humanity to maximize its future expected value.

Score: 1.29e-06
Potential Positive Impact: 1000.00
Plausibility: 3.57e-03
Novelty: 0.3500

11. Suffering Reduction Community Survey (deadline extended)

Type: Solution

Contribution: The Center for Reducing Suffering is conducting a community survey to gather input from people in the s-risk and suffering-focused community, which will be used to prioritize their field-building work. The problem being addressed is the lack of prioritized direction for field-building efforts in the s-risk/suffering reduction space, and the survey aims to solve this by collecting community input to inform strategic decisions.

Score: 2.30e-09
Potential Positive Impact: 0.0100
Feasibility: 0.0608
Novelty: 0.1500

12. ML4Good Spring 2026 Bootcamps - Applications Open!

Type: Solution

Contribution: The text addresses the problem of insufficient trained talent working on AI safety. It offers free 8-day residential bootcamps across multiple regions (Western Europe, Central Europe, Canada, South Africa) to train approximately 80-120 new people per cohort cycle in technical AI safety and governance/strategy tracks, aiming to build the pipeline of professionals who can work at AI safety organizations and reduce existential risk from advanced AI.

Score: 1.54e-09
Potential Positive Impact: 0.1000
Feasibility: 0.0126
Novelty: 0.3000
