Lloyd Rhodes-Brandon 🔸

Co-Founder and Head of Governance at the Nottingham AI Safety Initiative | Masters in policy research @ University of Nottingham

44 karmaJoined Dec 2019Pursuing a graduate degree (e.g. Master's)Working (0-5 years)Nottingham, UK

Message

Interests:

AI safetyAI governance

Bio

Participation
2

I'm a cofounder and the head of AI governance at the Nottingham AI Safety Initiative (NAISI), a student society with faculty advisors and affiliation with UoN's student union. In this role I've taught courses on frontier AI governance, run multiple events on AI safety, and convinced several people to pursue careers in AI safety. Alongside NAISI I'm doing a part-time MA in policy research at Nottingham, focused on catastrophic AI risk.

My current main research is the infrastructure required for unmonitorable multi-agent collusion to become a reality, and what we can do about it. It grew out of my LessWrong work on encrypted agent communication.

I also run BlueDot-informed AI governance courses, help people move into AI safety careers from non-technical backgrounds, and write on the Forum and LessWrong.

How others can help me

Adversarial / expert review of my Private Agent Networks work, on both the technical and governance side
Introductions to people working on multi-agent safety, agent collusion, or AI control
Advice on org-building

How I can help others

Setting up and running effective university AI safety groups (other student unions have approached us for advice!)
Facilitating or running BlueDot-informed AI governance courses
Getting non-technical people usefully into AI safety
Connecting people into the UK student AI safety scene

Posts
6

Sorted by New

More AI safety research and project ideas I haven't seen elsewhere

Lloyd Rhodes-Brandon 🔸

· 10d ago · 8m read

Controversy surrounding Moltbook obscures its very real, novel, unexpressed and rapidly emerging safety risks

Lloyd Rhodes-Brandon 🔸

· 4mo ago · 5m read

Some AI safety project & research ideas/questions for short and long timelines

Lloyd Rhodes-Brandon 🔸

· 11mo ago · 6m read

Democratising AI Alignment: Challenges and Proposals

Lloyd Rhodes-Brandon 🔸

· 1y ago · 5m read

Sentience-Based Alignment Strategies: Should we try to give AI genuine empathy/compassion?

Lloyd Rhodes-Brandon 🔸

· 1y ago · 4m read

Comments
6

Will Sentience Make AI’s Morality Better?

Lloyd Rhodes-Brandon 🔸1y1

Thought-provoking read, thanks for having shared it with me on my post!

I particularly appreciated your ideas around the tension between grounding AI morality in human-like experience—despite AI's lack of it—while recognising that both continuous change and susceptibility to bias complicate moral reasoning, raising questions about whether brief, strategically timed or partial experiences could instil robust moral understanding without distorting judgment in the way emotionally charged human experience often does.

I also found your reflections on engineering AI with predominantly positive qualia, the challenge to valence as the default moral reward signal, the idea that AI sentience might influence human ethical behaviour in return, and the call for a deeper moral foundation than utility maximisation to all be quite novel and helpful ways of seeing things.

Your reference to Agarwal and Edelman aligns well with David Pearce's idea of 'a motivational system based on heritable gradients of bliss', worth checking out if you're not familiar - I think it's a promising model for designing a sentient ASI.

I'm of course not aware of why you chose not to distinguish between consciousness and sentience, but I do find the distinction 80,000 Hours (and no doubt many other sources) makes between them to be useful.

Do Self-Perceived Superintelligent LLMs Exhibit Misalignment?

Lloyd Rhodes-Brandon 🔸1y2

This is fascinating! I really want to see more research like this.

Debate: Morality is Objective

Lloyd Rhodes-Brandon 🔸1y1

70% disagree

The most objective thing about morality (especially utilitarianism) is that some experiential states are objectively 'better' than others by virtue of their valence and that therefore moral projects, however valid they themselves are, at least take root in something real.

Democratising AI Alignment: Challenges and Proposals

Lloyd Rhodes-Brandon 🔸1y1

I appreciate this! I don't feel though that the article addresses the possibility of democratising alignment (or, as Toner says, 'steerability').

Lloyd Rhodes-Brandon 🔸

Bio

Participation2

How others can help me

How I can help others

Posts 6

Comments6

Participation
2

Posts
6

Comments
6