We Probably Shouldn't Solve Consciousness

Silica

We Probably Shouldn't Solve Consciousness

Silica

19 min readFeb 10, 2024

Comments 5

Sorted by

New & upvoted

Steven Byrnes

Sorry if I missed it, but is there some part of this post where you suggest specific concrete interventions / actions that you think would be helpful?

Silica

The main goal was to argue for preventing AC. The main intervention discussed was to prevent AC through research and development monitoring. It will likely require the implementation of protocols and labels of certain kinds of consciousness and neurophysics research as DURC or components of concern. I think a close analogue is the biothreat screening projects (IBBIS, SecureDNA) but it’s unclear how a similar project would be implemented for AC “threats”.

By suggesting a call for Artificial Consciousness Safety I am expressing that I don’t think we know any concrete actions that will definitely help and if the need is there (for ACS) we should pursue research to develop interventions. Just like in AI safety no one really knows how to make AI safe. Because I think AC will not be safe and that the risk may not outweigh the benefits, we could seriously pursue strategies that make this common knowledge so things like researchers unintentionally contributing to its creation don’t happen. We may have a significant chance to act before it becomes well known that AC might be possible or profitable. Unlike the runaway effects of AI companies now, we can still prevent the AC economy from even starting.

Steven Byrnes

Mark Solms thinks he understands how to make artificial consciousness (I think everything he says on the topic is wrong), and his book Hidden Spring has an interesting discussion (in chapter 12) on the “oh jeez now what” question. I mostly disagree with what he says about that too, but I find it to be an interesting case-study of someone grappling with the question.

In short, he suggests turning off the sentient machine, then registering a patent for making conscious machines, and assigning that patent to a nonprofit like maybe Future of Life Institute, and then

organise a symposium in which leading scientists and philosophers and other stakeholders are invited to consider the implications, and to make recommendations concerning the way forward, including whether and when and under what conditions the sentient machine should be switched on again – and possibly developed further. Hopefully this will lead to the drawing up of a set of broader guidelines and constraints upon the future development, exploitation and proliferation of sentient AI in general.

He also has a strongly-worded defense of his figuring out how consciousness works and publishing it, on the grounds that if he didn’t, someone else would.

Silica

Thanks for this book suggestion, it does seem like an interesting case study.

I'm quite sceptical any one person could reverse engineer consciousness and I don't buy that it's good reasoning to go ahead with publication simply because someone else might. I'll have to look into Solms and return to this.

May I ask, what is your position on creating artificial consciousness?
Do you see digital suffering as a risk? If so, should we be careful to avoid creating AC?

Steven Byrnes

May I ask, what is your position on creating artificial consciousness?
Do you see digital suffering as a risk? If so, should we be careful to avoid creating AC?

I think the word “we” is hiding a lot of complexity here—like saying “should we decommission all the world’s nuclear weapons?” Well, that sounds nice, but how exactly? If I could wave a magic wand and nobody ever builds conscious AIs, I would think seriously about it, although I don’t know what I would decide—it depends on details I think. Back in the real world, I think that we’re eventually going to get conscious AIs whether that’s a good idea or not. There are surely interventions that will buy time until that happens, but preventing it forever and ever seems infeasible to me. Scientific knowledge tends to get out and accumulate, sooner or later, IMO. “Forever” is a very very long time.

The last time I wrote about my opinions is here.

Do you see digital suffering as a risk?

Yes. The main way I think about that is: I think eventually AIs will be in charge, so the goal is to wind up with AIs that tend to be nice to other AIs. This challenge is somewhat related to the challenge of winding up with AIs that are nice to humans. So preventing digital suffering winds up closely entangled with the alignment problem, which is my area of research. That’s not in itself a reason for optimism, of course.

We might also get a “singleton” world where there is effectively one and only one powerful AI in the world (or many copies of the same AI pursuing the same goals) which would alleviate some or maybe all of that concern. I currently think an eventual “singleton” world is very likely, although I seem to be very much in the minority on that.

Comments

Curated and popular this week

Counting animals: Stable population size is not equivalent to priority level

abrahamrowe, mal_graham🔸·1w ago·Curated 6d ago·16m read

AI Use Note: Main body text entirely human written. Claude (Opus 4.8) helped develop models of animal life histories in the appendix. Cross-posted from Good Structures. Executive Summary * Animal advocates sometimes make claims like “there are X of this animal...

114

Spiro: an update 2.5 years on and a fundraising ask for expansion

Habiba Banu·1w ago·6m read

Summary Back in November 2023 I posted here to launch Spiro and raise our first $198k. Two and a half years later this is an update and a fundraiser for the next step. The short version: we've now reached over-5,900 people with TB preventive medicine, including over 3,000 children under five years old. Our early results have held up well an...

How (not) to fundraise from Anthropic staff

Jack Lewars·6d ago·7m read

Adapted from my Substack, Funding Anthropalypse. Short version: if you want a share of the coming Anthropic and OpenAI windfall - the $37bn+ that could be in play next year - the way in is to become 'legibly excellent', so the evaluators and donors that frontier lab staff already trust point them to yo...

Steven Byrnes

May I ask, what is your position on creating artificial consciousness?
Do you see digital suffering as a risk? If so, should we be careful to avoid creating AC?

The last time I wrote about my opinions is here.

Do you see digital suffering as a risk?

^{^}

A good example of DTD/DIP is Steve Byrnes excellent writeup on connectomics. After writing this article I read Steve's post on connectomics and it seems that I have written a counterpoint.

^{^}

Aleksander, Igor (1995). "Artificial neuroconsciousness an update"

. In Mira, José; Sandoval, Francisco (eds.). From Natural to Artificial Neural Computation. Lecture Notes in Computer Science. Vol. 930. Berlin, Heidelberg: Springer. pp. 566–583. doi:10.1007/3-540-59497-3_224

. ISBN 978-3-540-49288-7.

^{^}

I have been contemplating this since having a theoretical discussion about it with Ben Goertzal at the 2017 AGI conference. [I’ve imagined methods such as BCI-to-BCI devices and replicating the thalamic bridge conjoined craniopagus twins share (see Krista and Tatiana).Such a device would have great utility in detecting consciousness levels and perhaps even wellbeing levels in non-human animals. Perhaps it could also work on evaluating whether a robot or computer was conscious. That’d be brilliant, right!?]

^{^}

https://www.sciencedirect.com/topics/neuroscience/sentience

^{^}

https://en.wikipedia.org/wiki/Artificial_consciousness

^{^}

Dual-use neuroscience often does not refer to the dangers and creation of artificial consciousness. See https://www.crb.uu.se/forskning/projekt/dual-use-neuroscience/ and is most often concerned with weaponised neurotechnology see https://www.sciencedirect.com/science/article/pii/S0896627317311406#sec1.4

^{^}

Substrate-complexity to a similar degree of biological organism where consciousness is first approximated i.e. C.elegans. This might indicate a threshold of organisation of which phenomenologically binding requires.

^{^}

https://jeffsebodotnet.files.wordpress.com/2023/06/moral-consideration-for-ai-systems-by-2030-5.pdf, page 18

^{^}

See some complexity measure contenders in this systematic analysis. https://academic.oup.com/nc/advance-article/doi/10.1093/nc/niab023/6359982

^{^}

Dynamical Complexity and Causal Density are early attempts to measure substrate complexity https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1001052

^{^}

Metzinger, Thomas (2021). "Artificial Suffering: An Argument for a Global Moratorium on Synthetic Phenomenology". Journal of Artificial Intelligence and Consciousness. 08: 43–66.

^{^}

“You touch on - but don't fully explore - my reason for disbelief in your paper, namely the phenomenal binding problem.

No binding = no mind = no suffering.
I speak as someone who takes consciousness fundamentalism seriously: https://www.hedweb.com/quora/2015.html#nonmat BUT there is no easy physicalist road from consciousness fundamentalism / non-materialist physicalism to digital minds. Thus replace the discrete, decohered 1s and 0s of a classical Turing machine with discrete, decohered micro-pixels of experience. Run the program. Irrespective of speed of execution or complexity of the code, the upshot isn't a mind, i.e., a unified subject of experience. Even the slightest degree of phenomenal binding would amount to a hardware error: the program would malfunction.

In our fundamentally quantum world, decoherence both:

(1) makes otherwise physically impossible implementations of abstract classical Turing machines physically possible; and

(2) guarantees that they are mindless - at most micro-experiential zombies. In other words, if monistic physicalism is true, our machines can never wake up: their insentience is architecturally hardwired.

Disbelief in digital sentience is sometimes dismissed as "carbon chauvinism". But not so. A classical Turing machine / connectionist system / LLM can be implemented in carbon as well as silicon (etc). They'd still be zombies. IMO, classical information processors simply have the wrong kind of architecture to support minds. So how do biological minds do it?. I speculate: https://www.hedweb.com/quora/2015.html#quantummind Most likely I'm wrong.” ~ David Pearce

^{^}

I guess at worst there could be infohazards in this post or ones like it that somehow contribute to AC creation. I don’t think there is but I did share a couple of papers that are of concern 😬. And the other infohazard is creating new anxieties for other people they didn’t need to have. Perhaps I need to write a clear disclaimer at the start of this post?

^{^}

Are you concerned how reverse engineering consciousness could lead to digital suffering? Would you consider your research dual-use in the ways discussed? Do you care about solving consciousness and to what degree?

^{^}

Artificial consciousness: Utopia or real possibility?

Buttazzo, Giorgio, July 2001, Computer, ISSN 0018-9162

Time (years)	Event	AC Amount	Valence Range	Suffering Distribution	Average Wellbeing
00	Phenomenal Consciousness is solved (Neurophysics Proof)	-	-10:10	-
10	Consciousness engineering repository is published	-	-100:100	-
15	First AC MVP (in a neural computation lab)	1-5	0:3	1 AC lives at 0 for a brief experiment	2
16	First Public AC Production (The Sims v.30 w/NPPs)	10^2^	-20:20	20AC live at -12	-1
18	AC products trend (apps, marketplaces, WBE and uploads)	10^4^	-50:20	20% at -30 = 2000 lives in unbelievable pain	-18
35	AC Economies (Transformative AC)	10^6-9	-50:25	2 billion lives at -30 wellbeing	2
80+	Proliferation of AC (powerfully scaled worldsims, interplanetary and interstellar AC economies)	10^10+	-1000:50	200 quadrillion lives in extreme suffering	-80

We Probably Shouldn't Solve Consciousness

We Probably Shouldn't Solve Consciousness

TLDR/Summary

A Harmless Sentience Evaluation Tool

Claims

Definitions:

Likelihoods

Solving Consciousness to a Significant Degree

The Likelihood of Artificial Consciousness

Series of Events

Timeline Series of Events

Negative Outcomes [3.c.]

Beneficial Outcomes [3.d.]:

Artificial Consciousness Safety

🚩AC Technological Milestones:

🚩AC DURC examples:

Failure Modes of Science:

Failure Modes of ACS

Success Modes

Final Thoughts