I don't have an opinion on whether Holly is correct that no one should work for the labs. But even for those who disagree, there are some implied hypotheses here that are worth pondering:
- People systematically underestimate how much money, power, and prestige will influence their beliefs and their judgment.
- People systematically overestimate how much influence they have on others and underestimate how much influence others have on them. Editorializing on my own, I suspect that almost everyone thinks of themselves as a net influencer, but net amount of [influence on others - influence by others] in a system seemingly has to be zero.
If people decide to work in a frontier lab anyway, to what extent can they mitigate the risk of being "frogboiled" by
- having a plan to evaluate -- as objectively as possible -- whether they are being influenced in the ways Holly describes. (What would this look like?);
- living well beneath the AI-lab salary and chipmunking away most of the excess, reducing the risk that they will feel psychological pressure to continue with a lab to maintain their standard of living;
- going out of their way to ensure enough of their social lives / support is independent of the lab, so that their desire to maintain that support will not lead them to stay with the lab if that no longer seems wise;
- publicly commit to yellow lines under which they would seriously consider reversing course, and red lines under which they would pre-commit to doing so;
- or something else?
(I'm open to the response that there are no meaningful detection and/or mitigation techniques.)
I downvoted this (but have upvoted some of your comments).
I think this advice is at minimum overstated, and likely wrong and harmful (at least if taken literally). And it's presented with rhetorical force, so that it seems to mostly be trying to push people's views towards a position that is (IMO) harmful, rather than mostly providing them with information to help them come to their own conclusions.
TBC: