This is still in brainstorming stage; I think there's probably a convincing line of argument for "AI alignment difficulty is high at least on priors" that includes the following points:
- Many humans don't seem particularly aligned to "human values" (not just thinking of dark triad traits, but also things like self-deception, cowardice, etc.)
- There's a loose analogy where AI is "more technological progress," and "technological progress" so far hasn't always been aligned to human flourishing (it has solved or improved a lot of long-term problems of civilization, like infant mortality, but has also created some new ones, like political polarization, obesity, unhappiness from constant bombardement with images of people who are richer and more successful than you, etc.). So, based on this analogy, why think things will somehow fall into place with AI training so that the new forces that be will for once become aligned?
- AI will accelerate everything, and if you accelerate something that isn't set up in a secure way, it goes off the rails ("small issues will be magnified").
[Takeaways from Covid forecasting on Metaculus]
I’m probably going to win the first round of the Li Wenliang forecasting tournament on Metaculus, or maybe get second. (My screen name shows up in second on the leaderboard, but it’s a glitch that’s not resolved yet because one of the resolutions depends on a strongly delayed source.) (Update: I won it!)
With around 52 questions, this was the largest forecasting tournament on the virus. It ran from late February until early June.
I learned a lot during the tournament. Next to claiming credit, I want to share some observations and takeaways from this forecasting experience, inspired by Linch Zhang’s forecasting AMA:
Some things I was particularly wrong about:
Some things I was particularly right about:
(I have stopped following the developments closely by now.)
+1 to the congratulations from JP! I may have mentioned this before, but I considered your forecasts and comments for covidy questions to be the highest-quality on Metaculus, especially back when we were both very active.
You may not have considered it worth your time in the end, but I still think it's good for EAs to do things that on the face of it seem fairly hard, and develop better self models and models of the world as a result.
I know it might not be what you're looking for, but congratulations!
This was a great writeup, thanks for taking the time to make it. Congrats on the contest, too! I'm sorry to hear your experience was stressful. Do you intend to go back to Metaculus in a more relaxed way? I know some users restrict themselves to a subset of topics, for example.
Can you provide some links on the latest IFR estimates? A quick Google search leads me to the same 0.5% ballpark.
I'm not following the developments anymore. I could imagine that the IFR is now lower than it used to be in April because treatment protocols have improved.