Your definition seems to constrain 'epistemic process' to mere analytic tasks. It seems to me that it's a big leap from there to effective decision-making. For instance, I can imagine how LLMs could effectively produce resolvable, non-conditional questions, and then answer them with relatively high accuracy. Yet there are three other tasks I'm more skeptical about: 1) generating conditional forecasting questions that encapsulate decision options; 2) making accurate probability judgements on those questions; and thus 3) the uptake of such forecasts into a 'live' decision process. This all seems more likely to work in environments with discrete and replicable processes, some of which you mention, like insurance calculations. But these tasks seem potentially unsolvable by LLMs in more complex decision environments that require more ethical, political, and creative solutions. By 'creative' I mean solutions (e.g. conditional forecasting questions) that simply cannot be assembled from training data because the task is unique. What counts as 'unique' is perhaps an interesting discussion? Nevertheless, this post helped me work through some of these questions -- thanks for sharing! Curious if you have any reactions.
Good questions. A few thoughts:
An anecdote: the US government is trying to convince a foreign government to sign an agreement with the United States but is repeatedly stymied by presidents from both parties for two decades. Let's assume a forecast at that moment suggests a 10% chance the agreement will be signed within a year. A new ambassador designs a creative strategy that hasn't been attempted before. Though the agreement would require executive signature, she decides instead to meet with every single member of parliament and tell them the United States would owe them if they came out publicly in favor of the deal. Fast forward a year, and the agreement is signed.
Another anecdote: the invention of the Apple computer.
Presumably you could use an LLM plus scaffolding to generate a range of options and compare conditional forecasts of their likelihood of success. But will it beat a human? I'm skeptical that an LLM is ever going to be able to "think" through the layers of contextual knowledge about a particular challenge (to say nothing of prioritizing the correct challenge in the first place) well enough to generate winning solutions.
Metric: give forecasters a slate of decision options -- some calculated by LLM, some by humans -- and see who wins.
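To make that metric a bit more concrete, here is a rough sketch of how the comparison could be tallied. Everything in it is an assumption for illustration: the option labels, the probabilities, and the `compare_sources` helper are all hypothetical, and the forecasts are taken to be P(success | option chosen) from forecasters who don't know which source produced which option.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical slate: (option label, source, forecast P(success | option chosen)).
# The numbers are made up purely to show the bookkeeping.
slate = [
    ("meet every member of parliament", "human", 0.35),
    ("renegotiate the agreement text", "llm", 0.12),
    ("public pressure campaign", "llm", 0.18),
    ("wait for the next election", "human", 0.10),
]


def compare_sources(slate):
    """Summarise conditional forecasts per source and find the top-rated option."""
    by_source = defaultdict(list)
    for label, source, p in slate:
        if not 0.0 <= p <= 1.0:
            raise ValueError(f"forecast for {label!r} is not a probability: {p}")
        by_source[source].append(p)

    # Mean forecast and best single forecast for each source.
    summary = {
        source: {"mean": mean(ps), "best": max(ps)}
        for source, ps in by_source.items()
    }
    # The option the forecasters rate most likely to succeed.
    top_option = max(slate, key=lambda row: row[2])
    return summary, top_option


summary, top_option = compare_sources(slate)
print(summary)
print("Top-rated option:", top_option[0], "from", top_option[1])
```

Whether "winning" should mean the best single option or the better average slate is a judgment call; the sketch reports both so either reading can be checked.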
Another thought on metrics: calculate a "similarity score" between a decision option and previous attempts at solving similar challenges. Almost like a metric that captures "neglectedness" and "tractability"?
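One crude way the similarity score could be sketched, assuming each option and each previous attempt is written up as a short text description: compare them by word overlap (cosine similarity over bag-of-words counts) and call an option "novel" when it is far from everything tried before. The function names here are hypothetical, and word overlap is only a stand-in; a real version would probably want text embeddings or expert coding. It also only gets at something like neglectedness, not tractability.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercase word counts; a deliberately simple text representation."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two word-count vectors (0.0 if either is empty)."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def novelty(option, previous_attempts):
    """1 minus the similarity to the closest previous attempt.

    Close to 1.0 means nothing like it has been tried; close to 0.0 means
    it nearly repeats an earlier attempt.
    """
    if not previous_attempts:
        return 1.0
    opt = bag_of_words(option)
    return 1.0 - max(cosine(opt, bag_of_words(p)) for p in previous_attempts)


previous = [
    "lobby the president to sign the agreement",
    "offer trade concessions to the president",
]
print(novelty("meet every member of parliament individually", previous))
print(novelty("lobby the president to sign the agreement", previous))
```

A score like this could then be set alongside the conditional forecast for each option, to see whether the options that end up working tend to be the ones least similar to past attempts.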