Some research ideas in forecasting

Jaime Sevilla

Some research ideas in forecasting

Jaime Sevilla

8 min readNov 15, 2022

Comments 5

Sorted by

New & upvoted

Javier Prieto🔸

Your likelihood_pool method is returning Brier scores >1. How is that possible? Also, unless you extremize, it should yield the same aggregates (and scores) as regular geometric mean of odds, no?

Jaime Sevilla

I am so dumb I was mistakenly using odds instead of probs to compute the brier score :facepalm:

And yes, you are right, we should extremize before aggregating. Otherwise, the method is equivalent to geo mean of odds.

It's still not very good though

dschwarz

[I wrote this comment on LW, copying to this post. Shouldn't that happen automatically?]

Nice post! I'll throw another signal boost for the Metaculus hackathon that OP links, since this is the first time Metaculus is sharing their whole 1M db of individual forecasts (not just the db of questions & resolutions which is already available). You have to apply to get access though. I'll link it again even though OP already did: https://metaculus.medium.com/announcing-metaculuss-million-predictions-hackathon-91c2dfa3f39

There are nice cash prizes too.

As the OP writes, I think most the ideas here would be valid entries in the hackathon, though the emphasis is on forecast aggregation & methods for scoring individuals. I'm particularly interested in decay of predictions idea. I don't think we know how well predictions age, and what the right strategy for updating your predictions should be for long-running questions.

Jonas Moss

Thanks for writing this.

I wrote about "decay of predictions" here. I would classify the problem as hard.
Do you have a feeling for how suitable the projects are for academic projects? Such as bachelor theses or master theses, perhaps? It would be great to show a list of projects to students!

Jaime Sevilla

Thanks Jonas!

I'd forgotten about that great article! Linked.
I feel some of these would be good bachelor / MSc theses yeah!

Comments

Method

Weighted

Brier

-log

Questions

Neyman aggregate (p=0.36)

Yes

0.106

0.340

899

Extremized mean of logodds (d=1.55)

Yes

0.111

0.350

899

Neyman aggregate (p=0.5)

Yes

0.111

0.351

899

Extremized mean of probabilities (d=1.60)

Yes

0.112

0.355

899

Metaculus prediction

Yes

0.111

0.361

774

Mean of logodds

Yes

0.116

0.370

899

Neyman aggregate (p=0.36)

0.120

0.377

899

Median

Yes

0.121

0.381

899

Extremized mean of logodds (d=1.50)

0.126

0.391

899

Mean of probabilities

Yes

0.122

0.392

899

Neyman aggregate (o=1.00)

0.126

0.393

899

Extremized mean of probabilities (d=1.60)

0.127

0.399

899

Mean of logodds

0.130

0.410

899

Median

0.134

0.418

899

Mean of probabilities

0.138

0.439

899

Baseline (p = 0.36)

N/A

0.230

0.652

899