You want to maximize the "good" you do in the world. What definition of "good" do you use for this? What definition of moral correctness do you use, and wouldn't it get sufficiently Goodharted if you try, with numbers and probabilities and whatnot, to maximize it?
(I'd caution against truly maximising.)
Ben Todd's 80K article "What is social impact? A definition" is a pretty decent start:
And their more rigorous definition:
The rest of the article elaborates on what they mean by all the terms in their rigorous definition.
80K also note that this doesn't just reduce to utilitarianism:
I try to maximise happiness (in the broadest meaning of the word), and to minimise suffering (again, in the broadest meaning of the word). Goodharting this would mean concluding that by far the best outcome for my values is to turn everything in the universe into hedonium (a homogeneous substance with limited consciousness, which is in a constant state of supreme bliss). That doesn't sound like a great outcome to me, so yes, it can be Goodharted. It shows that my actual values are more complex than just caring about happiness and suffering. But it is usually a good-enough proxy for what I want.