Comments
The Extreme Inefficiency of RL for Frontier Models — EA Forum