M

mlsbt

153 karmaJoined

Bio

*

Posts
1

Sorted by New

Comments
33

I have run non-EA projects that have been criticized internally and externally. Why do you think it's off? Criticism is just feedback + things that don't matter, when you believe in what you're doing. The EA world is rational enough to adjust its opinions properly in the fullness of time.

If I have an active project I want it to be as good as possible. Certainly there's been mean-spirited, low-quality criticism on the EA Forum before, but not a high proportion. If relatively valid criticism bothers the founder that much, their project is just probably not going to make it. Or they don't really believe in their project (maybe for good reason, as pointed out by the critique).

"public critique clearly created barriers to starting new projects" In what sense? People read criticism of other projects and decide that starting their own isn't worth it? People with new active projects discouraged by critique?

Looking forward to seeing the ARC performance of future multimodal models. I'm also going to try to think of a text-based ARC analog, that is perhaps more general. There are only so many unique simple 2D-grid transformation rules so it can be brute forced to some extent.

I used GPT-4o which is multimodal (and in fact was even trained on these images in particular as I took the examples from the ARC website, not the Github). I did test more grid inputs and it wasn't perfect at 'visualizing' them.

I might be misunderstanding you here. You can easily get ChatGPT to convert the image to a grid representation/visualization, e.g. in Python, not just a list of square-color labels. It can formally draw out the grid any way you want and work with that, but still doesn’t make progress.

Also, to answer your initial question about ARC’s usefulness, the idea is just that these are simple problems where relevant solution strategies don’t exist on the internet. A non-visual ARC analog might be, as Chollet mentioned, Caesar ciphers with non-standard offsets.

Language models have no problem interpreting the image correctly. You can ask them for a description of the input grid and they’ll get it right, they just don’t get the pattern.

This isn’t true if Greg values animal welfare donations above most non-AI things by a sufficient amount. He could have tried to shop around for more favorable conditions with someone else in EA circles but it seems pretty likely that he’d end up going with this one. There’s no market for these bets.

By the most recent World Bank and FAO data, as well as the 2017 FAO data you link to, Greece isn't close to being the largest producer of fish in the EU nor the 15th largest producer in the world. Correct me if I'm wrong, I think the correct claim is that Greece farms the greatest number of fish in the EU. Fish production statistics are generally by total weight rather than fish number, and I see how the latter is more relevant to welfare concerns. However I think your phrasing is a bit misleading, as Greece has a very unique fish industry for the EU. It farms a huge amount of low-weight fish and has a relatively small wild-catch industry. For most (all?) other European countries, total national fish catch (by weight and number) is still dominated by fishing fleet capture rather than aquaculture. I'd be curious to know how your model weights welfare impacts on humane slaughter method adoption vs improving living conditions on farms. If the latter is a bigger deal, I see how Greece can be a high-leverage country to start with, especially considering the growing proportion of aquaculture in fish production worldwide.

Great post! Quick note: clicking on the carets takes me to that same section rather than the longer intervention descriptions under 'List of prioritized interventions'.

Load more