Board Game Arena: A Framework and Benchmark for Assessing Large Language Models via Strategic Play
Lucia Cipolina-Kun, Marianna Nezhurina, Jenia Jitsev
https://arxiv.org/abs/2508.03368
This https://arxiv.org/abs/2503.02817 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csSE_…
Me gustó mucho la herramienta de cifrado de ficheros age https://github.com/FiloSottile/age y me puse a hacer un pequeño script para poder cifrar directorios de forma recursiva, desencriptarlos, tener definidas varias bóvedas en distintos ficheros de variables para poder descifrar unos u otros...…
Just read this post by @… on an optimistic AGI future, and while it had some interesting and worthwhile ideas, it's also in my opinion dangerously misguided, and plays into the current AGI hype in a harmful way.
https://social.coop/@eloquence/114940607434005478
My criticisms include:
- Current LLM technology has many layers, but the biggest most capable models are all tied to corporate datacenters and require inordinate amounts of every and water use to run. Trying to use these tools to bring about a post-scarcity economy will burn up the planet. We urgently need more-capable but also vastly more efficient AI technologies if we want to use AI for a post-scarcity economy, and we are *not* nearly on the verge of this despite what the big companies pushing LLMs want us to think.
- I can see that permacommons.org claims a small level of expenses on AI equates to low climate impact. However, given current deep subsidies on place by the big companies to attract users, that isn't a great assumption. The fact that their FAQ dodges the question about which AI systems they use isn't a great look.
- These systems are not free in the same way that Wikipedia or open-source software is. To run your own model you need a data harvesting & cleaning operation that costs millions of dollars minimum, and then you need millions of dollars worth of storage & compute to train & host the models. Right now, big corporations are trying to compete for market share by heavily subsidizing these things, but it you go along with that, you become dependent on them, and you'll be screwed when they jack up the price to a profitable level later. I'd love to see open dataset initiatives SBD the like, and there are some of these things, but not enough yet, and many of the initiatives focus on one problem while ignoring others (fine for research but not the basis for a society yet).
- Between the environmental impacts, the horrible labor conditions and undercompensation of data workers who filter the big datasets, and the impacts of both AI scrapers and AI commons pollution, the developers of the most popular & effective LLMs have a lot of answer for. This project only really mentions environmental impacts, which makes me think that they're not serious about ethics, which in turn makes me distrustful of the whole enterprise.
- Their language also ends up encouraging AI use broadly while totally ignoring several entire classes of harm, so they're effectively contributing to AI hype, especially with such casual talk of AGI and robotics as if embodied AGI were just around the corner. To be clear about this point: we are several breakthroughs away from AGI under the most optimistic assumptions, and giving the impression that those will happen soon plays directly into the hands of the Sam Altmans of the world who are trying to make money off the impression of impending huge advances in AI capabilities. Adding to the AI hype is irresponsible.
- I've got a more philosophical criticism that I'll post about separately.
I do think that the idea of using AI & other software tools, possibly along with robotics and funded by many local cooperatives, in order to make businesses obsolete before they can do the same to all workers, is a good one. Get your local library to buy a knitting machine alongside their 3D printer.
Lately I've felt too busy criticizing AI to really sit down and think about what I do want the future to look like, even though I'm a big proponent of positive visions for the future as a force multiplier for criticism, and this article is inspiring to me in that regard, even if the specific project doesn't seem like a good one.
"Applications of the Critical Incident Technique in Library and Information Science Research: A Literature Review" https://doi.org/10.1515/libri-2024-0065
Just updated the trigger crossbar to the latest versions of all of my various RTL and firmware libraries.
Still have to convert the Ethernet subsystem on the FPGA over from the legacy EthernetBus interface over to my new AXI4-Stream based IP.
And replace the 7 series transceivers wizard on the 10Gbase-R uplink with a raw GTYXE2_CHANNEL (I'm already using raw channel primitives on the other ports)
Just finished "Witchy" by Ariel Slamet Ries. It was a random pick from my local library, but now I see it's a webcomic and I'm looking forward to reading beyond the end of volume 1. It's got cool worldbuilding and a main character who grapples with state oppression, both of which I appreciated.
Lots of interesting use of movement blur in various panels too.
#AmReading
"Vancouver has 22 civic libraries. Victoria has 12. The streets of these two cities, however, have well over 1,000 unincorporated “little libraries”. You’ve seen them, of course. Those quaint little bird-house structures, placed along sidewalks, usually with a glassed-in front to keep the contents safe from the elements. Inside these little boxes there is always the oddest selection of books, and almost always something that will pique your interest. "
This https://arxiv.org/abs/2412.06430 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csSE_…