2025-10-07 09:59:32
Multi-Hop Question Answering: When Can Humans Help, and Where do They Struggle?
Jinyan Su, Claire Cardie, Jennifer Healey
https://arxiv.org/abs/2510.04493 https://
Microsoft AI CEO Mustafa Suleyman says Microsoft plans to focus on superintelligence that prioritizes human control; he will lead a new superintelligence team (Reed Albergotti/Semafor)
https://www.semafor.com/article/11/05/2025/mi…
Collaboration and Conflict between Humans and Language Models through the Lens of Game Theory
Mukul Singh, Arjun Radhakrishna, Sumit Gulwani
https://arxiv.org/abs/2509.04847 htt…
I see that with a lot of criticism of generative "AI"—people state that obviously it's completely unreliable and untrustworthy for _their domain of expertise_ but they'll somehow gladly use it for other stuff.
I believe this cognitive dissonance has to do with how the chatbots pretend to be humans and trick us into assuming agency when there is none.
Anyway, as I said otherwise it's great, you should read it: https://theoatmeal.com/comics/ai_art
Re the “wise sage” option:
You do not raise money to build multiple new nuclear power plants to power your product if people think your pitch is “it sucks just like humans except faster and more unpredictably, and it’s worse at arithmetic”
Real humans don’t stream Drake songs 23 hours a day, rapper suing Spotify says - Ars Technica
https://arstechnica.com/tech-policy/2025/11/real-humans-dont-stream-drake-songs-23-hours-a-day-rapper-suing-spotify-says/
Pointing-Guided Target Estimation via Transformer-Based Attention
Luca Müller, Hassan Ali, Philipp Allgeuer, Lukáš Gajdošech, Stefan Wermter
https://arxiv.org/abs/2509.05031
Sturgeon’s law remains my biggest reassurance against LLM world takeover. As far as I can tell, that law is as solid as gravity. Humans on average not giving a shit IS the training material poison pill.
yes, I have just looked at random GitHub code again, why do you ask
Why is it that every time I bring up a thing gen AI does poorly, someone will point out that humans also do this thing poorly? Yes, I also can't draw hands well (unlike trained artists btw) but where are my gazillion dollars?
How does random humans being bad at summarising texts negate that AI summaries suck?
#AI
"I want all social discourse to fit within my personal Overton Window." -- Basically all humans
"We stand by the accuracy of what we published,"
Reuters' statement read.
"We have carefully reviewed the published footage, and we have found no reason to believe Reuters' longstanding commitment to accurate, unbiased journalism has been compromised."
The four-minute clip, released Sept. 3, captured Putin telling Xi that biotechnology could one day extend human life indefinitely.
"With continuous advances in biotechnology, human organs wi…
crabs have evolved five times, humans (at least the ones alive today) only once. how dare we describe ourselves as the peak of evolution?
OH on slack:
SREcon 2024: hey, just a crazy thing, but we thought about giving the root password to a parrot. What do you think?
SREcon 2025: here's our comprehensive report from parrot's rampage, after equipping it with root password and a spending account
SREcon 2026: codebases should be parrots, not pets
SREcon 2027, presented mostly by parrots: how to work effectively with grumpy humans
#sre #devoops #aipocalypse @sre@tagpush.app
The Raising Humans Kind Podcast
Great Australian Pods Podcast Directory: #GreatAusPods
From Actions to Kinesics: Extracting Human Psychological States through Bodily Movements
Cheyu Lin, Katherine A. Flanigan
https://arxiv.org/abs/2510.04844 https://
LLMs as Policy-Agnostic Teammates: A Case Study in Human Proxy Design for Heterogeneous Agent Teams
Aju Ani Justus, Chris Baber
https://arxiv.org/abs/2510.06151 https://
Picture the human body. Zoom in on a single cell. It lives for a while, then splits or dies, as part of a community of cells that make up a particular tissue. This community lives together for many many cell-lifetimes, each performing their own favorite function and reproducing as much as necessary to maintain their community, consuming the essential resources they need and contributing back what they can so that the whole body can live for decades. Each community of cells is interdependent on the whole body, but also stable and sustainable over long periods of time.
Now imagine a cancer cell. It has lost its ability to harmonize with the whole and prioritize balance, instead consuming and reproducing as quickly as it can. As neighboring tissues start to die from its excess, it metastasizes, always spreading to new territory to fuel its unbalanced appetite. The inevitable result is death of the whole body, although through birth, that body can create a new fresh branch of tissues that may continue their stable existence free of cancer. Alternatively, radiation or chemotherapy might be able to kill off the cancer, at great cost to the other tissues, but permitting long-term survival.
To the cancer cell, the idea of decades-long survival of a tissue community is unbelievable. When your natural state is unbounded consumption, growth, and competition, the idea of interdependent cooperation (with tissues all around the body you're not even touching, no less) seems impossible, and the idea that a tissue might survive in a stable form for decades is ludicrous.
"Perhaps if conditions were bleak enough to perfectly balance incessant unrestrained growth against the depredations of a hostile environment it might be possible? I guess the past must have been horribly brutal, so that despite each tissue trying to grow as much as possible they each barely survived? Yes, a stable and sustainable population is probably only possible under conditions of perfectly extreme hardship, and in our current era of unfettered growth, we should rejoice that we live in much easier times!"
You can probably already see where I'm going with this metaphor, but did you know that there are human communities, alive today, that have been living sustainably for *tens, if not hundreds of thousands of years*?
#anarchy #colonialism #civilization
P.S. if you're someone who likes to think about past populations and historical population growth, I cannot recommend the (short, free) game Opera Omnia by Stephen Lavelle enough: https://www.increpare.com/2009/02/opera-omnia/
AP2O: Correcting LLM-Generated Code Errors Type by Type Like Humans via Adaptive Progressive Preference Optimization
Jianqing Zhang, Wei Xia, Hande Dong, Qiang Lin, Jian Cao
https://arxiv.org/abs/2510.02393
When AI Gets Persuaded, Humans Follow: Inducing the Conformity Effect in Persuasive Dialogue
Rikuo Sasaki, Michimasa Inaba
https://arxiv.org/abs/2510.04229 https://
In an alternate universe, there are marsupial humans.
A profile of Mercor, a Scale AI rival valued at $2B in February, which hires domain experts to train models; it had a $100M run rate in March and $6M H1 profit (Richard Nieva/Forbes)
https://www.forbes.com/sites/richardnieva/2025/09/03/ai…
LLMs are a fundamentally useless technology because their applications (supposedly) boil down to humans not having to think for themselves or do their own writing / drawing / filming.
But if you can do it on your own - why would you need a robot to do it? It’s, at best, a novelty.
That’s why this shit only resonates with executives and capital owners. “Get things done with fewer people and expenses” is at least an actual pitch. “Get things done faster for yourself” isn’t.
The individual angle really works for things you already were trying to avoid doing because you’re either disinterested or don’t have enough time to do things right.
“Avoid your work” as a value proposition doesn’t work when you’re dealing with intellectual labor rather than commodities. Not large scale, not long term.
Sorry for the rant, I saw some Notion ads on the subway and got irritated 😅
#AI #llm #LLMs
Here's my latest Human Meme podcast! We cry because we can!
#human
Curiosity-Driven Co-Development of Action and Language in Robots Through Self-Exploration
Theodore Jerome Tinker, Kenji Doya, Jun Tani
https://arxiv.org/abs/2510.05013 https://
Happy to see resolution for this case of anti-health, anti-vax, quackery and grifting.
I believe most farmers are ethical and understand that disease within their flocks and herds can and does cause disease in other domestic and wild populations, and that those diseases are often what cause pandemics in humans.
Avian flu is a danger. Millions of birds have died because of it, and have been culled on other farms to try to control it.
Ostriches are no different.
https://www.cbc.ca/news/canada/british-columbia/livestory/bc-ostrich-farm-decision-scoc-9.6968394
Advanced 2.5 Million-Year-Old Tools May Rewrite Human History https://www.404media.co/advanced-2-5-million-year-old-tools-may-rewrite-human-history/
«Lessons in humility & simplicity for 'data science': Garmin's health status»
I blogged about how another #wearable manufacturer went down the road of leaving data interpretation up to humans instead of automating it – and how that relates to "AI" or "automated decision making"
(Responses to this toot become blog comments too)
#quantifiedself #personalscience
🦾 Humans sense a collaborating robot as part of their 'extended' body
#robots
Training a Perceptual Model for Evaluating Auditory Similarity in Music Adversarial Attack
Yuxuan Liu, Rui Sang, Peihong Zhang, Zhixin Li, Shengchen Li
https://arxiv.org/abs/2509.04985
Leech (2022) by Hiron Ennes is a post-post-apoc novel told from the POV of a hive-mind parasite that's eliminated any human who has medical knowledge, except for the humans it inhabits. This is rationalized as a survival tactic.
The story begins with a mystery: the parasite has no memory of the death of its host who was doctor to a tyrant in the bleak north, so it sends another of itself to investigate. It discovers the dead host has a different parasite. A dog eats that. Some peop…
A Case for Declarative LLM-friendly Interfaces for Improved Efficiency of Computer-Use Agents
Yuan Wang, Mingyu Li, Haibo Chen
https://arxiv.org/abs/2510.04607 https://
Replaced article(s) found for cs.NE. https://arxiv.org/list/cs.NE/new
[1/1]:
- The dynamic interplay between in-context and in-weight learning in humans and neural networks
Jacob Russin, Ellie Pavlick, Michael J. Frank
Sources: Google Gemini co-lead Noam Shazeer clashed with colleagues on internal forums over topics like gender and Gaza; moderators deleted some of his comments (The Information)
https://www.theinformation.com/articles/googl…
Biosphere Substrate and its parameters range
Yegor A. Morozov, Mikhail Bukhtoyarov, Mahdi Yoozbashizadeh
https://arxiv.org/abs/2509.04846 https://arxiv.org…
ProToM: Promoting Prosocial Behaviour via Theory of Mind-Informed Feedback
Matteo Bortoletto, Yichao Zhou, Lance Ying, Tianmin Shu, Andreas Bulling
https://arxiv.org/abs/2509.05091
Emergent Social Dynamics of LLM Agents in the El Farol Bar Problem
Ryosuke Takata, Atsushi Masumori, Takashi Ikegami
https://arxiv.org/abs/2509.04537 https://
An Arbitration Control for an Ensemble of Diversified DQN variants in Continual Reinforcement Learning
Wonseo Jang, Dongjae Kim
https://arxiv.org/abs/2509.04815 https://
"UNESCO Adds an Area the Size of Bolivia to Biosphere Reserve System to Protect 5% of the World’s Land"
#UNESCO #Environment #Conservation
A Hybrid CAPTCHA Combining Generative AI with Keystroke Dynamics for Enhanced Bot Detection
Ayda Aghaei Nia
https://arxiv.org/abs/2510.02374 https://arxiv.…
@… Unpopular, or just unpopular among humans? 😅
Humans are such a moronic species. How we survived this long is beyond me.
(The points made in this little animation have been backed up by substantial research. This is just a cute presentation of it.)
https://youtu.be/Omc37TvHN74
AD-NODE: Adaptive Dynamics Learning with Neural ODEs for Mobile Robots Control
Shao-Yi Yu, Jen-Wei Wang, Maya Horii, Vikas Garg, Tarek Zohdi
https://arxiv.org/abs/2510.05443 htt…
Leopards eating faces party is a time-honored tradition. #science
What Types of Code Review Comments Do Developers Most Frequently Resolve?
Saul Goldman, Hong Yi Lin, Jirat Pasuksmit, Patanamon Thongtanunam, Kla Tantithamthavorn, Zhe Wang, Ray Zhang, Ali Behnaz, Fan Jiang, Michael Siers, Ryan Jiang, Mike Buller, Minwoo Jeong, Ming Wu
https://arxiv.org/abs/2510.05450
@… First I've heard of this - very exciting! https://www.cbc.ca/news/science/jeremy-hansen-moon-1.7649455
I've been using Qobuz for a few months now, and I recommend it. I don't use streaming (and didn't have Spotify before), but I wanted a service where I can buy and download digital music files in top quality and know that I keep them even if they disappear from the service (*cough* Apple Music *cough*). Qobuz delivers, and an extra bonus is that those who subscribe to the streaming service get a substantial discount on the albums they buy.
If you don't want streaming, but would rather have real humans …
The human biological advantage over AI
William Stewart
https://arxiv.org/abs/2509.04130 https://arxiv.org/pdf/2509.04130
Some Further Developments on a Neurobiologically-based Model for Color Sensations in Humans
Charles Q. Wu
https://arxiv.org/abs/2510.01000 https://arxiv.or…
The Interplay of Attention and Memory in Visual Enumeration
B. Sankar, Devottama Sen, Dibakar Sen
https://arxiv.org/abs/2510.05833 https://arxiv.org/pdf/25…
Multi-faceted light pollution modelling and its application to the decline of artificial illuminance in France
Rolf Buhler, Philippe Deverchère, Christophe Plotard, Sébastien Vauclair
https://arxiv.org/abs/2510.02977
How Does Cognitive Bias Affect Large Language Models? A Case Study on the Anchoring Effect in Price Negotiation Simulations
Yoshiki Takenami, Yin Jou Huang, Yugo Murawaki, Chenhui Chu
https://arxiv.org/abs/2508.21137
Mask2IV: Interaction-Centric Video Generation via Mask Trajectories
Gen Li, Bo Zhao, Jianfei Yang, Laura Sevilla-Lara
https://arxiv.org/abs/2510.03135 https://
"Chimpanzees revise their beliefs if they encounter new information, a hallmark of rationality that was once assumed to be unique to humans"
(Original title: Chimps Are Capable of Human-Like Rational Thought, Breakthrough Study Finds)
https://www.404med…
Shape and word parts combine linearly in the Bouba–Kiki effect https://link.springer.com/article/10.3758/s13414-025-03151-1
In Ukraine, Trump's "Peace Efforts" result in even more savage attacks on civilians by Russia.
Those who see supporting Ukraine as a waste of resources are cheering as innocent lives are ruined and ended prematurely.
Perhaps even more disgusting and shameful than the genocide Russia wages on Ukraine, is the indifference shown by so many people around the world.
Monsters and ghouls who insist they are decent humans.
Giorgio is a true journalist reporting…
Sources: ex-xAI researcher Eric Zelikman is raising $1B at a $5B valuation for Humans&, which aims to train AI that is better at collaborating with humans (Anna Tong/Forbes)
https://www.forbes.com/sites/annatong/2025…
This week's Mindscape podcast is with Petter Törnberg, who has been running agent-based simulated social networks to try and figure out why they're so divisive and polarizing.
His models do apparently capture these features of the networks which promote the most popular posts to everyone.
Sad to learn that simply using chronological feeds apparently not only doesn't help; he reckons it makes things worse!
Maybe his agents just aren't smart enough to follow the right people, but then probably nor are mere humans.
🤔
#socialMedia #podcast #fediverse
KSON: A 💌 to the humans maintaining computer configurations
KSON combines the strengths of YAML and JSON. With KSON you can write files that are easy to read, edit, and debug—without worrying about invisible whitespace errors or missing commas.
🛠️ https://kson.org
🌝 Solar powered moon brick factory could build future lunar cities
#space
In his book Mathematica, David Bessis notes this curious fact:
The _definition_ of a 'species' in biology is a bit like an equivalence relation in math: Two animals are of the same species if they can produce fertile offspring. The species are the equivalence classes.
But then, biologists are much more practically inclined to work with their definitions, or how do they know that elephants and humans are in fact distinct species? How do they check?
https://en.wikipedia.org/wiki/Species
“Thanks to the solidity of bearded vulture nest structures and their locations in the western Mediterranean… they have acted as natural museums, conserving historical material in good condition,” the researchers wrote
Among the centuries’ worth of eggshells, prey remains, and natural nesting material, researchers identified 226 objects that were either made or altered by humans.
These included weaponry like a crossbow bolt and wooden lance,
decorated sheep leather,
…
there are 500 trillion calories of humans in the world 😋
This is near where I live and I'm livid about the sheer waste of money to create something so disruptive to both animals and humans. https://www.theguardian.com/us-news/2025/oct/01/arizona-border-wall-san-rafael-valley
MM-Nav: Multi-View VLA Model for Robust Visual Navigation via Multi-Expert Learning
Tianyu Xu, Jiawei Chen, Jiazhao Zhang, Wenyao Zhang, Zekun Qi, Minghan Li, Zhizheng Zhang, He Wang
https://arxiv.org/abs/2510.03142
Research: AI's ability to complete long and complex software engineering tasks doubles every 6-7 months, but there is a "messiness tax" for real-world tasks (Boaz Barak/Windows On Theory)
https://windowsontheory.org/2025/11/04/thoughts-by…
Can AI really replace humans when it comes to the arts? https://www.theguardian.com/commentisfree/picture/2025/oct/11/can-ai-really-replace-humans-when-it-comes-to-the-arts?CMP=Share_iOSApp_Other…
Desperate companies now hiring humans to fix what #AI botched https://futurism.com/companies-hiring-humans-fix-ai I similarly wouldn't be surprised about an upcoming
As humans, we •all• live our lives with our heads up our butts — including but not limited to the Britannica writers. If we’re lucky, someone will help us see what we’re missing. If we’re wise, we listen. At its best, that’s what Wikipedia can do — and in public, at scale, for everyone.
7/
Long-Term Human Motion Prediction Using Spatio-Temporal Maps of Dynamics
Yufei Zhu, Andrey Rudenko, Tomasz P. Kucner, Achim J. Lilienthal, Martin Magnusson
https://arxiv.org/abs/2510.03031
Towards Cognitively-Faithful Decision-Making Models to Improve AI Alignment
Cyrus Cousins, Vijay Keswani, Vincent Conitzer, Hoda Heidari, Jana Schaich Borg, Walter Sinnott-Armstrong
https://arxiv.org/abs/2509.04445
How do Humans and LLMs Process Confusing Code?
Youssef Abdelsalam, Norman Peitek, Anna-Maria Maurer, Mariya Toneva, Sven Apel
https://arxiv.org/abs/2508.18547 https://
InfoMosaic-Bench: Evaluating Multi-Source Information Seeking in Tool-Augmented Agents
Yaxin Du, Yuanshuo Zhang, Xiyuan Yang, Yifan Zhou, Cheng Wang, Gongyi Zou, Xianghe Pang, Wenhao Wang, Menglan Chen, Shuo Tang, Zhiyu Li, Siheng Chen
https://arxiv.org/abs/2510.02271
OpenAI releases GDPval, a benchmark to test AI performance on "economically valuable, real-world tasks", and says Claude Opus 4.1 was the best performing model (Maxwell Zeff/TechCrunch)
https://techcrunch.com/2025/09/25/openai-says-g…
📏 Learning from punishment: Model makes sense of the cognitive processes humans use
#psychology
Oruga: An Avatar of Representational Systems Theory
Daniel Raggi, Gem Stapleton, Mateja Jamnik, Aaron Stockdill, Grecia Garcia Garcia, Peter C-H. Cheng
https://arxiv.org/abs/2509.04041
Virtual Community: An Open World for Humans, Robots, and Society
Qinhong Zhou, Hongxin Zhang, Xiangye Lin, Zheyuan Zhang, Yutian Chen, Wenjun Liu, Zunzhe Zhang, Sunli Chen, Lixing Fang, Qiushi Lyu, Xinyu Sun, Jincheng Yang, Zeyuan Wang, Bao Chi Dang, Zhehuan Chen, Daksha Ladia, Jiageng Liu, Chuang Gan
https://arxiv.org/abs/2508.14893
In-Context Learning can Perform Continual Learning Like Humans
Liuwang Kang, Fan Wang, Shaoshan Liu, Hung-Chyun Chou, Chuan Lin, Ning Ding
https://arxiv.org/abs/2509.22764 https…
Analyzing Reluctance to Ask for Help When Cooperating With Robots: Insights to Integrate Artificial Agents in HRC
Ane San Martin, Michael Hagenow, Julie Shah, Johan Kildal, Elena Lazkano
https://arxiv.org/abs/2509.01450
"Why a wildfire chemical toxic to humans lingers longer in clouds"
#Wildfires #Chemicals
https://
SpaceX defends lunar lander after challenge from NASA administrator
In a lengthy blog post Thursday, Elon Musk’s space company said it “rapidly advanced” the core Starship spacecraft through 11 test flights
and reached 49 milestones in developing the HLS (Human Landing System) version of the craft designed to reach the lunar surface as part of the government’s Artemis program.
“Starship continues to simultaneously be the fastest path to returning humans to the surface of t…
Analysis of Bluffing by DQN and CFR in Leduc Hold'em Poker
Tarik Zaciragic, Aske Plaat, K. Joost Batenburg
https://arxiv.org/abs/2509.04125 https://arx…
Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs
Xingyu Fu, Siyi Liu, Yinuo Xu, Pan Lu, Guangqiuse Hu, Tianbo Yang, Taran Anantasagar, Christopher Shen, Yikai Mao, Yuanzhe Liu, Keyush Shah, Chung Un Lee, Yejin Choi, James Zou, Dan Roth, Chris Callison-Burch
https://arxiv.org/abs/2509.22646
Beyond Words: Interjection Classification for Improved Human-Computer Interaction
Yaniv Goren, Yuval Cohen, Alexander Apartsin, Yehudit Aperstein
https://arxiv.org/abs/2509.03181
AgentPack: A Dataset of Code Changes, Co-Authored by Agents and Humans
Yangtian Zi, Zixuan Wu, Aleksander Boruch-Gruszecki, Jonathan Bell, Arjun Guha
https://arxiv.org/abs/2509.21891
Human-like Navigation in a World Built for Humans
Bhargav Chandaka, Gloria X. Wang, Haozhe Chen, Henry Che, Albert J. Zhai, Shenlong Wang
https://arxiv.org/abs/2509.21189 https:…
💯 Scientists produce quantum entanglement-like results without entangled particles in new experiment
https://phys.org/news/2025-08-scientists-quantum-entanglement-results-entangled.html
Humans Perceive Wrong Narratives from AI Reasoning Texts
Mosh Levy, Zohar Elyoseph, Yoav Goldberg
https://arxiv.org/abs/2508.16599 https://arxiv.org/pdf/25…
Social World Models
Xuhui Zhou, Jiarui Liu, Akhila Yerukola, Hyunwoo Kim, Maarten Sap
https://arxiv.org/abs/2509.00559 https://arxiv.org/pdf/2509.00559
OpenAI says its reasoning system solved all 12 problems at the 2025 ICPC World Finals, with GPT-5 solving 11 and an experimental model solving the last (Maximilian Schreiner/The Decoder)
https://the-decoder.com/openai-outperforms…
Assessing Human Cooperation for Enhancing Social Robot Navigation
Hariharan Arunachalam, Phani Teja Singamaneni, Rachid Alami
https://arxiv.org/abs/2508.21455 https://
Wrong Face, Wrong Move: The Social Dynamics of Emotion Misperception in Agent-Based Models
David Freire-Obregón
https://arxiv.org/abs/2509.00080 https://
Uncovering the Computational Ingredients of Human-Like Representations in LLMs
Zach Studdiford, Timothy T. Rogers, Kushin Mukherjee, Siddharth Suresh
https://arxiv.org/abs/2510.01030
Multi-Agent Data Visualization and Narrative Generation
Anton Wolter, Georgios Vidalakis, Michael Yu, Ankit Grover, Vaishali Dhanoa
https://arxiv.org/abs/2509.00481 https://