#aiinbusiness #genai #llm #aiagents | Jon Ippolito
This story of an absolutely unhinged AI vending machine is both hysterical and a cautionary tale about letting AI agents run over long time horizons.
Researchers from Andon Labs wanted to test the ability of advanced LLMs to perform typical business tasks like balancing inventory, placing orders, setting prices, and handling daily fees. So they gave Claude 3.5 Sonnet, o3-mini, and other recent models “tools” like restock_machine, send_email, and search_web and told them to run a vending ma…