InstaLILY, whose industry-specific AI agents called InstaWorkers integrate with legacy systems, raised $25M from Insight Partners (Mike Wheatley/SiliconANGLE)
https://siliconangle.com/2025/08/27/instalily-gets-25m-accelerate-industr…
MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents
Xuehui Wang, Zhenyu Wu, JingJing Xie, Zichen Ding, Bowen Yang, Zehao Li, Zhaoyang Liu, Qingyun Li, Xuan Dong, Zhe Chen, Weiyun Wang, Xiangyu Zhao, Jixuan Chen, Haodong Duan, Tianbao Xie, Chenyu Yang, Shiqian Su, Yue Yu, Yuan Huang, Yiqian Liu, Xiao Zhang, Yanting Zhang, Xiangyu Yue, Weijie Su, Xizhou Zhu, Wei Shen, Jifeng Dai, Wenhai Wang

We introduce MMBench-GUI, a hierarchical benchmark for evaluating GUI automation agents across Windows, macOS, Linux, iOS, Android, and Web platforms. It comprises four levels: GUI Content Understanding, Element Grounding, Task Automation, and Task Collaboration, covering essential skills for GUI agents. In addition, we propose a novel Efficiency-Quality Area (EQA) metric to assess GUI agent execution efficiency in online automation scenarios. Through MMBench-GUI, we identify accurate visual gr…
Maintenance automation: methods for robotics manipulation planning and execution
Christian Friedrich, Ralf Gulde, Armin Lechler, Alexander Verl
https://arxiv.org/abs/2508.18399 …
Research on Sectionalizing Switches Placement Problem of Distribution System Automation Based on Multi-Objective Optimization Analysis
Selma Cheshmeh Khavar, Arya Abdollahi
https://arxiv.org/abs/2507.19029
Maisa AI, which offers an enterprise agentic automation service that uses a proprietary system to limit hallucinations, raised $25M seed led by Creandum (Anna Heim/TechCrunch)
https://techcrunch.com/2025/08/27/maisa-ai-gets-25m-to-fix-enterprise-ais…
I’ve been discussing some agent swarm based development work on LinkedIn. So far it’s going well, I’m figuring out how to get the results I want from the tools. As I say there, it feels more like managing a team of experienced product managers and developers (which I’ve done a few times in my career) than doing developer work faster.
CHEMSMART: Chemistry Simulation and Modeling Automation Toolkit for High-Efficiency Computational Chemistry Workflows
Xinglong Zhang, Huiwen Tan, Jingyi Liu, Zihan Li, Lewen Wang, Benjamin W. J. Chen
https://arxiv.org/abs/2508.20042
Efficient task and path planning for maintenance automation using a robot system
Christian Friedrich, Akos Csiszar, Armin Lechler, Alexander Verl
https://arxiv.org/abs/2508.18400
The popular meaning of "luddite" is a straw-man. It's a sloppy word with a sloppy meaning now, and it's one we'd do well to watch out for.
The actual history of who the Luddites were is far more interesting. They were at the center of hard-fought struggles against factory owners who disrupted the economies of entire towns and cities with terrible results, centralizing power and money and leaving a great number of people without any control of their work: former artisans who'd had a hand in their own work, many of them automated out of jobs. Luddites destroyed automated looms not because they hated technology. They destroyed automated looms because the looms were taking the livelihood they depended on, with no recourse. It was a disaster for a good while, and millwork has since gone from those places, probably forever.
The problem now with LLMs and automated research systems is there's very little way for workers and creators to stick their shoes in the machinery. They've tried (https://arxiv.org/abs/2407.12281) but mostly failed, since unlike a factory full of textile workers, the equipment is remote, the automation virtual, an intangible software object that few can access in any meaningful way.
Large Language Models (LLMs) for Electronic Design Automation (EDA)
Kangwei Xu, Denis Schwachhofer, Jason Blocklove, Ilia Polian, Peter Domanski, Dirk Pflüger, Siddharth Garg, Ramesh Karri, Ozgur Sinanoglu, Johann Knechtel, Zhuorui Zhao, Ulf Schlichtmann, Bing Li
https://arxiv.org/abs/2508.20030…
Reading minds on the road: decoding perceived risk in automated vehicles through 140K ratings
Xiaolin He, Zirui Li, Xinwei Wang, Riender Happee, Meng Wang
https://arxiv.org/abs/2508.19121
HAMSA: Hijacking Aligned Compact Models via Stealthy Automation
Alexey Krylov, Iskander Vagizov, Dmitrii Korzh, Maryam Douiba, Azidine Guezzaz, Vladimir Kokh, Sergey D. Erokhin, Elena V. Tutubalina, Oleg Y. Rogov
https://arxiv.org/abs/2508.16484
Dutch web design automation startup Framer raised $100M led by Meritech and Atomico with Accel participation at a $2B valuation, after raising $27M in 2023 (Yazhou Sun/Bloomberg)
https://www.bloomberg.com/news/articles/20
My experience is limited to ASP.NET, SQL, and some Python at a Fortune 500 company, so take this with a grain of salt. When he talks about agents, it sounds like automation to me. I’ve been writing jobs (or agents) for decades to run automated tasks on a schedule. If you want to add another point of failure into your job, knock yourself out. Also, I’m wary of encouraging neophytes to outsource work they don’t know how to do. That’s a recipe for disaster.
CFTel: A Practical Architecture for Robust and Scalable Telerobotics with Cloud-Fog Automation
Thien Tran, Jonathan Kua, Minh Tran, Honghao Lyu, Thuong Hoang, Jiong Jin
https://arxiv.org/abs/2506.17991
Airalogy: AI-empowered universal data digitization for research automation
Zijie Yang, Qiji Zhou, Fang Guo, Sijie Zhang, Yexun Xi, Jinglei Nie, Yudian Zhu, Liping Huang, Chou Wu, Yonghe Xia, Xiaoyu Ma, Yingming Pu, Panzhong Lu, Junshu Pan, Mingtao Chen, Tiannan Guo, Yanmei Dou, Hongyu Chen, Anping Zeng, Jiaxing Huang, Tian Xu, Yue Zhang
https://
Technical Implementation of Tippy: Multi-Agent Architecture and System Design for Drug Discovery Laboratory Automation
Yao Fehlis, Charles Crain, Aidan Jensen, Michael Watson, James Juhasz, Paul Mandel, Betty Liu, Shawn Mahon, Daren Wilson, Nick Lynch-Jonely, Ben Leedom, David Fuller
https://arxiv.org/abs/2507.17852
Israeli customer service automation company Nice acquires Cognigy, which develops a conversational and agentic AI platform, for $955M, set to close in Q4 2025 (Shiri Habib-Valdhorn/Globes)
https://en.globes.co.il/en/article-nice-acquires-ai-co-cognigy-…
On my side of iOS 18.5, automation based on application start seems to be completely broken.
This doesn’t do anything
So I had this terrible idea a few years ago to write some infrastructure automation that provisions a new compute instance, sets up secrets storage, configures IAM roles, and authorizes the new instance to provision new instances and roles via infra-as-code automation. The new instance then tears down the instance and roles that created it before creating its own new compute instance, and so on.
Like a self-propagating glider in Conway's Game of Life, except with cloud inf…
Multimodal Behaviour Trees for Robotic Laboratory Task Automation
Hatem Fakhruldeen, Arvind Raveendran Nambiar, Satheeshkumar Veeramani, Bonilkumar Vijaykumar Tailor, Hadi Beyzaee Juneghani, Gabriella Pizzuto, Andrew Ian Cooper
https://arxiv.org/abs/2506.20399
DHMS: A Digital Hostel Management System Integrating Campus ChatBot, Predictive Intelligence, and Real-Time Automation
Riddhi Heda, Sidhant Singh, Umair Yasir, Tanmay Jaiswal, Anil Mokhade
https://arxiv.org/abs/2507.17759
Especially since the "AI" bubble keeps focusing (through defense tech and supposed "workflow automation") on the governments of the world as bagholders.
https://tldr.nettime.org/@tante/115066177879999756
@… Lots of PRs are only annoying if the automation is bad.
Learn from previous runs to improve future automation and save plans in a gallery for reuse.
🔀 Parallel execution
Run multiple tasks simultaneously with status indicators for efficient workflow management.
🌐 Web automation
Browse websites, fill forms, navigate deep sites not indexed by search engines, with real-time browser view.
💻 Code execution
Generate and execute code alongside web browsing for comprehensive task automation capabilities.
🔗
Combined Stochastic and Robust Optimization for Electric Autonomous Mobility-on-Demand with Nested Benders Decomposition
Sten Elling Tingstad Jacobsen, Balázs Kulcsár, Anders Lindman
https://arxiv.org/abs/2508.19933
Leveraging Cloud-Fog Automation for Autonomous Collision Detection and Classification in Intelligent Unmanned Surface Vehicles
Thien Tran, Quang Nguyen, Jonathan Kua, Minh Tran, Toan Luu, Thuong Hoang, Jiong Jin
https://arxiv.org/abs/2506.18024
AI Agents for Photonic Integrated Circuit Design Automation
Ankita Sharma, YuQi Fu, Vahid Ansari, Rishabh Iyer, Fiona Kuang, Kashish Mistry, Raisa Islam Aishy, Sara Ahmad, Joaquin Matres, Dirk R. Englund, Joyce K. S. Poon
https://arxiv.org/abs/2508.14123
Blackstone agrees to acquire a majority stake in NetBrain, valuing the Burlington, Massachusetts-based network operations and automation company at $750M (Ryan Gould/Bloomberg)
https://www.bloomberg.com/news/articles/2025-07-…
Mobile-Agent-v3: Foundamental Agents for GUI Automation
Jiabo Ye, Xi Zhang, Haiyang Xu, Haowei Liu, Junyang Wang, Zhaoqing Zhu, Ziwei Zheng, Feiyu Gao, Junjie Cao, Zhengxi Lu, Jitong Liao, Qi Zheng, Fei Huang, Jingren Zhou, Ming Yan
https://arxiv.org/abs/2508.15144
🌟 New SIGs Spotlight: SIG-AI 🌟
A new space for collaboration on Artificial Intelligence within the NREN community is here.
SIG-AI brings the Research & Education community together to share expertise, best practices, and explore practical use cases of AI in NREN context—from cybersecurity and High-Performance Computing (HPC) to network automation and next-generation networks.
📖 For more insights, read the full interview with Leonie Schäfer (@…
CYCLE-INSTRUCT: Fully Seed-Free Instruction Tuning via Dual Self-Training and Cycle Consistency
Zhanming Shen, Hao Chen, Yulei Tang, Shaolin Zhu, Wentao Ye, Xiaomeng Hu, Haobo Wang, Gang Chen, Junbo Zhao
https://arxiv.org/abs/2508.16100
Code Difference Guided Fuzzing for FPGA Logic Synthesis Compilers via Bayesian Optimization
Zhihao Xu, Shikai Guo, Guilin Zhao, Peiyu Zou, Siwen Wang, Qian Ma, Hui Li, Furui Zhan
https://arxiv.org/abs/2508.17713
AutoGuardX: A Comprehensive Cybersecurity Framework for Connected Vehicles
Muhammad Ali Nadeem, Bishwo Prakash Pokharel, Naresh Kshetri, Achyut Shankar, Gokarna Sharma
https://arxiv.org/abs/2508.18155
Thoma Bravo buys customer service automation software provider Verint for $1.23B in cash, days after its $12.3B purchase of HR software provider Dayforce (Ryan Gould/Bloomberg)
https://www.bloomberg.com/news/articles/2025-08-25/tho…
SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models
Dipayan Saha, Shams Tarek, Hasan Al Shaikh, Khan Thamid Hasan, Pavan Sai Nalluri, Md. Ajoad Hasan, Nashmin Alam, Jingbo Zhou, Sujan Kumar Saha, Mark Tehranipoor, Farimah Farahmandi
https://arxiv.org/abs/2506.20415…
Intent-Based Network for RAN Management with Large Language Models
Fransiscus Asisi Bimo, Maria Amparo Canaveras Galdon, Chun-Kai Lai, Ray-Guang Cheng, Edwin K. P. Chong
https://arxiv.org/abs/2507.14230
How German delivery giant DHL uses automation and AI to help offset an ageing workforce, with one in three support staff set to retire in the next five years (Andrew Hill/Financial Times)
https://www.ft.com/content/ce09786f-2481-44fe-957c-f7bb0b43e284
MMReview: A Multidisciplinary and Multimodal Benchmark for LLM-Based Peer Review Automation
Xian Gao, Jiacheng Ruan, Zongyun Zhang, Jingsheng Gao, Ting Liu, Yuzhuo Fu
https://arxiv.org/abs/2508.14146
FedChip: Federated LLM for Artificial Intelligence Accelerator Chip Design
Mahmoud Nazzal, Khoa Nguyen, Deepak Vungarala, Ramtin Zand, Shaahin Angizi, Hai Phan, Abdallah Khreishah
https://arxiv.org/abs/2508.13162
I2I-STRADA -- Information to Insights via Structured Reasoning Agent for Data Analysis
SaiBarath Sundar, Pranav Satheesan, Udayaadithya Avadhanam
https://arxiv.org/abs/2507.17874
LLMind 2.0: Distributed IoT Automation with Natural Language M2M Communication and Lightweight LLM Agents
Yuyang Du, Qun Yang, Liujianfu Wang, Jingqi Lin, Hongwei Cui, Soung Chang Liew
https://arxiv.org/abs/2508.13920
DEV: A Driver-Environment-Vehicle Closed-Loop Framework for Risk-Aware Adaptive Automation of Driving
Anaïs Halin, Christel Devue, Marc Van Droogenbroeck
https://arxiv.org/abs/2508.10618
Cognitive Agents Powered by Large Language Models for Agile Software Project Management
Konrad Cinkusz, Jarosław A. Chudziak, Ewa Niewiadomska-Szynkiewicz
https://arxiv.org/abs/2508.16678
Robotic System for Chemical Experiment Automation with Dual Demonstration of End-effector and Jig Operations
Hikaru Sasaki, Naoto Komeno, Takumi Hachimine, Kei Takahashi, Yu-ya Ohnishi, Tetsunori Sugawara, Araki Wakiuchi, Miho Hatanaka, Tomoyuki Miyao, Hiroharu Ajiro, Mikiya Fujii, Takamitsu Matsubara
https://arxiv.org/abs/2506.…
Pactum, which helps companies automate supplier negotiations with AI and secure more favorable contractual terms, raised a $54M Series C led by Insight Partners (Maria Deutscher/SiliconANGLE)
https://siliconangle.com/2025/06/09/pactum-raises-…
ARCADE: A RAN Diagnosis Methodology in a Hybrid AI Environment for 6G Networks
Daniel Ricardo Cunha Oliveira, Rodrigo Moreira, Flávio de Oliveira Silva
https://arxiv.org/abs/2507.17861
Join us on June 26th in Mannheim for the Sylius meetup! We have an exciting lineup of speakers, including Max Pesch, who will share a post-mortem analysis of Brille24, Jacques Bodin-Hullin, who will dive into Sylius and automation, and Stephan Hochdörfer, who will introduce the Sylius stack.
Register here: https://www.
Toward an Intent-Based and Ontology-Driven Autonomic Security Response in Security Orchestration Automation and Response
Zequan Huang, Jacques Robin, Nicolas Herbaut, Nourhène Ben Rabah, Bénédicte Le Grand
https://arxiv.org/abs/2507.12061
Mechanical Automation with Vision: A Design for Rubik's Cube Solver
Abhinav Chalise, Nimesh Gopal Pradhan, Nishan Khanal, Prashant Raj Bista, Dinesh Baniya Kshatri
https://arxiv.org/abs/2508.12469 …
DHL, UPS, FedEx, and Walmart are using robots to boost warehouse efficiency and cut costs, including automating the physically demanding task of loading trucks (Esther Fung/Wall Street Journal)
https://www.
SheetMind: An End-to-End LLM-Powered Multi-Agent Framework for Spreadsheet Automation
Ruiyan Zhu, Xi Cheng, Ke Liu, Brian Zhu, Daniel Jin, Neeraj Parihar, Zhoutian Xu, Oliver Gao
https://arxiv.org/abs/2506.12339
Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[3/3]:
- CRISPR-GPT for Agentic Automation of Gene-editing Experiments
Qu, Huang, Yin, Zhan, Liu, Yin, Cousins, Johnson, Wang, Shah, Altman, Zhou, Wang, Cong
Toward General Physical Intelligence for Resilient Agile Manufacturing Automation
Sandeep Kanta, Mehrdad Tavassoli, Varun Teja Chirkuri, Venkata Akhil Kumar, Santhi Bharath Punati, Praveen Damacharla, Sunny Katyara
https://arxiv.org/abs/2508.11960
Navigating the growing field of research on AI for software testing -- the taxonomy for AI-augmented software testing and an ontology-driven literature survey
Ina K. Schieferdecker
https://arxiv.org/abs/2506.14640

In industry, software testing is the primary method to verify and validate the functionality, performance, security, usability, and so on, of software-based systems. Test automation has gained increasing attention in industry over the last decade, following decades of intense research into test automation and model-based testing. However, designing, developing, maintaining and evolving test automation is a considerable effort. Meanwhile, AI's breakthroughs in many engineering fields are opening…
AutoGraph: A Knowledge-Graph Framework for Modeling Interface Interaction and Automating Procedure Execution in Digital Nuclear Control Rooms
Xingyu Xiao, Jiejuan Tong, Jun Sun, Zhe Sui, Jingang Liang, Hongru Zhao, Jun Zhao, Haitao Wang
https://arxiv.org/abs/2506.18727
LLM-as-a-Judge for Reference-less Automatic Code Validation and Refinement for Natural Language to Bash in IT Automation
Ngoc Phuoc An Vo, Brent Paulovicks, Vadim Sheinin
https://arxiv.org/abs/2506.11237
Replaced article(s) found for cs.AI. https://arxiv.org/list/cs.AI/new
[1/5]:
- CRISPR-GPT for Agentic Automation of Gene-editing Experiments
Qu, Huang, Yin, Zhan, Liu, Yin, Cousins, Johnson, Wang, Shah, Altman, Zhou, Wang, Cong
Commure, which provides ambient AI, revenue cycle management, and workflow automation tools for healthcare providers, raised $200M from General Catalyst (Erin Brodwin/Axios)
https://www.axios.com/pro/health-tech-deals/2025/06/19/commure-…
NYC-based Tennr, which uses AI document parsing and workflow automation to cut patient waiting times, raised a $101M Series C led by IVP at a $605M valuation (Leo Schwartz/Fortune)
https://fortune.com/2025/06/18/tennr-healt…
Q&A with Taskrabbit CEO Ania Smith on the Ikea-owned platform's history, Taskers earning up to $50/hour, AI assistants, zero fees, high suburban use, and more (Nilay Patel/The Verge)
https://www.theverge.com/decoder-podcast-…
How AI and automation are transforming agriculture, enabling autonomous tractors and fruit-picking robots, and improving crop management via data and analytics (William Boston/Wall Street Journal)
https://www.wsj.com/tech/autonomous-farmin
Behavior Driven Development for 3D Games
Fernando Pastor Ricós, Beatriz Marín, I. S. W. B. Prasetya, Tanja E. J. Vos, Joseph Davidson, Karel Hovorka
https://arxiv.org/abs/2506.17057
Studies: women are 25% less likely than men to have basic digital skills, are more likely to be in automation-threatened jobs, and use ChatGPT less at work (Isabel Berwick/Financial Times)
https://www.ft.com/content/7f0fbd7d-011a-448d-9d23-8a8db2006df4
Hadrian, which is building largely automated factories to produce space and defense parts, raised a $260M Series C, bringing its total funding to ~$500M (Aria Alamalhodaei/TechCrunch)
https://techcrunch.com/2025/07/17/hadria…
In March, New York added a checkbox to its WARN system for companies to show if "technological innovation or automation", like AI, was a reason for mass layoffs (Bloomberg)
https://www.bloomberg.com/news/newsletters
A look at Walmart's use of automation to grow sales without adding staff, as analysts say its efforts raise questions about the future of US retail labor (Gregory Meyer/Financial Times)
https://www.ft.com/content/5be70b28-018d-42d7-af8d-ea5a4bed4d44