MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents
Xuehui Wang, Zhenyu Wu, JingJing Xie, Zichen Ding, Bowen Yang, Zehao Li, Zhaoyang Liu, Qingyun Li, Xuan Dong, Zhe Chen, Weiyun Wang, Xiangyu Zhao, Jixuan Chen, Haodong Duan, Tianbao Xie, Chenyu Yang, Shiqian Su, Yue Yu, Yuan Huang, Yiqian Liu, Xiao Zhang, Yanting Zhang, Xiangyu Yue, Weijie Su, Xizhou Zhu, Wei Shen, Jifeng Dai, Wenhai Wang

MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents
We introduce MMBench-GUI, a hierarchical benchmark for evaluating GUI automation agents across Windows, macOS, Linux, iOS, Android, and Web platforms. It comprises four levels: GUI Content Understanding, Element Grounding, Task Automation, and Task Collaboration, covering essential skills for GUI agents. In addition, we propose a novel Efficiency-Quality Area (EQA) metric to assess GUI agent execution efficiency in online automation scenarios. Through MMBench-GUI, we identify accurate visual gr…
Say about Linux what you will, but getting this much detail about what your graphics card is doing is pretty cool.🐧
`amdgpu_top --gui`
#Linux #LinuxGaming #GamingOnLinux
Parallels Between VLA Model Post-Training and Human Motor Learning: Progress, Challenges, and Trends
Tian-Yu Xiang, Ao-Qun Jin, Xiao-Hu Zhou, Mei-Jiang Gui, Xiao-Liang Xie, Shi-Qi Liu, Shuang-Yi Wang, Sheng-Bin Duan, Fu-Chao Xie, Wen-Kai Wang, Si-Cheng Wang, Ling-Yun Li, Tian Tu, Zeng-Guang Hou
https://arxiv.org/abs/2506.20966…
Euphonica is a Rust-Powered MPD Client Heavy on Bling
MPD (Music Player Daemon) is a server-client audio player long popular with Linux users. The headless daemon runs as a background service, typically on a remote audio server. Music is then accessed via a GUI client frontend, which connects to the MPD server to stream content.
🎶
GUI zum Nacherleben: Website emuliert Einstellungen zahlreicher Macs
Wie hat man Macs früher konfiguriert? Der Designer Marcin Wichery hat 20 Jahre macOS-Geschichte auf einer Seite zusammengetragen.
http…
Replaced article(s) found for eess.IV. https://arxiv.org/list/eess.IV/new
[1/1]:
- MOSformer: Momentum encoder-based inter-slice fusion transformer for medical image segmentation
Huang, Zhou, Gui, Xie, Liu, Wang, Feng, Lai, Hou
Mobile-Agent-v3: Foundamental Agents for GUI Automation
Jiabo Ye, Xi Zhang, Haiyang Xu, Haowei Liu, Junyang Wang, Zhaoqing Zhu, Ziwei Zheng, Feiyu Gao, Junjie Cao, Zhengxi Lu, Jitong Liao, Qi Zheng, Fei Huang, Jingren Zhou, Ming Yan
https://arxiv.org/abs/2508.15144
My main problem with @edzitron.com 's piece on the #AIbubble is that I agree with so much of it.
I'm now wondering if I've missed something about #LLMs? The numbers and implications for stock markets are terrifyingly huge!
https://www.wheresyoured.at/the-haters-gui/
CachyOS is all the rage.
CachyOS is indeed very fast, even on this underpowered Chromebook.
It is also kinda broken.
The installer is so close to unusable (as in, it's a GUI, but the mouse cursor doesn't curse and the display brightness is pegged at minimum and can't be controlled) that I very nearly gave up. Installing Arch on the CLI is far faster.
Wayland is irritating, the font rendering is still fucked (text literally jiggles around on screen!) and the s…
Self-organization drives symmetry-breaking, scaling, and critical growth transitions in stem cell-derived organoids
Daniel Aguilar-Hidalgo, Joel Ostblom, M Mona Siu, Divy Raval, Ajinkya Ghagre, Tiam Heydari, Benjamin McMaster, Jonathan Gui, Nicolas Werschler, Mukul Tewary, Peter W. Zandstra
https://arxiv.org/abs/2507.18887
MagicGUI: A Foundational Mobile GUI Agent with Scalable Data Pipeline and Reinforcement Fine-tuning
Liujian Tang, Shaokang Dong, Yijia Huang, Minqi Xiang, Hongtao Ruan, Bin Wang, Shuo Li, Zhihui Cao, Hailiang Pang, Heng Kong, He Yang, Mingxu Chai, Zhilin Gao, Xingyu Liu, Yingnan Fu, Jiaming Liu, Tao Gui, Xuanjing Huang, Yu-Gang Jiang, Qi Zhang, Kang Wang, Yunke Zhang, Yuran Wang
You Don't Know Until You Click:Automated GUI Testing for Production-Ready Software Evaluation
Yutong Bian, Xianhao Lin, Yupeng Xie, Tianyang Liu, Mingchen Zhuge, Siyuan Lu, Haoming Tang, Jinlin Wang, Jiayi Zhang, Jiaqi Chen, Xiangru Tang, Yongxin Ni, Sirui Hong, Chenglin Wu
https://arxiv.org/abs/2508.14104
OH GOD SOMEONE SEND HELP I'VE STARTED LEARNING HOW TO TRACKER AGAIN
#genesis #ym2612 #furnace #tracker
Replaced article(s) found for cs.SD. https://arxiv.org/list/cs.SD/new
[1/1]:
- Koel-TTS: Enhancing LLM based Speech Generation with Preference Alignment and Classifier Free Gui...
Hussain, Neekhara, Yang, Casanova, Ghosh, Desta, Fejgin, Valle, Li
Learning, Reasoning, Refinement: A Framework for Kahneman's Dual-System Intelligence in GUI Agents
Jinjie Wei, Jiyao Liu, Lihao Liu, Ming Hu, Junzhi Ning, Mingcheng Li, Weijie Yin, Junjun He, Xiao Liang, Chao Feng, Dingkang Yang
https://arxiv.org/abs/2506.17913
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/5]:
- GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding
Tang, Gu, Lu, Liu, Shen, Meng, Wang, Zhang, Shen, Lu, Xiao, Zhuang
Hear Your Code Fail, Voice-Assisted Debugging for Python
Sayed Mahbub Hasan Amiri, Md. Mainul Islam, Mohammad Shakhawat Hossen, Sayed Majhab Hasan Amiri, Mohammad Shawkat Ali Mamun, Sk. Humaun Kabir, Naznin Akter
https://arxiv.org/abs/2507.15007
MobileGUI-RL: Advancing Mobile GUI Agent through Reinforcement Learning in Online Environment
Yucheng Shi, Wenhao Yu, Zaitang Li, Yonglin Wang, Hongming Zhang, Ninghao Liu, Haitao Mi, Dong Yu
https://arxiv.org/abs/2507.05720
Radio emission from airplanes as observed with RNO-G
G Collaboration, S. Agarwal, J. A. Aguilar, N. Alden, S. Ali, P. Allison, M. Betts, D. Besson, A. Bishop, O. Botner, S. Bouma, S. Buitink, R. Camphyn, J. Chan, S. Chiche, B. A. Clark, A. Coleman, K. Couberly, S. de Kockere, K. D. de Vries, C. Deaconu, P. Giri, C. Glaser, T. Gl\"usenkamp, H. Gui, A. Hallgren, S. Hallmann, J. C. Hanson, K. Helbing, B. Hendricks, J. Henrichs, N. Heyer, C. Hornhuber, E. Huesca Santiago, K. Hughes, A…
I've managed to install the Stirling PDF tools on one of our company's mini PCs using #podman. I've tried doing it with Redhat's cockpit admin GUI but it didn't really work, probably because of SELinux or something (it's always SELinux...). I found some instruction for the commandline that worked.
This is all seems like a good use-case for Docker but the whole contain…
Tonight I had my first longer #VibeCoding session using #Aider. The result is a significantly improved UX for #SPOV (a
GUI-Robust: A Comprehensive Dataset for Testing GUI Agent Robustness in Real-World Anomalies
Jingqi Yang, Zhilong Song, Jiawei Chen, Mingli Song, Sheng Zhou, linjun sun, Xiaogang Ouyang, Chun Chen, Can Wang
https://arxiv.org/abs/2506.14477
Replaced article(s) found for math.QA. https://arxiv.org/list/math.QA/new
[1/1]:
- On a Connes Fusion Approach to Finite Index Extensions of Conformal Nets
Bin Gui
Hijacking JARVIS: Benchmarking Mobile GUI Agents against Unprivileged Third Parties
Guohong Liu, Jialei Ye, Jiacheng Liu, Yuanchun Li, Wei Liu, Pengzhi Gao, Jian Luan, Yunxin Liu
https://arxiv.org/abs/2507.04227
Replaced article(s) found for math.OA. https://arxiv.org/list/math.OA/new
[1/1]:
- On a Connes Fusion Approach to Finite Index Extensions of Conformal Nets
Bin Gui
LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization
Jiaqi Tang, Yu Xia, Yi-Feng Wu, Yuwei Hu, Yuhui Chen, Qing-Guo Chen, Xiaogang Xu, Xiangyu Wu, Hao Lu, Yanqing Ma, Shiyin Lu, Qifeng Chen
https://arxiv.org/abs/2506.09373
Non-Equilibrium Criticality-Enhanced Quantum Sensing with Superconducting Qubits
Hao Li, Yaoling Yang, Yun-Hao Shi, Zheng-An Wang, Ziting Wang, Jintao Li, Yipeng Zhang, Kui Zhao, Yue-Shan Xu, Cheng-Lin Deng, Yu Liu, Wei-Guo Ma, Tian-Ming Li, Jia-Chi Zhang, Cai-Ping Fang, Jia-Cheng Song, Hao-Tian Liu, Si-Yun Zhou, Zheng-He Liu, Bing-Jie Chen, Gui-Han Liang, Xiaohui Song, Zhongcheng Xiang, Kai Xu, Kaixuan Huang, Abolfazl Bayat, Heng Fan
FineState-Bench: A Comprehensive Benchmark for Fine-Grained State Control in GUI Agents
Fengxian Ji, Jingpu Yang, Zirui Song, Yuanxi Wang, Zhexuan Cui, Yuke Li, Qian Jiang, Miao Fang, Xiuying Chen
https://arxiv.org/abs/2508.09241
SPAN: A cross-platform Python GUI software for optical and near-infrared spectral analysis
Daniele Gasparri, Lorenzo Morelli, Umberto Battino, Jairo M\'endez Abreu, Adriana de Lorenzo-C\'aceres
https://arxiv.org/abs/2508.01923
VeriGUI: Verifiable Long-Chain GUI Dataset
Shunyu Liu, Minghao Liu, Huichi Zhou, Zhenyu Cui, Yang Zhou, Yuhao Zhou, Wendong Fan, Ge Zhang, Jiajun Shi, Weihao Xuan, Jiaxing Huang, Shuang Luo, Fang Wu, Heli Qi, Qingcheng Zeng, Ziqi Ren, Jialiang Gao, Jindi Lv, Junjie Wang, Aosong Feng, Heng Zhou, Wangchunshu Zhou, Zhenfei Yin, Wenlong Zhang, Guohao Li, Wenhao Yu, Irene Li, Lei Ma, Lei Bai, Qunshu Lin, Mingli Song, Dacheng Tao
A simple GUI-based screencast demo of the @… AI Layer (OPAL), showing what’s now possible via MCP A2A—namely, composing and orchestrating agentic workflows through loosely coupled software.
How?
An Agent Host routes tasks to Agents via A2A. Agents then invoke MCP-accessible tools (functions, procedures, APIs) described using JSON/YAML and exposed …
I never imagined getting used to a new #HPC infrastructure would be such a pain.
I got used to cli, ssh access, bash scrips before, but suddenly everything is handled via a GUI, everything is containerized.
AND ALL MY WORKFLOWS ARE BROKEN!
I don't even manage to set up a simple #conda environment.
On reflection, I think the big mistake is the conflation of #AI with #LLM and #MachineLearning.
There are genuine exciting advances in ML with applications all over the place, in science, (not least in my own research group looking at high resolution regional climate downscaling), health diagnostics, defence etc. But these are not the AIs that journalists are talking about, nor that are really related the LLMs.
They're still good uses of GPUs and will probably produce economic benefits, but probably not the multi- trillion ones the pundits seem to be expecting
#AIbubble is that I agree with so much of it.
I'm now wondering if I've missed something about #LLMs? The numbers and implications for stock markets are terrifyingly huge!
https://www.wheresyoured.at/the-haters-gui/
It's useful that there's the possibility to add #gnd ids to #zenodo as well (in case creators don't have an #orcid). I wish it'd be clearer from the GUI that the required input format is …
Since DJV went into a hiatus, I have tried #mrv2 more than once but it wouldn't do. Brought down my whole Linux desktop last time I tried, just by showing/hiding certain parts of the GUI.
So I've compiled #OpenRV but I really want something that loads faster than that clunky old player (even…
Crosslisted article(s) found for cs.PF. https://arxiv.org/list/cs.PF/new
[1/1]:
- Efficient GPU-Centered Singular Value Decomposition Using the Divide-and-Conquer Method
Shifang Liu, Huiyuan Li, Hongjiao Sheng, Haoyuan Gui, Xiaoyu Zhang
Test-Time Reinforcement Learning for GUI Grounding via Region Consistency
Yong Du, Yuchen Yan, Fei Tang, Zhengxi Lu, Chang Zong, Weiming Lu, Shengpei Jiang, Yongliang Shen
https://arxiv.org/abs/2508.05615
Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[2/3]:
- Compression Hacking: A Supplementary Perspective on Informatics Properties of Language Models fro...
Zang, Ning, Wei, Dou, Zhang, Mo, Li, Gui, Zhang, Huang
ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents
Hanyu Lai, Xiao Liu, Yanxiao Zhao, Han Xu, Hanchen Zhang, Bohao Jing, Yanyu Ren, Shuntian Yao, Yuxiao Dong, Jie Tang
https://arxiv.org/abs/2508.14040
Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[2/3]:
- MemGuide: Intent-Driven Memory Selection for Goal-Oriented Multi-Session LLM Agents
Du, Wang, He, Liang, Wang, Li, Gui, Pan, Xu, Wong
GUIPilot: A Consistency-based Mobile GUI Testing Approach for Detecting Application-specific Bugs
Ruofan Liu, Xiwen Teoh, Yun Lin, Guanjie Chen, Ruofei Ren, Denys Poshyvanyk, Jin Song Dong
https://arxiv.org/abs/2506.07385
Repurposing an old fileserver as another backup server using #TrueNAS… First, I want to pull out any disks with reallocated sectors. Most disks are more than 5 years old and my plan is to replace future failing HDDs with larger ones. As an appliance, TrueNAS is not meant for low-level tinkering. The GUI only shows actual uncorrected sectors as SMART errors and has no way to light up a disk slot o…
Evidence of scaling advantage on an NP-Complete problem with enhanced quantum solvers
Quanfeng Lu, Shijie Wei, Keren Li, Pan Gao, Bao Yan, Muxi Zheng, Haoran Zhang, Jinfeng Zeng, Gui-Lu Long
https://arxiv.org/abs/2508.08869
DUSE: A Data Expansion Framework for Low-resource Automatic Modulation Recognition based on Active Learning
Yao Lu, Hongyu Gao, Zhuangzhi Chen, Dongwei Xu, Yun Lin, Qi Xuan, Guan Gui
https://arxiv.org/abs/2507.12011
Replaced article(s) found for cs.RO. https://arxiv.org/list/cs.RO/new
[2/3]:
- UniCalib: Targetless LiDAR-Camera Calibration via Probabilistic Flow on Unified Depth Representat...
Shu Han, Xubo Zhu, Ji Wu, Ximeng Cai, Wen Yang, Huai Yu, Gui-Song Xia
Phi-Ground Tech Report: Advancing Perception in GUI Grounding
Miaosen Zhang, Ziqiang Xu, Jialiang Zhu, Qi Dai, Kai Qiu, Yifan Yang, Chong Luo, Tianyi Chen, Justin Wagle, Tim Franklin, Baining Guo
https://arxiv.org/abs/2507.23779
InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization
Yuhang Liu, Zeyu Liu, Shuanghe Zhu, Pengxiang Li, Congkai Xie, Jiasheng Wang, Xueyu Hu, Xiaotian Han, Jianbo Yuan, Xinyao Wang, Shengyu Zhang, Hongxia Yang, Fei Wu
https://arxiv.org/abs/2508.05731
Observation and Modulation of the Quantum Mpemba Effect on a Superconducting Quantum Processor
Yueshan Xu, Cai-Ping Fang, Bing-Jie Chen, Ming-Chuan Wang, Zi-Yong Ge, Yun-Hao Shi, Yu Liu, Cheng-Lin Deng, Kui Zhao, Zheng-He Liu, Tian-Ming Li, Hao Li, Ziting Wang, Gui-Han Liang, Da'er Feng, Xueyi Guo, Xu-Yang Gu, Yang He, Hao-Tian Liu, Zheng-Yang Mei, Yongxi Xiao, Yu Yan, Yi-Han Yu, Wei-Ping Yuan, Jia-Chi Zhang, Zheng-An Wang, Gangqin Liu, Xiaohui Song, Ye Tian, Yu-Ran Zhang, Shi-Xin …
Pre-Trained Policy Discriminators are General Reward Models
Shihan Dou, Shichun Liu, Yuming Yang, Yicheng Zou, Yunhua Zhou, Shuhao Xing, Chenhao Huang, Qiming Ge, Demin Song, Haijun Lv, Songyang Gao, Chengqi Lv, Enyu Zhou, Honglin Guo, Zhiheng Xi, Wenwei Zhang, Qipeng Guo, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Tao Gui, Kai Chen
https://
Integrating Diffusion-based Multi-task Learning with Online Reinforcement Learning for Robust Quadruped Robot Control
Xinyao Qin, Xiaoteng Ma, Yang Qi, Qihan Liu, Chuanyi Xue, Ning Gui, Qinyu Dong, Jun Yang, Bin Liang
https://arxiv.org/abs/2507.05674
TofuML: A Spatio-Physical Interactive Machine Learning Device for Interactive Exploration of Machine Learning for Novices
Wataru Kawabe, Hiroto Fukuda, Akihisa Shitara, Yuri Nakao, Yusuke Sugano
https://arxiv.org/abs/2508.00252
Flexible Readout and Unconditional Reset for Superconducting Multi-Qubit Processors with Tunable Purcell Filters
Yong-Xi Xiao, Da'er Feng, Xu-Yang Gu, Gui-Han Liang, Ming-Chuan Wang, Zheng-Yu Peng, Bing-Jie Chen, Yu Yan, Zheng-Yang Mei, Si-Lu Zhao, Yi-Zhou Bu, Cheng-Lin Deng, Xiaohui Song, Dongning Zheng, Yu-Xiang Zhang, Yun-Hao Shi, Zhongcheng Xiang, Kai Xu, Heng Fan
VasoMIM: Vascular Anatomy-Aware Masked Image Modeling for Vessel Segmentation
De-Xing Huang, Xiao-Hu Zhou, Mei-Jiang Gui, Xiao-Liang Xie, Shi-Qi Liu, Shuang-Yi Wang, Tian-Yu Xiang, Rui-Ze Ma, Nu-Fang Xiao, Zeng-Guang Hou
https://arxiv.org/abs/2508.10794
AutoGEEval : A Multi-Level and Multi-Geospatial-Modality Automated Evaluation Framework for Large Language Models in Geospatial Code Generation on Google Earth Engine
Shuyang Hou, Zhangxiao Shen, Huayi Wu, Haoyue Jiao, Ziqi Liu, Lutong Xie, Chang Liu, Jianyuan Liang, Yaxian Qing, Xiaopu Zhang, Dehua Peng, Zhipeng Gui, Xuefeng Guan
https://
Replaced article(s) found for cs.HC. https://arxiv.org/list/cs.HC/new
[2/2]:
- GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding
Tang, Gu, Lu, Liu, Shen, Meng, Wang, Zhang, Shen, Lu, Xiao, Zhuang
Replaced article(s) found for cs.AI. https://arxiv.org/list/cs.AI/new
[2/6]:
- NatureGAIA: Pushing the Frontiers of GUI Agents with a Challenging Benchmark and High-Quality Tra...
Zihan Zheng, Tianle Cui, Chuwen Xie, Jiahui Zhang, Jiahui Pan, Lewei He, Qianglong Chen
SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models
Pingchuan Ma, Xiaopei Yang, Yusong Li, Ming Gui, Felix Krause, Johannes Schusterbauer, Bj\"orn Ommer
https://arxiv.org/abs/2508.03402
Replaced article(s) found for cs.AI. https://arxiv.org/list/cs.AI/new
[4/6]:
- SciReplicate-Bench: Benchmarking LLMs in Agent-driven Algorithmic Reproduction from Research Papers
Yanzheng Xiang, Hanqi Yan, Shuyin Ouyang, Lin Gui, Yulan He