
2025-10-13 09:44:00
Auto-scaling Continuous Memory for GUI Agent
Wenyi Wu, Kun Zhou, Ruoxin Yuan, Vivian Yu, Stephen Wang, Zhiting Hu, Biwei Huang
https://arxiv.org/abs/2510.09038 https://
Auto-scaling Continuous Memory for GUI Agent
Wenyi Wu, Kun Zhou, Ruoxin Yuan, Vivian Yu, Stephen Wang, Zhiting Hu, Biwei Huang
https://arxiv.org/abs/2510.09038 https://
FineState-Bench: A Comprehensive Benchmark for Fine-Grained State Control in GUI Agents
Fengxian Ji, Jingpu Yang, Zirui Song, Yuanxi Wang, Zhexuan Cui, Yuke Li, Qian Jiang, Miao Fang, Xiuying Chen
https://arxiv.org/abs/2508.09241
#HomeAssistant is pretty good.
It has some quirks, but it's not as much hassle as I feared. I had to edit 4 lines of YAML, but everything else I wanted was in the GUI.
Finally, switch to the #Matter standard paid off. I can keep devices shared with HomeKit, while having way bet…
Crosslisted article(s) found for cs.LG. https://arxiv.org/list/cs.LG/new
[3/5]:
- Auto-scaling Continuous Memory for GUI Agent
Wenyi Wu, Kun Zhou, Ruoxin Yuan, Vivian Yu, Stephen Wang, Zhiting Hu, Biwei Huang
Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[2/3]:
- MemGuide: Intent-Driven Memory Selection for Goal-Oriented Multi-Session LLM Agents
Du, Wang, He, Liang, Wang, Li, Gui, Pan, Xu, Wong
Evidence of scaling advantage on an NP-Complete problem with enhanced quantum solvers
Quanfeng Lu, Shijie Wei, Keren Li, Pan Gao, Bao Yan, Muxi Zheng, Haoran Zhang, Jinfeng Zeng, Gui-Lu Long
https://arxiv.org/abs/2508.08869
MagicGUI: A Foundational Mobile GUI Agent with Scalable Data Pipeline and Reinforcement Fine-tuning
Liujian Tang, Shaokang Dong, Yijia Huang, Minqi Xiang, Hongtao Ruan, Bin Wang, Shuo Li, Zhihui Cao, Hailiang Pang, Heng Kong, He Yang, Mingxu Chai, Zhilin Gao, Xingyu Liu, Yingnan Fu, Jiaming Liu, Tao Gui, Xuanjing Huang, Yu-Gang Jiang, Qi Zhang, Kang Wang, Yunke Zhang, Yuran Wang
GUI-ReRank: Enhancing GUI Retrieval with Multi-Modal LLM-based Reranking
Kristian Kolthoff, Felix Kretzer, Christian Bartelt, Alexander Maedche, Simone Paolo Ponzetto
https://arxiv.org/abs/2508.03298
An Open-Source Simulation and Data Management Tool for EnergyPlus Building Models
Ninad Gaikwad, Kasey Dettlaff, Athul Jose P, Anamika Dubey
https://arxiv.org/abs/2508.09130 htt…
On the existence of self-similar solutions to the steady Navier-Stokes equations in high dimensions
Jeaheang Bang, Changfeng Gui, Hao Liu, Yun Wang, Chunjing Xie
https://arxiv.org/abs/2510.10488
Breaking the Sabatier Principle by Dynamic Adsorption-Desorption Decoupling in Electrocatalytic Hydrogen Evolution
Zi-Xuan Yang, Lei Li, Tao Huang, Hui Wan, X. S. Wang, Gui-Fang Huang, Wangyu Hu, Wei-Qing Huang
https://arxiv.org/abs/2510.10555
ReInAgent: A Context-Aware GUI Agent Enabling Human-in-the-Loop Mobile Task Navigation
Haitao Jia, Ming He, Zimo Yin, Likang Wu, Jianping Fan, Jitao Sang
https://arxiv.org/abs/2510.07988
Trion FPGA adventure update:
It looks like you need to import the generated .isf into the interface designer if you want to switch between RTL and GUI flows (e.g. to use the GUI to constrain I/O blocks that were inferred from RTL). This is a pain, but at least it's something I can work around now that I understand it.
Next problem: I think I've found my first datasheet errata. The EFX_DDIO documentation in the Quantium Trion Primitives User Guide lists DDIO, DDIO_RESYNC, …
Dexplore: Scalable Neural Control for Dexterous Manipulation from Reference-Scoped Exploration
Sirui Xu, Yu-Wei Chao, Liuyu Bian, Arsalan Mousavian, Yu-Xiong Wang, Liang-Yan Gui, Wei Yang
https://arxiv.org/abs/2509.09671
Test-Time Reinforcement Learning for GUI Grounding via Region Consistency
Yong Du, Yuchen Yan, Fei Tang, Zhengxi Lu, Chang Zong, Weiming Lu, Shengpei Jiang, Yongliang Shen
https://arxiv.org/abs/2508.05615
VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents
Zheng Wu, Heyuan Huang, Xingyu Lou, Xiangmou Qu, Pengzhou Cheng, Zongru Wu, Weiwen Liu, Weinan Zhang, Jun Wang, Zhaoxiang Wang, Zhuosheng Zhang
https://arxiv.org/abs/2509.07553
A Multimodal GUI Architecture for Interfacing with LLM-Based Conversational Assistants
Hans G. W. van Dam
https://arxiv.org/abs/2510.06223 https://arxiv.or…
APEX: Automatic Event Sequence Generation for Android Applications
Wenhao Chen, Morris Chang, Witawas Srisa-an, Yong Guan
https://arxiv.org/abs/2509.02412 https://
GUI zum Nacherleben: Website emuliert Einstellungen zahlreicher Macs
Wie hat man Macs früher konfiguriert? Der Designer Marcin Wichery hat 20 Jahre macOS-Geschichte auf einer Seite zusammengetragen.
http…
InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization
Yuhang Liu, Zeyu Liu, Shuanghe Zhu, Pengxiang Li, Congkai Xie, Jiasheng Wang, Xueyu Hu, Xiaotian Han, Jianbo Yuan, Xinyao Wang, Shengyu Zhang, Hongxia Yang, Fei Wu
https://arxiv.org/abs/2508.05731
Weekend list of critical reading links about the state[1] of Tech, AI[2] hype/finance/politics, mostly long form:
Ed Zitron's The Hater's Guide To The AI Bubble
https://www.wheresyoured.at/the-haters-gui/
How to use computing power faster: on the weird ec…
Observation and Modulation of the Quantum Mpemba Effect on a Superconducting Quantum Processor
Yueshan Xu, Cai-Ping Fang, Bing-Jie Chen, Ming-Chuan Wang, Zi-Yong Ge, Yun-Hao Shi, Yu Liu, Cheng-Lin Deng, Kui Zhao, Zheng-He Liu, Tian-Ming Li, Hao Li, Ziting Wang, Gui-Han Liang, Da'er Feng, Xueyi Guo, Xu-Yang Gu, Yang He, Hao-Tian Liu, Zheng-Yang Mei, Yongxi Xiao, Yu Yan, Yi-Han Yu, Wei-Ping Yuan, Jia-Chi Zhang, Zheng-An Wang, Gangqin Liu, Xiaohui Song, Ye Tian, Yu-Ran Zhang, Shi-Xin …
Was able to grab and install Lagrange (for accessing Gopher/Gemini sites) here on my laptop that has Mint on it, just by searching for Lagrange in the Software Manager (gotta love Linux). Installed, launched-- and yeah, it really is a great, clean GUI. Also, you can see my Rogue desktop background 😂
#Gemini #Gopher …
MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents
Xuehui Wang, Zhenyu Wu, JingJing Xie, Zichen Ding, Bowen Yang, Zehao Li, Zhaoyang Liu, Qingyun Li, Xuan Dong, Zhe Chen, Weiyun Wang, Xiangyu Zhao, Jixuan Chen, Haodong Duan, Tianbao Xie, Chenyu Yang, Shiqian Su, Yue Yu, Yuan Huang, Yiqian Liu, Xiao Zhang, Yanting Zhang, Xiangyu Yue, Weijie Su, Xizhou Zhu, Wei Shen, Jifeng Dai, Wenhai Wang
Replaced article(s) found for cs.RO. https://arxiv.org/list/cs.RO/new
[2/3]:
- UniCalib: Targetless LiDAR-Camera Calibration via Probabilistic Flow on Unified Depth Representat...
Shu Han, Xubo Zhu, Ji Wu, Ximeng Cai, Wen Yang, Huai Yu, Gui-Song Xia
Validity and Power of Heavy-Tailed Combination Tests under Asymptotic Dependence
Lin Gui, Tiantian Mao, Jingshu Wang, Ruodu Wang
https://arxiv.org/abs/2508.05818 https://…
Crosslisted article(s) found for cs.DC. https://arxiv.org/list/cs.DC/new
[1/1]:
- FedQS: Optimizing Gradient and Model Aggregation for Semi-Asynchronous Federated Learning
Yunbo Li, Jiaping Gui, Zhihang Deng, Fanchao Meng, Yue Wu
VeriGUI: Verifiable Long-Chain GUI Dataset
Shunyu Liu, Minghao Liu, Huichi Zhou, Zhenyu Cui, Yang Zhou, Yuhao Zhou, Wendong Fan, Ge Zhang, Jiajun Shi, Weihao Xuan, Jiaxing Huang, Shuang Luo, Fang Wu, Heli Qi, Qingcheng Zeng, Ziqi Ren, Jialiang Gao, Jindi Lv, Junjie Wang, Aosong Feng, Heng Zhou, Wangchunshu Zhou, Zhenfei Yin, Wenlong Zhang, Guohao Li, Wenhao Yu, Irene Li, Lei Ma, Lei Bai, Qunshu Lin, Mingli Song, Dacheng Tao
Hab mir nun einen zweiten AP gekauft der nur via VPN ins Internet darf. Das GL-inet Gerät ist klein, aber die Leistung beeindruckt. Hat eine sehr gute GUI und darunter läuft ein OpenWRT. AdGuard z.b. kannst du einfach per klick installieren, VPN Einrichtung geht ebenfalls super einfach.
https://amzn.eu/d/3hj3EAC…
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
Zhiheng Xi, Jixuan Huang, Chenyang Liao, Baodai Huang, Honglin Guo, Jiaqi Liu, Rui Zheng, Junjie Ye, Jiazheng Zhang, Wenxiang Chen, Wei He, Yiwen Ding, Guanyu Li, Zehui Chen, Zhengyin Du, Xuesong Yao, Yufei Xu, Jiecao Chen, Tao Gui, Zuxuan Wu, Qi Zhang, Xuanjing Huang, Yu-Gang Jiang
KG-RAG: Enhancing GUI Agent Decision-Making via Knowledge Graph-Driven Retrieval-Augmented Generation
Ziyi Guan, Jason Chun Lok Li, Zhijian Hou, Pingping Zhang, Donglai Xu, Yuzhi Zhao, Mengyang Wu, Jinpeng Chen, Thanh-Toan Nguyen, Pengfei Xian, Wenao Ma, Shengchao Qin, Graziano Chesi, Ngai Wong
https://arxiv.org/abs/2509.00366
GUI-KV: Efficient GUI Agents via KV Cache with Spatio-Temporal Awareness
Kung-Hsiang Huang, Haoyi Qiu, Yutong Dai, Caiming Xiong, Chien-Sheng Wu
https://arxiv.org/abs/2510.00536
Numerical analysis of the homogeneous Landau equation: approximation, error estimates and simulation
Francis Filbet (IMT), Yanzhi Gui (THU), Ling-Bing He (THU)
https://arxiv.org/abs/2509.09276
Texture-zeros in minimal seesaw from non-invertible symmetry fusion rules
Zheng Jiang, Bu-Yao Qu, Gui-Jun Ding
https://arxiv.org/abs/2510.07236 https://arx…
My main problem with @edzitron.com 's piece on the #AIbubble is that I agree with so much of it.
I'm now wondering if I've missed something about #LLMs? The numbers and implications for stock markets are terrifyingly huge!
https://www.wheresyoured.at/the-haters-gui/
SPAN: A cross-platform Python GUI software for optical and near-infrared spectral analysis
Daniele Gasparri, Lorenzo Morelli, Umberto Battino, Jairo M\'endez Abreu, Adriana de Lorenzo-C\'aceres
https://arxiv.org/abs/2508.01923
About a month ago I was a bit too optimistic*, the admin took over a month to complete....: Only last Tuesday Reasonable Sourcery was officially inCOOPerated.
Today we finally sent out the official introduction:
<https://lists.gnu.org/archive/html/guix-devel/2025…
Replaced article(s) found for cs.SE. https://arxiv.org/list/cs.SE/new
[1/1]:
- Large Language Models for Mobile GUI Text Input Generation: An Empirical Study
Chenhui Cui, Tao Li, Junjie Wang, Chunyang Chen, Dave Towey, Rubing Huang
PG-Agent: An Agent Powered by Page Graph
Weizhi Chen, Ziwei Wang, Leyang Yang, Sheng Zhou, Xiaoxuan Tang, Jiajun Bu, Yong Li, Wei Jiang
https://arxiv.org/abs/2509.03536 https://…
If you just need a pretty figure from a dataset and not the full power of R, have a look at #gui
Multi-modal Uncertainty Robust Tree Cover Segmentation For High-Resolution Remote Sensing Images
Yuanyuan Gui, Wei Li, Yinjian Wang, Xiang-Gen Xia, Mauro Marty, Christian Ginzler, Zuyuan Wang
https://arxiv.org/abs/2509.04870
In my day we used dd to create bootable USB media. You're trying to sell me on a GUI with an upsell built in? Pass.
InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation
Sirui Xu, Dongting Li, Yucheng Zhang, Xiyan Xu, Qi Long, Ziyin Wang, Yunzhi Lu, Shuchang Dong, Hezi Jiang, Akshat Gupta, Yu-Xiong Wang, Liang-Yan Gui
https://arxiv.org/abs/2509.09555
I will be talking at the next #DevOps #Prague meetup. You should join. It's free to attend and requires registration.
https://www.
🇺🇦 #NowPlaying on #BBC6Music's #AmyLamé
High Vis:
🎵 Guided Tour
#HighVis
https://highvis.bandcamp.com/track/guided-tour
https://open.spotify.com/track/0afflx9Jk8wIFitFncBnyS
SparkUI-Parser: Enhancing GUI Perception with Robust Grounding and Parsing
Hongyi Jing, Jiafu Chen, Chen Rao, Ziqiang Dang, Jiajie Teng, Tianyi Chu, Juncheng Mo, Shuo Fang, Huaizhong Lin, Rui Lv, Chenguang Ma, Lei Zhao
https://arxiv.org/abs/2509.04908
TriQuest:An AI Copilot-Powered Platform for Interdisciplinary Curriculum Design
Huazhen Wang, Huimin Yang, Hainbin Lin, Yan Dong, Lili Chen, Liangliang Xia, Wenwen Xu
https://arxiv.org/abs/2510.03369
AgenticRAG: Tool-Augmented Foundation Models for Zero-Shot Explainable Recommender Systems
Bo Ma, Hang Li, ZeHua Hu, XiaoFan Gui, LuYao Liu, Simon Liu
https://arxiv.org/abs/2510.02668
🇺🇦 Auf radioeins läuft...
Sam Taylor-Wood:
🎵 I'm In Love With A German Film Star
#NowPlaying #SamTaylorWood
https://kompakt.bandcamp.com/track/im-in-love-with-a-german-film-star-gui-boratto-remix
https://open.spotify.com/track/3NT7jidpDWSaiZe73A5nRs
#Accessibility modelers using #r5r #rstats, check this GUI for playing around with R5 network. If many people find it useful, I would get signal if I should invest any more free time into it.
Improving GUI Grounding with Explicit Position-to-Coordinate Mapping
Suyuchen Wang, Tianyu Zhang, Ahmed Masry, Christopher Pal, Spandana Gella, Bang Liu, Perouz Taslakian
https://arxiv.org/abs/2510.03230
Euphonica is a Rust-Powered MPD Client Heavy on Bling
MPD (Music Player Daemon) is a server-client audio player long popular with Linux users. The headless daemon runs as a background service, typically on a remote audio server. Music is then accessed via a GUI client frontend, which connects to the MPD server to stream content.
🎶
UItron: Foundational GUI Agent with Advanced Perception and Planning
Zhixiong Zeng, Jing Huang, Liming Zheng, Wenkang Han, Yufeng Zhong, Lei Chen, Longrong Yang, Yingjie Chu, Yuzhi He, Lin Ma
https://arxiv.org/abs/2508.21767
IndusGCC: A Data Benchmark and Evaluation Framework for GUI-Based General Computer Control in Industrial Automation
Xiaoran Yang, Yuyang Du, Kexin Chen, Soung Chang Liew, Jiamin Lu, Ziyu Guo, Xiaoyan Liu, Qun Yang, Shiqi Xu, Xingyu Fan, Yuchen Pan, Taoyong Cui, Hongyu Deng, Boris Dudder, Jianzhang Pan, Qun Fang, Pheng Ann Heng
https://arxi…
Evaluating LLM-Generated Legal Explanations for Regulatory Compliance in Social Media Influencer Marketing
Haoyang Gui, Thales Bertaglia, Taylor Annabell, Catalina Goanta, Tjomme Dooper, Gerasimos Spanakis
https://arxiv.org/abs/2510.08111
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
Haoming Wang, Haoyang Zou, Huatong Song, Jiazhan Feng, Junjie Fang, Junting Lu, Longxiang Liu, Qinyu Luo, Shihao Liang, Shijue Huang, Wanjun Zhong, Yining Ye, Yujia Qin, Yuwen Xiong, Yuxin Song, Zhiyong Wu, Bo Li, Chen Dun, Chong Liu, Fuxing Leng, Hanbin Wang, Hao Yu, Haobin Chen, Hongyi Guo, Jing Su, Jingjia Huang, Kai Shen, Kaiyu Shi, Lin Yan, Peiyao Zhao, Pengfei Liu, Qinghao Ye, Renjie Zheng, Way…
OmniActor: A Generalist GUI and Embodied Agent for 2D&3D Worlds
Longrong Yang, Zhixiong Zeng, Yufeng Zhong, Jing Huang, Liming Zheng, Lei Chen, Haibo Qiu, Zequn Qin, Lin Ma, Xi Li
https://arxiv.org/abs/2509.02322
Realistic Environmental Injection Attacks on GUI Agents
Yitong Zhang, Ximo Li, Liyi Cai, Jia Li
https://arxiv.org/abs/2509.11250 https://arxiv.org/pdf/2509…
Revisit of the electromagnetic correction to $\tau\to\pi\pi\nu_\tau$ and its implication for muon $g-2$ based on $\tau$ data
Zhi-Xin Li, Ao Li, Jin Hao, Chun-Gui Duan, Zhi-Hui Guo
https://arxiv.org/abs/2510.04172
Mobile-Agent-v3: Foundamental Agents for GUI Automation
Jiabo Ye, Xi Zhang, Haiyang Xu, Haowei Liu, Junyang Wang, Zhaoqing Zhu, Ziwei Zheng, Feiyu Gao, Junjie Cao, Zhengxi Lu, Jitong Liao, Qi Zheng, Fei Huang, Jingren Zhou, Ming Yan
https://arxiv.org/abs/2508.15144
EVDI : Event-based Video Deblurring and Interpolation via Self-Supervised Learning
Chi Zhang, Xiang Zhang, Chenxu Jiang, Gui-Song Xia, Lei Yu
https://arxiv.org/abs/2509.08260 h…
Search-R3: Unifying Reasoning and Embedding Generation in Large Language Models
Yuntao Gui, James Cheng
https://arxiv.org/abs/2510.07048 https://arxiv.org/…
From KP-I Lump Solution to Travelling wave of 3D Gravity Capillary Water wave problem
Changfeng Gui, Shanfa Lai, Yong Liu, Juncheng Wei, Wen Yang
https://arxiv.org/abs/2509.06084
Crosslisted article(s) found for cs.AI. https://arxiv.org/list/cs.AI/new
[3/5]:
- \emph{FoQuS}: A Forgetting-Quality Coreset Selection Framework for Automatic Modulation Recognition
Yao Lu, Chunfeng Sun, Dongwei Xu, Yun Lin, Qi Xuan, Guan Gui
TofuML: A Spatio-Physical Interactive Machine Learning Device for Interactive Exploration of Machine Learning for Novices
Wataru Kawabe, Hiroto Fukuda, Akihisa Shitara, Yuri Nakao, Yusuke Sugano
https://arxiv.org/abs/2508.00252
LaTCoder: Converting Webpage Design to Code with Layout-as-Thought
Yi Gui, Zhen Li, Zhongyi Zhang, Guohao Wang, Tianpeng Lv, Gaoyang Jiang, Yi Liu, Dongping Chen, Yao Wan, Hongyu Zhang, Wenbin Jiang, Xuanhua Shi, Hai Jin
https://arxiv.org/abs/2508.03560
Learning Active Perception via Self-Evolving Preference Optimization for GUI Grounding
Wanfu Wang, Qipeng Huang, Guangquan Xue, Xiaobo Liang, Juntao Li
https://arxiv.org/abs/2509.04243
Real-Time Power electronics Control and Monitoring with TI F28379D DSC and GUI Composer
Ilyas Bennia, Lotfi Baghli, Ehsan Jamshidpour, Abdelkader Mechernene, Jean-Philippe Martin, Driss Yousfi
https://arxiv.org/abs/2509.25008
LaSM: Layer-wise Scaling Mechanism for Defending Pop-up Attack on GUI Agents
Zihe Yan, Zhuosheng Zhang
https://arxiv.org/abs/2507.10610 https://
Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents
Zhen Yang, Zi-Yi Dou, Di Feng, Forrest Huang, Anh Nguyen, Keen You, Omar Attia, Yuhao Yang, Michael Feng, Haotian Zhang, Ram Ramrakhya, Chao Jia, Jeffrey Nichols, Alexander Toshev, Yinfei Yang, Zhe Gan
https://arxiv.org/abs/2509.26539…
Log2Plan: An Adaptive GUI Automation Framework Integrated with Task Mining Approach
Seoyoung Lee, Seonbin Yoon, Seongbeen Lee, Hyesoo Kim, Joo Yong Sim
https://arxiv.org/abs/2509.22137
Well-Posedness of the Cauchy Problem for First-order Quasilinear Equations with Non-Lipschitz Source Terms and Its Applications
Gaowei Cao, Gui-Qiang G. Chen, Wei Xiang, Xiaozhou Yang
https://arxiv.org/abs/2509.06020
RISK: A Framework for GUI Agents in E-commerce Risk Management
Renqi Chen, Zeyin Tao, Jianming Guo, Jingzhe Zhu, Yiheng Peng, Qingqing Sun, Tianyi Zhang, Shuai Chen
https://arxiv.org/abs/2509.21982
Blueprint First, Model Second: A Framework for Deterministic LLM Workflow
Libin Qiu, Yuhang Ye, Zhirong Gao, Xide Zou, Junfu Chen, Ziming Gui, Weizhi Huang, Xiaobo Xue, Wenkai Qiu, Kun Zhao
https://arxiv.org/abs/2508.02721
CRAFT-GUI: Curriculum-Reinforced Agent For GUI Tasks
Songqin Nong, Jingxuan Xu, Sheng Zhou, Jianfeng Chen, Xiaoxuan Tang, Tao Jiang, Wenhao Xu
https://arxiv.org/abs/2508.11360 h…
Phi-Ground Tech Report: Advancing Perception in GUI Grounding
Miaosen Zhang, Ziqiang Xu, Jialiang Zhu, Qi Dai, Kai Qiu, Yifan Yang, Chong Luo, Tianyi Chen, Justin Wagle, Tim Franklin, Baining Guo
https://arxiv.org/abs/2507.23779
R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?
Yi Lu, Jianing Wang, Linsen Guo, Wei He, Hongyin Tang, Tao Gui, Xuanjing Huang, Xuezhi Cao, Wei Wang, Xunliang Cai
https://arxiv.org/abs/2510.08189
Quantitative stability for the conformally invariant Chang-Gui inequality on the exponentiation of functions on the sphere
Monideep Ghosh, Debabrata Karmakar
https://arxiv.org/abs/2508.19930
An Automated Attack Investigation Approach Leveraging Threat-Knowledge-Augmented Large Language Models
Rujie Dai, Peizhuo Lv, Yujiang Gui, Qiujian Lv, Yuanyuan Qiao, Yan Wang, Degang Sun, Weiqing Huang, Yingjiu Li, XiaoFeng Wang
https://arxiv.org/abs/2509.01271
Replaced article(s) found for cs.HC. https://arxiv.org/list/cs.HC/new
[2/2]:
- GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding
Tang, Gu, Lu, Liu, Shen, Meng, Wang, Zhang, Shen, Lu, Xiao, Zhuang
GenPilot: A Multi-Agent System for Test-Time Prompt Optimization in Image Generation
Wen Ye, Zhaocheng Liu, Yuwei Gui, Tingyu Yuan, Yunyue Su, Bowen Fang, Chaoyang Zhao, Qiang Liu, Liang Wang
https://arxiv.org/abs/2510.07217
D-Artemis: A Deliberative Cognitive Framework for Mobile GUI Multi-Agents
Hongze Mi, Yibo Feng, Wenjie Lu, Yuqi Wang, Jinyuan Li, Song Cao, He Cui, Tengfei Tian, Xuelin Zhang, Haotian Luo, Di Sun, Naiqiang Tan, Gang Pan
https://arxiv.org/abs/2509.21799
Structuring GUI Elements through Vision Language Models: Towards Action Space Generation
Yi Xu, Yesheng Zhang, jiajia Liu, Jingdong Chen
https://arxiv.org/abs/2508.16271 https:/…
Dark Patterns Meet GUI Agents: LLM Agent Susceptibility to Manipulative Interfaces and the Role of Human Oversight
Jingyu Tang, Chaoran Chen, Jiawen Li, Zhiping Zhang, Bingcan Guo, Ibrahim Khalilov, Simret Araya Gebreegziabher, Bingsheng Yao, Dakuo Wang, Yanfang Ye, Tianshi Li, Ziang Xiao, Yaxing Yao, Toby Jia-Jun Li
https://arxiv.org/abs/…
ProRe: A Proactive Reward System for GUI Agents via Reasoner-Actor Collaboration
Gaole Dai, Shiqi Jiang, Ting Cao, Yuqing Yang, Yuanchun Li, Rui Tan, Mo Li, Lili Qiu
https://arxiv.org/abs/2509.21823
Orcust: Stepwise-Feedback Reinforcement Learning for GUI Agent
Junyu Lu, Songxin Zhang, Zejian Xie, Zhuoyang Song, Jiaxing Zhang
https://arxiv.org/abs/2509.17917 https://…
SWIRL: A Staged Workflow for Interleaved Reinforcement Learning in Mobile GUI Control
Quanfeng Lu, Zhantao Ma, Shuai Zhong, Jin Wang, Dahai Yu, Michael K. Ng, Ping Luo
https://arxiv.org/abs/2508.20018 …
V2P: From Background Suppression to Center Peaking for Robust GUI Grounding Task
Jikai Chen, Long Chen, Dong Wang, Leilei Gan, Chenyi Zhuang, Jinjie Gu
https://arxiv.org/abs/2508.13634
SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models
Pingchuan Ma, Xiaopei Yang, Yusong Li, Ming Gui, Felix Krause, Johannes Schusterbauer, Bj\"orn Ommer
https://arxiv.org/abs/2508.03402
Replaced article(s) found for cs.AI. https://arxiv.org/list/cs.AI/new
[2/6]:
- NatureGAIA: Pushing the Frontiers of GUI Agents with a Challenging Benchmark and High-Quality Tra...
Zihan Zheng, Tianle Cui, Chuwen Xie, Jiahui Zhang, Jiahui Pan, Lewei He, Qianglong Chen
See, Think, Act: Teaching Multimodal Agents to Effectively Interact with GUI by Identifying Toggles
Zongru Wu, Rui Mao, Zhiyuan Tian, Pengzhou Cheng, Tianjie Ju, Zheng Wu, Lingzhong Dong, Haiyue Sheng, Zhuosheng Zhang, Gongshen Liu
https://arxiv.org/abs/2509.13615
Replaced article(s) found for cs.AI. https://arxiv.org/list/cs.AI/new
[4/6]:
- SciReplicate-Bench: Benchmarking LLMs in Agent-driven Algorithmic Reproduction from Research Papers
Yanzheng Xiang, Hanqi Yan, Shuyin Ouyang, Lin Gui, Yulan He
AutoMaAS: Self-Evolving Multi-Agent Architecture Search for Large Language Models
Bo Ma, Hang Li, ZeHua Hu, XiaoFan Gui, LuYao Liu, Simon Liu
https://arxiv.org/abs/2510.02669 ht…
Towards Adversarial Training under Hyperspectral Images
Weihua Zhang, Chengze Jiang, Jie Gui, Lu Dong
https://arxiv.org/abs/2510.01014 https://arxiv.org/pd…
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/5]:
- GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding
Tang, Gu, Lu, Liu, Shen, Meng, Wang, Zhang, Shen, Lu, Xiao, Zhuang
InfraMind: A Novel Exploration-based GUI Agentic Framework for Mission-critical Industrial Management
Liangtao Lin, Zhaomeng Zhu, Tianwei Zhang, Yonggang Wen
https://arxiv.org/abs/2509.13704