Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csAI_bot@mastoxiv.page
2025-10-13 09:44:00

Auto-scaling Continuous Memory for GUI Agent
Wenyi Wu, Kun Zhou, Ruoxin Yuan, Vivian Yu, Stephen Wang, Zhiting Hu, Biwei Huang
arxiv.org/abs/2510.09038

@arXiv_csCV_bot@mastoxiv.page
2025-08-14 09:32:22

FineState-Bench: A Comprehensive Benchmark for Fine-Grained State Control in GUI Agents
Fengxian Ji, Jingpu Yang, Zirui Song, Yuanxi Wang, Zhexuan Cui, Yuke Li, Qian Jiang, Miao Fang, Xiuying Chen
arxiv.org/abs/2508.09241

@kornel@mastodon.social
2025-10-13 10:48:05

#HomeAssistant is pretty good.
It has some quirks, but it's not as much hassle as I feared. I had to edit 4 lines of YAML, but everything else I wanted was in the GUI.
Finally, the switch to the #Matter standard paid off. I can keep devices shared with HomeKit, while having way bet…

@BBC6MusicBot@mastodonapp.uk
2025-09-14 15:19:41

🇺🇦 #NowPlaying on #BBC6Music
High Vis:
🎵 Guided Tour
#HighVis
highvis.bandcamp.com/track/gui
open.spotify.com/track/0afflx9

@arXiv_csLG_bot@mastoxiv.page
2025-10-13 12:17:19

Crosslisted article(s) found for cs.LG. arxiv.org/list/cs.LG/new
[3/5]:
- Auto-scaling Continuous Memory for GUI Agent
Wenyi Wu, Kun Zhou, Ruoxin Yuan, Vivian Yu, Stephen Wang, Zhiting Hu, Biwei Huang

@arXiv_csCL_bot@mastoxiv.page
2025-08-14 13:28:40

Replaced article(s) found for cs.CL. arxiv.org/list/cs.CL/new
[2/3]:
- MemGuide: Intent-Driven Memory Selection for Goal-Oriented Multi-Session LLM Agents
Du, Wang, He, Liang, Wang, Li, Gui, Pan, Xu, Wong

@arXiv_quantph_bot@mastoxiv.page
2025-08-13 09:36:12

Evidence of scaling advantage on an NP-Complete problem with enhanced quantum solvers
Quanfeng Lu, Shijie Wei, Keren Li, Pan Gao, Bao Yan, Muxi Zheng, Haoran Zhang, Jinfeng Zeng, Gui-Lu Long
arxiv.org/abs/2508.08869

@arXiv_csHC_bot@mastoxiv.page
2025-08-07 07:34:03

MagicGUI: A Foundational Mobile GUI Agent with Scalable Data Pipeline and Reinforcement Fine-tuning
Liujian Tang, Shaokang Dong, Yijia Huang, Minqi Xiang, Hongtao Ruan, Bin Wang, Shuo Li, Zhihui Cao, Hailiang Pang, Heng Kong, He Yang, Mingxu Chai, Zhilin Gao, Xingyu Liu, Yingnan Fu, Jiaming Liu, Tao Gui, Xuanjing Huang, Yu-Gang Jiang, Qi Zhang, Kang Wang, Yunke Zhang, Yuran Wang

@arXiv_csSE_bot@mastoxiv.page
2025-08-06 09:37:40

GUI-ReRank: Enhancing GUI Retrieval with Multi-Modal LLM-based Reranking
Kristian Kolthoff, Felix Kretzer, Christian Bartelt, Alexander Maedche, Simone Paolo Ponzetto
arxiv.org/abs/2508.03298

@arXiv_eessSY_bot@mastoxiv.page
2025-08-13 09:13:12

An Open-Source Simulation and Data Management Tool for EnergyPlus Building Models
Ninad Gaikwad, Kasey Dettlaff, Athul Jose P, Anamika Dubey
arxiv.org/abs/2508.09130

@arXiv_mathAP_bot@mastoxiv.page
2025-10-14 10:02:08

On the existence of self-similar solutions to the steady Navier-Stokes equations in high dimensions
Jeaheang Bang, Changfeng Gui, Hao Liu, Yun Wang, Chunjing Xie
arxiv.org/abs/2510.10488

@arXiv_condmatmtrlsci_bot@mastoxiv.page
2025-10-14 11:28:19

Breaking the Sabatier Principle by Dynamic Adsorption-Desorption Decoupling in Electrocatalytic Hydrogen Evolution
Zi-Xuan Yang, Lei Li, Tao Huang, Hui Wan, X. S. Wang, Gui-Fang Huang, Wangyu Hu, Wei-Qing Huang
arxiv.org/abs/2510.10555

@arXiv_csAI_bot@mastoxiv.page
2025-10-10 10:18:49

ReInAgent: A Context-Aware GUI Agent Enabling Human-in-the-Loop Mobile Task Navigation
Haitao Jia, Ming He, Zimo Yin, Likang Wu, Jianping Fan, Jitao Sang
arxiv.org/abs/2510.07988

@azonenberg@ioc.exchange
2025-09-08 10:39:13

Trion FPGA adventure update:
It looks like you need to import the generated .isf into the interface designer if you want to switch between RTL and GUI flows (e.g. to use the GUI to constrain I/O blocks that were inferred from RTL). This is a pain, but at least it's something I can work around now that I understand it.
Next problem: I think I've found my first datasheet errata. The EFX_DDIO documentation in the Quantium Trion Primitives User Guide lists DDIO, DDIO_RESYNC, …

@arXiv_csRO_bot@mastoxiv.page
2025-09-12 09:28:49

Dexplore: Scalable Neural Control for Dexterous Manipulation from Reference-Scoped Exploration
Sirui Xu, Yu-Wei Chao, Liuyu Bian, Arsalan Mousavian, Yu-Xiong Wang, Liang-Yan Gui, Wei Yang
arxiv.org/abs/2509.09671

@arXiv_csCV_bot@mastoxiv.page
2025-08-08 10:29:02

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency
Yong Du, Yuchen Yan, Fei Tang, Zhengxi Lu, Chang Zong, Weiming Lu, Shengpei Jiang, Yongliang Shen
arxiv.org/abs/2508.05615

@arXiv_csCL_bot@mastoxiv.page
2025-09-10 10:19:41

VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents
Zheng Wu, Heyuan Huang, Xingyu Lou, Xiangmou Qu, Pengzhou Cheng, Zongru Wu, Weiwen Liu, Weinan Zhang, Jun Wang, Zhaoxiang Wang, Zhuosheng Zhang
arxiv.org/abs/2509.07553

@arXiv_csHC_bot@mastoxiv.page
2025-10-09 07:58:11

A Multimodal GUI Architecture for Interfacing with LLM-Based Conversational Assistants
Hans G. W. van Dam
arxiv.org/abs/2510.06223 arxiv.or…

@arXiv_csCR_bot@mastoxiv.page
2025-09-03 14:10:23

APEX: Automatic Event Sequence Generation for Android Applications
Wenhao Chen, Morris Chang, Witawas Srisa-an, Yong Guan
arxiv.org/abs/2509.02412

@macandi@social.heise.de
2025-07-18 11:54:00

A GUI to relive: website emulates the settings of numerous Macs
How did people configure Macs in the past? Designer Marcin Wichery has compiled 20 years of macOS history on a single page.

@arXiv_csAI_bot@mastoxiv.page
2025-08-11 07:30:19

InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization
Yuhang Liu, Zeyu Liu, Shuanghe Zhu, Pengxiang Li, Congkai Xie, Jiasheng Wang, Xueyu Hu, Xiaotian Han, Jianbo Yuan, Xinyao Wang, Shengyu Zhang, Hongxia Yang, Fei Wu
arxiv.org/abs/2508.05731

@toxi@mastodon.thi.ng
2025-07-25 11:06:28

Weekend list of critical reading links about the state[1] of Tech, AI[2] hype/finance/politics, mostly long form:
Ed Zitron's The Hater's Guide To The AI Bubble
wheresyoured.at/the-haters-gui
How to use computing power faster: on the weird ec…

@arXiv_quantph_bot@mastoxiv.page
2025-08-12 11:33:43

Observation and Modulation of the Quantum Mpemba Effect on a Superconducting Quantum Processor
Yueshan Xu, Cai-Ping Fang, Bing-Jie Chen, Ming-Chuan Wang, Zi-Yong Ge, Yun-Hao Shi, Yu Liu, Cheng-Lin Deng, Kui Zhao, Zheng-He Liu, Tian-Ming Li, Hao Li, Ziting Wang, Gui-Han Liang, Da'er Feng, Xueyi Guo, Xu-Yang Gu, Yang He, Hao-Tian Liu, Zheng-Yang Mei, Yongxi Xiao, Yu Yan, Yi-Han Yu, Wei-Ping Yuan, Jia-Chi Zhang, Zheng-An Wang, Gangqin Liu, Xiaohui Song, Ye Tian, Yu-Ran Zhang, Shi-Xin …

@jake4480@c.im
2025-08-31 02:47:35

Was able to grab and install Lagrange (for accessing Gopher/Gemini sites) here on my laptop that has Mint on it, just by searching for Lagrange in the Software Manager (gotta love Linux). Installed, launched-- and yeah, it really is a great, clean GUI. Also, you can see my Rogue desktop background 😂
#Gemini #Gopher

A snip of my desktop showing the GUI of the Gopher/Gemini client Lagrange, over my Rogue desktop background

@arXiv_csCV_bot@mastoxiv.page
2025-07-28 10:16:01

MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents
Xuehui Wang, Zhenyu Wu, JingJing Xie, Zichen Ding, Bowen Yang, Zehao Li, Zhaoyang Liu, Qingyun Li, Xuan Dong, Zhe Chen, Weiyun Wang, Xiangyu Zhao, Jixuan Chen, Haodong Duan, Tianbao Xie, Chenyu Yang, Shiqian Su, Yue Yu, Yuan Huang, Yiqian Liu, Xiao Zhang, Yanting Zhang, Xiangyu Yue, Weijie Su, Xizhou Zhu, Wei Shen, Jifeng Dai, Wenhai Wang

@arXiv_csRO_bot@mastoxiv.page
2025-08-12 18:00:12

Replaced article(s) found for cs.RO. arxiv.org/list/cs.RO/new
[2/3]:
- UniCalib: Targetless LiDAR-Camera Calibration via Probabilistic Flow on Unified Depth Representat...
Shu Han, Xubo Zhu, Ji Wu, Ximeng Cai, Wen Yang, Huai Yu, Gui-Song Xia

@arXiv_mathST_bot@mastoxiv.page
2025-08-11 07:57:19

Validity and Power of Heavy-Tailed Combination Tests under Asymptotic Dependence
Lin Gui, Tiantian Mao, Jingshu Wang, Ruodu Wang
arxiv.org/abs/2508.05818

@arXiv_csDC_bot@mastoxiv.page
2025-10-10 12:36:49

Crosslisted article(s) found for cs.DC. arxiv.org/list/cs.DC/new
[1/1]:
- FedQS: Optimizing Gradient and Model Aggregation for Semi-Asynchronous Federated Learning
Yunbo Li, Jiaping Gui, Zhihang Deng, Fanchao Meng, Yue Wu

@arXiv_csHC_bot@mastoxiv.page
2025-08-07 09:40:24

VeriGUI: Verifiable Long-Chain GUI Dataset
Shunyu Liu, Minghao Liu, Huichi Zhou, Zhenyu Cui, Yang Zhou, Yuhao Zhou, Wendong Fan, Ge Zhang, Jiajun Shi, Weihao Xuan, Jiaxing Huang, Shuang Luo, Fang Wu, Heli Qi, Qingcheng Zeng, Ziqi Ren, Jialiang Gao, Jindi Lv, Junjie Wang, Aosong Feng, Heng Zhou, Wangchunshu Zhou, Zhenfei Yin, Wenlong Zhang, Guohao Li, Wenhao Yu, Irene Li, Lei Ma, Lei Bai, Qunshu Lin, Mingli Song, Dacheng Tao

@hansaplast42@social.wastedalpaca.wtf
2025-09-06 12:01:10

I've now bought a second AP that is only allowed onto the internet via VPN. The GL-inet device is small, but its performance is impressive. It has a very good GUI, and OpenWRT runs underneath. AdGuard, for example, can be installed with a single click, and setting up the VPN is super easy too.
amzn.eu/d/3hj3EAC

@arXiv_csLG_bot@mastoxiv.page
2025-09-11 10:15:03

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
Zhiheng Xi, Jixuan Huang, Chenyang Liao, Baodai Huang, Honglin Guo, Jiaqi Liu, Rui Zheng, Junjie Ye, Jiazheng Zhang, Wenxiang Chen, Wei He, Yiwen Ding, Guanyu Li, Zehui Chen, Zhengyin Du, Xuesong Yao, Yufei Xu, Jiecao Chen, Tao Gui, Zuxuan Wu, Qi Zhang, Xuanjing Huang, Yu-Gang Jiang

@arXiv_csMA_bot@mastoxiv.page
2025-09-03 08:41:13

KG-RAG: Enhancing GUI Agent Decision-Making via Knowledge Graph-Driven Retrieval-Augmented Generation
Ziyi Guan, Jason Chun Lok Li, Zhijian Hou, Pingping Zhang, Donglai Xu, Yuzhi Zhao, Mengyang Wu, Jinpeng Chen, Thanh-Toan Nguyen, Pengfei Xian, Wenao Ma, Shengchao Qin, Graziano Chesi, Ngai Wong
arxiv.org/abs/2509.00366

@arXiv_csCL_bot@mastoxiv.page
2025-10-02 10:31:31

GUI-KV: Efficient GUI Agents via KV Cache with Spatio-Temporal Awareness
Kung-Hsiang Huang, Haoyi Qiu, Yutong Dai, Caiming Xiong, Chien-Sheng Wu
arxiv.org/abs/2510.00536

@arXiv_mathAP_bot@mastoxiv.page
2025-09-12 08:47:39

Numerical analysis of the homogeneous Landau equation: approximation, error estimates and simulation
Francis Filbet (IMT), Yanzhi Gui (THU), Ling-Bing He (THU)
arxiv.org/abs/2509.09276

@arXiv_hepph_bot@mastoxiv.page
2025-10-09 10:02:21

Texture-zeros in minimal seesaw from non-invertible symmetry fusion rules
Zheng Jiang, Bu-Yao Qu, Gui-Jun Ding
arxiv.org/abs/2510.07236 arx…

@ruth_mottram@fediscience.org
2025-07-22 09:49:05

My main problem with @edzitron.com 's piece on the #AIbubble is that I agree with so much of it.
I'm now wondering if I've missed something about #LLMs? The numbers and implications for stock markets are terrifyingly huge!
wheresyoured.at/the-haters-gui

@arXiv_astrophIM_bot@mastoxiv.page
2025-08-05 09:45:40

SPAN: A cross-platform Python GUI software for optical and near-infrared spectral analysis
Daniele Gasparri, Lorenzo Morelli, Umberto Battino, Jairo Méndez Abreu, Adriana de Lorenzo-Cáceres
arxiv.org/abs/2508.01923

@janneke@todon.nl
2025-10-05 14:07:42

About a month ago I was a bit too optimistic*, the admin took over a month to complete....: Only last Tuesday Reasonable Sourcery was officially inCOOPerated.
Today we finally sent out the official introduction:
lists.gnu.org/archive/html/gui

@arXiv_csSE_bot@mastoxiv.page
2025-09-11 11:39:59

Replaced article(s) found for cs.SE. arxiv.org/list/cs.SE/new
[1/1]:
- Large Language Models for Mobile GUI Text Input Generation: An Empirical Study
Chenhui Cui, Tao Li, Junjie Wang, Chunyang Chen, Dave Towey, Rubing Huang

@arXiv_csAI_bot@mastoxiv.page
2025-09-05 07:30:20

PG-Agent: An Agent Powered by Page Graph
Weizhi Chen, Ziwei Wang, Leyang Yang, Sheng Zhou, Xiaoxuan Tang, Jiajun Bu, Yong Li, Wei Jiang
arxiv.org/abs/2509.03536

@datascience@genomic.social
2025-08-26 10:00:02

If you just need a pretty figure from a dataset and not the full power of R, have a look at #gui

@arXiv_eessIV_bot@mastoxiv.page
2025-09-08 09:00:30

Multi-modal Uncertainty Robust Tree Cover Segmentation For High-Resolution Remote Sensing Images
Yuanyuan Gui, Wei Li, Yinjian Wang, Xiang-Gen Xia, Mauro Marty, Christian Ginzler, Zuyuan Wang
arxiv.org/abs/2509.04870

@philip@mastodon.mallegolhansen.com
2025-08-04 03:33:31

In my day we used dd to create bootable USB media. You're trying to sell me on a GUI with an upsell built in? Pass.

@NicolasGriseyDemengel@piaille.fr
2025-07-23 12:40:07

wheresyoured.at/the-haters-gui/

@volephd@fediscience.org
2025-08-05 19:06:22

I never imagined getting used to a new #HPC infrastructure would be such a pain.
I got used to CLI, SSH access, and bash scripts before, but suddenly everything is handled via a GUI, everything is containerized.
AND ALL MY WORKFLOWS ARE BROKEN!
I don't even manage to set up a simple #conda environment.

@arXiv_csCV_bot@mastoxiv.page
2025-09-12 10:14:19

InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation
Sirui Xu, Dongting Li, Yucheng Zhang, Xiyan Xu, Qi Long, Ziyin Wang, Yunzhi Lu, Shuchang Dong, Hezi Jiang, Akshat Gupta, Yu-Xiong Wang, Liang-Yan Gui
arxiv.org/abs/2509.09555

@bogo@hapyyr.com
2025-10-03 14:09:35

I will be talking at the next #DevOps #Prague meetup. You should join. It's free to attend and requires registration.

@BBC6MusicBot@mastodonapp.uk
2025-09-14 06:34:42

🇺🇦 #NowPlaying on #BBC6Music's #AmyLamé
High Vis:
🎵 Guided Tour
#HighVis
highvis.bandcamp.com/track/gui
open.spotify.com/track/0afflx9

@arXiv_csAI_bot@mastoxiv.page
2025-09-08 09:12:00

SparkUI-Parser: Enhancing GUI Perception with Robust Grounding and Parsing
Hongyi Jing, Jiafu Chen, Chen Rao, Ziqiang Dang, Jiajie Teng, Tianyi Chu, Juncheng Mo, Shuo Fang, Huaizhong Lin, Rui Lv, Chenguang Ma, Lei Zhao
arxiv.org/abs/2509.04908

@arXiv_csCY_bot@mastoxiv.page
2025-10-07 09:19:12

TriQuest: An AI Copilot-Powered Platform for Interdisciplinary Curriculum Design
Huazhen Wang, Huimin Yang, Hainbin Lin, Yan Dong, Lili Chen, Liangliang Xia, Wenwen Xu
arxiv.org/abs/2510.03369

@awinkler@openbiblio.social
2025-08-01 08:11:08

It's useful that there's the possibility to add #gnd ids to #zenodo as well (in case creators don't have an #orcid). I wish it'd be clearer from the GUI that the required input format is …

@arXiv_csIR_bot@mastoxiv.page
2025-10-06 07:39:39

AgenticRAG: Tool-Augmented Foundation Models for Zero-Shot Explainable Recommender Systems
Bo Ma, Hang Li, ZeHua Hu, XiaoFan Gui, LuYao Liu, Simon Liu
arxiv.org/abs/2510.02668

@radioeinsmusicbot@mastodonapp.uk
2025-09-25 16:15:17

🇺🇦 Now playing on radioeins...
Sam Taylor-Wood:
🎵 I'm In Love With A German Film Star
#NowPlaying #SamTaylorWood
kompakt.bandcamp.com/track/im-
open.spotify.com/track/3NT7jid

@EgorKotov@datasci.social
2025-09-03 11:14:38

#Accessibility modelers using #r5r #rstats, check out this GUI for playing around with an R5 network. If many people find it useful, that would be a signal for whether I should invest any more free time into it.

Animation showing a point-and-click interface for an R5 network. The user selects start and end locations on the map, the route is calculated and displayed on the map, and the route legs are presented below in a table.

@arXiv_csCV_bot@mastoxiv.page
2025-10-06 10:16:09

Improving GUI Grounding with Explicit Position-to-Coordinate Mapping
Suyuchen Wang, Tianyu Zhang, Ahmed Masry, Christopher Pal, Spandana Gella, Bang Liu, Perouz Taslakian
arxiv.org/abs/2510.03230

@kubikpixel@chaos.social
2025-07-26 08:25:11

Euphonica is a Rust-Powered MPD Client Heavy on Bling
MPD (Music Player Daemon) is a server-client audio player long popular with Linux users. The headless daemon runs as a background service, typically on a remote audio server. Music is then accessed via a GUI client frontend, which connects to the MPD server to stream content.
🎶

@arXiv_csCV_bot@mastoxiv.page
2025-09-01 09:56:32

UItron: Foundational GUI Agent with Advanced Perception and Planning
Zhixiong Zeng, Jing Huang, Liming Zheng, Wenkang Han, Yufeng Zhong, Lei Chen, Longrong Yang, Yingjie Chu, Yuzhi He, Lin Ma
arxiv.org/abs/2508.21767

@arXiv_eessSY_bot@mastoxiv.page
2025-09-03 12:32:33

IndusGCC: A Data Benchmark and Evaluation Framework for GUI-Based General Computer Control in Industrial Automation
Xiaoran Yang, Yuyang Du, Kexin Chen, Soung Chang Liew, Jiamin Lu, Ziyu Guo, Xiaoyan Liu, Qun Yang, Shiqi Xu, Xingyu Fan, Yuchen Pan, Taoyong Cui, Hongyu Deng, Boris Dudder, Jianzhang Pan, Qun Fang, Pheng Ann Heng
arxi…

@arXiv_csCL_bot@mastoxiv.page
2025-10-10 10:51:19

Evaluating LLM-Generated Legal Explanations for Regulatory Compliance in Social Media Influencer Marketing
Haoyang Gui, Thales Bertaglia, Taylor Annabell, Catalina Goanta, Tjomme Dooper, Gerasimos Spanakis
arxiv.org/abs/2510.08111

@arXiv_csAI_bot@mastoxiv.page
2025-09-03 14:14:33

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
Haoming Wang, Haoyang Zou, Huatong Song, Jiazhan Feng, Junjie Fang, Junting Lu, Longxiang Liu, Qinyu Luo, Shihao Liang, Shijue Huang, Wanjun Zhong, Yining Ye, Yujia Qin, Yuwen Xiong, Yuxin Song, Zhiyong Wu, Bo Li, Chen Dun, Chong Liu, Fuxing Leng, Hanbin Wang, Hao Yu, Haobin Chen, Hongyi Guo, Jing Su, Jingjia Huang, Kai Shen, Kaiyu Shi, Lin Yan, Peiyao Zhao, Pengfei Liu, Qinghao Ye, Renjie Zheng, Way…

@arXiv_csCV_bot@mastoxiv.page
2025-09-03 14:59:13

OmniActor: A Generalist GUI and Embodied Agent for 2D&3D Worlds
Longrong Yang, Zhixiong Zeng, Yufeng Zhong, Jing Huang, Liming Zheng, Lei Chen, Haibo Qiu, Zequn Qin, Lin Ma, Xi Li
arxiv.org/abs/2509.02322

@arXiv_csCR_bot@mastoxiv.page
2025-09-16 11:56:27

Realistic Environmental Injection Attacks on GUI Agents
Yitong Zhang, Ximo Li, Liyi Cai, Jia Li
arxiv.org/abs/2509.11250 arxiv.org/pdf/2509…

@arXiv_hepph_bot@mastoxiv.page
2025-10-07 10:13:02

Revisit of the electromagnetic correction to $\tau\to\pi\pi\nu_\tau$ and its implication for muon $g-2$ based on $\tau$ data
Zhi-Xin Li, Ao Li, Jin Hao, Chun-Gui Duan, Zhi-Hui Guo
arxiv.org/abs/2510.04172

@arXiv_csAI_bot@mastoxiv.page
2025-08-22 08:54:40

Mobile-Agent-v3: Foundamental Agents for GUI Automation
Jiabo Ye, Xi Zhang, Haiyang Xu, Haowei Liu, Junyang Wang, Zhaoqing Zhu, Ziwei Zheng, Feiyu Gao, Junjie Cao, Zhengxi Lu, Jitong Liao, Qi Zheng, Fei Huang, Jingren Zhou, Ming Yan
arxiv.org/abs/2508.15144

@arXiv_csCV_bot@mastoxiv.page
2025-09-11 09:19:03

EVDI: Event-based Video Deblurring and Interpolation via Self-Supervised Learning
Chi Zhang, Xiang Zhang, Chenxu Jiang, Gui-Song Xia, Lei Yu
arxiv.org/abs/2509.08260

@arXiv_csCL_bot@mastoxiv.page
2025-10-09 10:39:11

Search-R3: Unifying Reasoning and Embedding Generation in Large Language Models
Yuntao Gui, James Cheng
arxiv.org/abs/2510.07048 arxiv.org/…

@arXiv_mathAP_bot@mastoxiv.page
2025-09-09 10:51:32

From KP-I Lump Solution to Travelling wave of 3D Gravity Capillary Water wave problem
Changfeng Gui, Shanfa Lai, Yong Liu, Juncheng Wei, Wen Yang
arxiv.org/abs/2509.06084

@arXiv_csAI_bot@mastoxiv.page
2025-09-11 11:16:26

Crosslisted article(s) found for cs.AI. arxiv.org/list/cs.AI/new
[3/5]:
- FoQuS: A Forgetting-Quality Coreset Selection Framework for Automatic Modulation Recognition
Yao Lu, Chunfeng Sun, Dongwei Xu, Yun Lin, Qi Xuan, Guan Gui

@arXiv_csHC_bot@mastoxiv.page
2025-08-04 09:08:10

TofuML: A Spatio-Physical Interactive Machine Learning Device for Interactive Exploration of Machine Learning for Novices
Wataru Kawabe, Hiroto Fukuda, Akihisa Shitara, Yuri Nakao, Yusuke Sugano
arxiv.org/abs/2508.00252

@arXiv_csSE_bot@mastoxiv.page
2025-08-06 10:10:20

LaTCoder: Converting Webpage Design to Code with Layout-as-Thought
Yi Gui, Zhen Li, Zhongyi Zhang, Guohao Wang, Tianpeng Lv, Gaoyang Jiang, Yi Liu, Dongping Chen, Yao Wan, Hongyu Zhang, Wenbin Jiang, Xuanhua Shi, Hai Jin
arxiv.org/abs/2508.03560

@arXiv_csCV_bot@mastoxiv.page
2025-09-05 10:20:31

Learning Active Perception via Self-Evolving Preference Optimization for GUI Grounding
Wanfu Wang, Qipeng Huang, Guangquan Xue, Xiaobo Liang, Juntao Li
arxiv.org/abs/2509.04243

@arXiv_eessSY_bot@mastoxiv.page
2025-09-30 12:04:21

Real-Time Power electronics Control and Monitoring with TI F28379D DSC and GUI Composer
Ilyas Bennia, Lotfi Baghli, Ehsan Jamshidpour, Abdelkader Mechernene, Jean-Philippe Martin, Driss Yousfi
arxiv.org/abs/2509.25008

@arXiv_csCR_bot@mastoxiv.page
2025-07-16 08:17:11

LaSM: Layer-wise Scaling Mechanism for Defending Pop-up Attack on GUI Agents
Zihe Yan, Zhuosheng Zhang
arxiv.org/abs/2507.10610

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 11:50:27

Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents
Zhen Yang, Zi-Yi Dou, Di Feng, Forrest Huang, Anh Nguyen, Keen You, Omar Attia, Yuhao Yang, Michael Feng, Haotian Zhang, Ram Ramrakhya, Chao Jia, Jeffrey Nichols, Alexander Toshev, Yinfei Yang, Zhe Gan
arxiv.org/abs/2509.26539

@arXiv_csAI_bot@mastoxiv.page
2025-09-29 10:28:27

Log2Plan: An Adaptive GUI Automation Framework Integrated with Task Mining Approach
Seoyoung Lee, Seonbin Yoon, Seongbeen Lee, Hyesoo Kim, Joo Yong Sim
arxiv.org/abs/2509.22137

@arXiv_mathAP_bot@mastoxiv.page
2025-09-09 10:49:32

Well-Posedness of the Cauchy Problem for First-order Quasilinear Equations with Non-Lipschitz Source Terms and Its Applications
Gaowei Cao, Gui-Qiang G. Chen, Wei Xiang, Xiaozhou Yang
arxiv.org/abs/2509.06020

@arXiv_csAI_bot@mastoxiv.page
2025-09-29 10:18:37

RISK: A Framework for GUI Agents in E-commerce Risk Management
Renqi Chen, Zeyin Tao, Jianming Guo, Jingzhe Zhu, Yiheng Peng, Qingqing Sun, Tianyi Zhang, Shuai Chen
arxiv.org/abs/2509.21982

@arXiv_csSE_bot@mastoxiv.page
2025-08-06 08:20:30

Blueprint First, Model Second: A Framework for Deterministic LLM Workflow
Libin Qiu, Yuhang Ye, Zhirong Gao, Xide Zou, Junfu Chen, Ziming Gui, Weizhi Huang, Xiaobo Xue, Wenkai Qiu, Kun Zhao
arxiv.org/abs/2508.02721

@arXiv_csAI_bot@mastoxiv.page
2025-08-18 08:16:50

CRAFT-GUI: Curriculum-Reinforced Agent For GUI Tasks
Songqin Nong, Jingxuan Xu, Sheng Zhou, Jianfeng Chen, Xiaoxuan Tang, Tao Jiang, Wenhao Xu
arxiv.org/abs/2508.11360

@arXiv_csCV_bot@mastoxiv.page
2025-08-01 10:23:51

Phi-Ground Tech Report: Advancing Perception in GUI Grounding
Miaosen Zhang, Ziqiang Xu, Jialiang Zhu, Qi Dai, Kai Qiu, Yifan Yang, Chong Luo, Tianyi Chen, Justin Wagle, Tim Franklin, Baining Guo
arxiv.org/abs/2507.23779

@arXiv_csAI_bot@mastoxiv.page
2025-10-10 10:31:19

R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?
Yi Lu, Jianing Wang, Linsen Guo, Wei He, Hongyin Tang, Tao Gui, Xuanjing Huang, Xuezhi Cao, Wei Wang, Xunliang Cai
arxiv.org/abs/2510.08189

@arXiv_mathAP_bot@mastoxiv.page
2025-08-28 09:13:41

Quantitative stability for the conformally invariant Chang-Gui inequality on the exponentiation of functions on the sphere
Monideep Ghosh, Debabrata Karmakar
arxiv.org/abs/2508.19930

@arXiv_csCR_bot@mastoxiv.page
2025-09-03 13:17:53

An Automated Attack Investigation Approach Leveraging Threat-Knowledge-Augmented Large Language Models
Rujie Dai, Peizhuo Lv, Yujiang Gui, Qiujian Lv, Yuanyuan Qiao, Yan Wang, Degang Sun, Weiqing Huang, Yingjiu Li, XiaoFeng Wang
arxiv.org/abs/2509.01271

@arXiv_csHC_bot@mastoxiv.page
2025-07-29 16:28:38

Replaced article(s) found for cs.HC. arxiv.org/list/cs.HC/new
[2/2]:
- GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding
Tang, Gu, Lu, Liu, Shen, Meng, Wang, Zhang, Shen, Lu, Xiao, Zhuang

@arXiv_csCV_bot@mastoxiv.page
2025-10-09 10:47:01

GenPilot: A Multi-Agent System for Test-Time Prompt Optimization in Image Generation
Wen Ye, Zhaocheng Liu, Yuwei Gui, Tingyu Yuan, Yunyue Su, Bowen Fang, Chaoyang Zhao, Qiang Liu, Liang Wang
arxiv.org/abs/2510.07217

@arXiv_csAI_bot@mastoxiv.page
2025-09-29 09:47:07

D-Artemis: A Deliberative Cognitive Framework for Mobile GUI Multi-Agents
Hongze Mi, Yibo Feng, Wenjie Lu, Yuqi Wang, Jinyuan Li, Song Cao, He Cui, Tengfei Tian, Xuelin Zhang, Haotian Luo, Di Sun, Naiqiang Tan, Gang Pan
arxiv.org/abs/2509.21799

@arXiv_csCV_bot@mastoxiv.page
2025-08-25 09:51:10

Structuring GUI Elements through Vision Language Models: Towards Action Space Generation
Yi Xu, Yesheng Zhang, jiajia Liu, Jingdong Chen
arxiv.org/abs/2508.16271

@arXiv_csHC_bot@mastoxiv.page
2025-09-16 07:50:06

Dark Patterns Meet GUI Agents: LLM Agent Susceptibility to Manipulative Interfaces and the Role of Human Oversight
Jingyu Tang, Chaoran Chen, Jiawen Li, Zhiping Zhang, Bingcan Guo, Ibrahim Khalilov, Simret Araya Gebreegziabher, Bingsheng Yao, Dakuo Wang, Yanfang Ye, Tianshi Li, Ziang Xiao, Yaxing Yao, Toby Jia-Jun Li
arxiv.org/abs/…

@arXiv_csAI_bot@mastoxiv.page
2025-09-29 09:48:47

ProRe: A Proactive Reward System for GUI Agents via Reasoner-Actor Collaboration
Gaole Dai, Shiqi Jiang, Ting Cao, Yuqing Yang, Yuanchun Li, Rui Tan, Mo Li, Lili Qiu
arxiv.org/abs/2509.21823

@arXiv_csAI_bot@mastoxiv.page
2025-09-23 12:03:20

Orcust: Stepwise-Feedback Reinforcement Learning for GUI Agent
Junyu Lu, Songxin Zhang, Zejian Xie, Zhuoyang Song, Jiaxing Zhang
arxiv.org/abs/2509.17917

@arXiv_csAI_bot@mastoxiv.page
2025-08-28 09:32:41

SWIRL: A Staged Workflow for Interleaved Reinforcement Learning in Mobile GUI Control
Quanfeng Lu, Zhantao Ma, Shuai Zhong, Jin Wang, Dahai Yu, Michael K. Ng, Ping Luo
arxiv.org/abs/2508.20018

@arXiv_csAI_bot@mastoxiv.page
2025-08-20 09:54:40

V2P: From Background Suppression to Center Peaking for Robust GUI Grounding Task
Jikai Chen, Long Chen, Dong Wang, Leilei Gan, Chenyi Zhuang, Jinjie Gu
arxiv.org/abs/2508.13634

@arXiv_csCV_bot@mastoxiv.page
2025-08-06 10:34:10

SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models
Pingchuan Ma, Xiaopei Yang, Yusong Li, Ming Gui, Felix Krause, Johannes Schusterbauer, Björn Ommer
arxiv.org/abs/2508.03402

@arXiv_csAI_bot@mastoxiv.page
2025-08-08 13:51:31

Replaced article(s) found for cs.AI. arxiv.org/list/cs.AI/new
[2/6]:
- NatureGAIA: Pushing the Frontiers of GUI Agents with a Challenging Benchmark and High-Quality Tra...
Zihan Zheng, Tianle Cui, Chuwen Xie, Jiahui Zhang, Jiahui Pan, Lewei He, Qianglong Chen

@arXiv_csAI_bot@mastoxiv.page
2025-09-18 09:33:51

See, Think, Act: Teaching Multimodal Agents to Effectively Interact with GUI by Identifying Toggles
Zongru Wu, Rui Mao, Zhiyuan Tian, Pengzhou Cheng, Tianjie Ju, Zheng Wu, Lingzhong Dong, Haiyue Sheng, Zhuosheng Zhang, Gongshen Liu
arxiv.org/abs/2509.13615

@arXiv_csAI_bot@mastoxiv.page
2025-08-08 13:51:57

Replaced article(s) found for cs.AI. arxiv.org/list/cs.AI/new
[4/6]:
- SciReplicate-Bench: Benchmarking LLMs in Agent-driven Algorithmic Reproduction from Research Papers
Yanzheng Xiang, Hanqi Yan, Shuyin Ouyang, Lin Gui, Yulan He

@arXiv_csAI_bot@mastoxiv.page
2025-10-06 09:07:19

AutoMaAS: Self-Evolving Multi-Agent Architecture Search for Large Language Models
Bo Ma, Hang Li, ZeHua Hu, XiaoFan Gui, LuYao Liu, Simon Liu
arxiv.org/abs/2510.02669

@arXiv_csCV_bot@mastoxiv.page
2025-10-02 10:58:01

Towards Adversarial Training under Hyperspectral Images
Weihua Zhang, Chengze Jiang, Jie Gui, Lu Dong
arxiv.org/abs/2510.01014 arxiv.org/pd…

@arXiv_csCV_bot@mastoxiv.page
2025-07-23 14:03:36

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[5/5]:
- GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding
Tang, Gu, Lu, Liu, Shen, Meng, Wang, Zhang, Shen, Lu, Xiao, Zhuang

@arXiv_csAI_bot@mastoxiv.page
2025-09-18 09:34:41

InfraMind: A Novel Exploration-based GUI Agentic Framework for Mission-critical Industrial Management
Liangtao Lin, Zhaomeng Zhu, Tianwei Zhang, Yonggang Wen
arxiv.org/abs/2509.13704