UniAPL: A Unified Adversarial Preference Learning Framework for Instruct-Following
FaQiang Qian, WeiKun Zhang, Ziliang Wang, Kang An, Xuhui Zheng, Liangjian Wen, Mengya Gao, Yong Dai, Yichao Wu
https://arxiv.org/abs/2509.25148
Causal-EPIG: A Prediction-Oriented Active Learning Framework for CATE Estimation
Erdun Gao, Jake Fawkes, Dino Sejdinovic
https://arxiv.org/abs/2509.21866 https://
Eliciting User Requirements for AI-Enhanced Learning Environments using a Participatory Approach
Bibeg Limbu, Irene-Angelica Chounta, Vilma Sukacke, Andromachi Filippidi, Chara Spyropoulou, Marianna Anagnostopoulou, Eleftheria Tsourlidaki, Nikos Karacapilidis
https://arxiv.org/abs/2507.21088
From Past To Path: Masked History Learning for Next-Item Prediction in Generative Recommendation
KaiWen Wei, Kejun He, Xiaomian Kang, Jie Zhang, Yuming Yang, Jiang Zhong, He Bai, Junnan Zhu
https://arxiv.org/abs/2509.23649
DyMoDreamer: World Modeling with Dynamic Modulation
Boxuan Zhang, Runqing Wang, Wei Xiao, Weipu Zhang, Jian Sun, Gao Huang, Jie Chen, Gang Wang
https://arxiv.org/abs/2509.24804 …
Flowing Straighter with Conditional Flow Matching for Accurate Speech Enhancement
Mattias Cross, Anton Ragni
https://arxiv.org/abs/2508.20584 https://arxiv…
Actions as Language: Fine-Tuning VLMs into VLAs Without Catastrophic Forgetting
Asher J. Hancock, Xindi Wu, Lihan Zha, Olga Russakovsky, Anirudha Majumdar
https://arxiv.org/abs/2509.22195
A Greedy PDE Router for Blending Neural Operators and Classical Methods
Sahana Rayan, Yash Patel, Ambuj Tewari
https://arxiv.org/abs/2509.24814 https://arx…
MTRec: Learning to Align with User Preferences via Mental Reward Models
Mengchen Zhao, Yifan Gao, Yaqing Hou, Xiangyang Li, Pengjie Gu, Zhenhua Dong, Ruiming Tang, Yi Cai
https://arxiv.org/abs/2509.22807
StepORLM: A Self-Evolving Framework With Generative Process Supervision For Operations Research Language Models
Chenyu Zhou, Tianyi Xu, Jianghao Lin, Dongdong Ge
https://arxiv.org/abs/2509.22558