
2025-06-17 10:52:09
Can you see how I learn? Human observers' inferences about Reinforcement Learning agents' learning processes
Bernhard Hilpert, Muhan Hou, Kim Baraka, Joost Broekens
https://arxiv.org/abs/2506.13583
Can you see how I learn? Human observers' inferences about Reinforcement Learning agents' learning processes
Bernhard Hilpert, Muhan Hou, Kim Baraka, Joost Broekens
https://arxiv.org/abs/2506.13583
PROL : Rehearsal Free Continual Learning in Streaming Data via Prompt Online Learning
M. Anwar Ma'sum, Mahardhika Pratama, Savitha Ramasamy, Lin Liu, Habibullah Habibullah, Ryszard Kowalczyk
https://arxiv.org/abs/2507.12305
Cross-lingual Few-shot Learning for Persian Sentiment Analysis with Incremental Adaptation
Farideh Majidi, Ziaeddin Beheshtifard
https://arxiv.org/abs/2507.11634
Does using machine learning solve our problem of p-hacking and HARKing or do we have the same problems as with statistical tests and models?
https://digiresacademy.kit.com/posts/is-machine-learning-and-ai-solving-the-problem-of-p-hacking
Semi-Supervised Learning with Online Knowledge Distillation for Skin Lesion Classification
Siyamalan Manivannan
https://arxiv.org/abs/2508.11511 https://ar…
Universal Rates of ERM for Agnostic Learning
Steve Hanneke, Mingyue Xu
https://arxiv.org/abs/2506.14110 https://arxiv.org/pdf/2506.14…
EBS-CFL: Efficient and Byzantine-robust Secure Clustered Federated Learning
Zhiqiang Li, Haiyong Bao, Menghong Guan, Hao Pan, Cheng Huang, Hong-Ning Dai
https://arxiv.org/abs/2506.13612
Emergent Heterogeneous Swarm Control Through Hebbian Learning
Fuda van Diggelen, Tugay Alperen Karag\"uzel, Andres Garcia Rincon, A. E. Eiben, Dario Floreano, Eliseo Ferrante
https://arxiv.org/abs/2507.11566
Branch, or Layer? Zeroth-Order Optimization for Continual Learning of Vision-Language Models
Ziwei Liu, Borui Kang, Wei Li, Hangjie Yuan, Yanbing Yang, Wenbin Li, Jun Luo, Yifan Zhu, Tao Feng
https://arxiv.org/abs/2506.12409
Discovering Temporal Structure: An Overview of Hierarchical Reinforcement Learning
Martin Klissarov, Akhil Bagaria, Ziyan Luo, George Konidaris, Doina Precup, Marlos C. Machado
https://arxiv.org/abs/2506.14045
Sporadic Federated Learning Approach in Quantum Environment to Tackle Quantum Noise
Ratun Rahman, Atit Pokharel, Dinh C. Nguyen
https://arxiv.org/abs/2507.12492
Now out in #TMLR:
🍇 GRAPES: Learning to Sample Graphs for Scalable Graph Neural Networks 🍇
There's lots of work on sampling subgraphs for GNNs, but relatively little on making this sampling process _adaptive_. That is, learning to select the data from the graph that is relevant for your task.
We introduce an RL-based and a GFLowNet-based sampler and show that the approach perf…
Bio-inspired learning algorithm for time series using Loewner equation
Yusuke Shibasaki
https://arxiv.org/abs/2506.12372 https://arxi…
A Survey of Reinforcement Learning for Software Engineering
Dong Wang, Hanmo You, Lingwei Zhu, Kaiwei Lin, Zheng Chen, Chen Yang, Junji Yu, Zan Wang, Junjie Chen
https://arxiv.org/abs/2507.12483
Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback
Suzie Kim, Hye-Bin Shin, Seong-Whan Lee
https://arxiv.org/abs/2507.13171
Bayesian inference for the learning rate in Generalised Bayesian inference
Jeong Eun Lee, Sitong Liu, Geoff K. Nicholls
https://arxiv.org/abs/2506.12532 ht…
Optimizing Federated Learning using Remote Embeddings for Graph Neural Networks
Pranjal Naman, Yogesh Simmhan
https://arxiv.org/abs/2506.12425 https://
Learning Best Paths in Quantum Networks
Xuchuang Wang, Maoli Liu, Xutong Liu, Zhuohua Li, Mohammad Hajiesmaili, John C. S. Lui, Don Towsley
https://arxiv.org/abs/2506.12462
SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation
Weizhi Zhang, Liangwei Yang, Zihe Song, Henrry Peng Zou, Ke Xu, Yuanjie Zhu, Philip S. Yu
https://arxiv.org/abs/2507.13336
Anthropic expands Claude's Learning Mode, available only to Education users since an April launch, to all users, including two learning variants for Claude Code (Igor Bonifacic/Engadget)
https://www.engadget.com/ai/anthropic-brin
Quantum-Enhanced Reinforcement Learning with LSTM Forecasting Signals for Optimizing Fintech Trading Decisions
Yen-Ku Liu, Yun-Huei Pan, Pei-Fan Lu, Yun-Cheng Tsai, Samuel Yen-Chi Chen
https://arxiv.org/abs/2507.12835
Self-learning Monte Carlo Method: A Review
Gaopei Pan, Chuang Chen, Zi Yang Meng
https://arxiv.org/abs/2507.12554 https://arxiv.org/p…
Cowboys' 1st-round rookie working to flatten learning curve of life in NFL https://cowboyswire.usatoday.com/story/sports/nfl/cowboys/2025/06/16/cowboys-rookie-tyler-booker-quotes/84233439007/
How does the #brain transfer #MotorSkills between hands?
This study reveals that transfer relies on re-expressing the neural patterns established during initial learning in distributed higher-order brain areas,
offering new insights into learning
Multiple machine-learning as a powerful tool for the star clusters analysis
Denilso Camargo
https://arxiv.org/abs/2506.13951 https://…
Unsupervised Ground Metric Learning
Janis Auffenberg, Jonas Bresch, Oleh Melnyk, Gabriele Steidl
https://arxiv.org/abs/2507.13094 https://
High computational density nanophotonic media for machine learning inference
Zhenyu Zhao, Yichen Pan, Jinlong Xiang, Yujia Zhang, An He, Yaotian Zhao, Youlve Chen, Yu He, Xinyuan Fang, Yikai Su, Min Gu, Xuhan Guo
https://arxiv.org/abs/2506.14269
EBS-CFL: Efficient and Byzantine-robust Secure Clustered Federated Learning
Zhiqiang Li, Haiyong Bao, Menghong Guan, Hao Pan, Cheng Huang, Hong-Ning Dai
https://arxiv.org/abs/2506.13612
From Misunderstandings to Learning Opportunities: Leveraging Generative AI in Discussion Forums to Support Student Learning
Stanislav Pozdniakov, Jonathan Brazil, Oleksandra Poquet, Stephan Krusche, Santiago Berrezueta-Guzman, Shazia Sadiq, Hassan Khosravi
https://arxiv.org/abs/2508.11150
WIP: Turning Fake Chips into Learning Opportunities
Haniye Mehraban, Saad Azmeen-ur-Rahman, John Hu
https://arxiv.org/abs/2507.13281 https://
Counterfactual Survival Q Learning for Longitudinal Randomized Trials via Buckley James Boosting
Jeongjin Lee, Jong-Min Kim
https://arxiv.org/abs/2508.11060 https://
Online Training and Pruning of Deep Reinforcement Learning Networks
Valentin Frank Ingmar Guenter, Athanasios Sideris
https://arxiv.org/abs/2507.11975 http…
Hierarchical Deep Feature Fusion and Ensemble Learning for Enhanced Brain Tumor MRI Classification
Zahid Ullah, Jihie Kim
https://arxiv.org/abs/2506.12363 …
Partitioner Guided Modal Learning Framework
Guimin Hu, Yi Xin, Lijie Hu, Zhihong Zhu, Hasti Seifi
https://arxiv.org/abs/2507.11661 https://
Enhancing Symbolic Machine Learning by Subsymbolic Representations
Stephen Roth, Lennart Baur, Derian Boer, Stefan Kramer
https://arxiv.org/abs/2506.14569 …
Quantum Transfer Learning to Boost Dementia Detection
Sounak Bhowmik, Talita Perciano, Himanshu Thapliyal
https://arxiv.org/abs/2507.12485 https://
SENIOR: Efficient Query Selection and Preference-Guided Exploration in Preference-based Reinforcement Learning
Hexian Ni, Tao Lu, Haoyuan Hu, Yinghao Cai, Shuo Wang
https://arxiv.org/abs/2506.14648
Isolating Noisy Labelled Test Cases in Human-in-the-Loop Oracle Learning
Charaka Geethal Kapugama
https://arxiv.org/abs/2506.13273 https://
Integrating Radiomics with Deep Learning Enhances Multiple Sclerosis Lesion Delineation
Nadezhda Alsahanova, Pavel Bartenev, Maksim Sharaev, Milos Ljubisavljevic, Taleb Al. Mansoori, Yauhen Statsenko
https://arxiv.org/abs/2506.14524
Latency Optimization for Wireless Federated Learning in Multihop Networks
Shaba Shaon, Van-Dinh Nguyen, Dinh C. Nguyen
https://arxiv.org/abs/2506.12081 htt…
Privacy-Preserving Federated Learning against Malicious Clients Based on Verifiable Functional Encryption
Nina Cai, Jinguang Han
https://arxiv.org/abs/2506.12846
Berlin-based Knowunity, an AI-powered learning platform with 20M users in 15 countries, raised a €27M Series B led by XAnge, bringing its total funding to €45M (Tamara Djurickovic/Tech.eu)
https://tech.eu/2025/06/13/knowunity-raises-eur…
Don't throw the baby out with the bathwater: How and why deep learning for ARC
Jack Cole, Mohamed Osman
https://arxiv.org/abs/2506.14276 https://
Evolutionary chemical learning in dimerization networks
Alexei V. Tkachenko, Bortolo Matteo Mognetti, Sergei Maslov
https://arxiv.org/abs/2506.14006 https:…
NineToothed: A Triton-Based High-Level Domain-Specific Language for Machine Learning
Jiacheng Huang, Zimin Li, Yinghui Li, Haojie Wang
https://arxiv.org/abs/2507.11978
Variational Learning Finds Flatter Solutions at the Edge of Stability
Avrajit Ghosh, Bai Cong, Rio Yokota, Saiprasad Ravishankar, Rongrong Wang, Molei Tao, Mohammad Emtiyaz Khan, Thomas M\"ollenhoff
https://arxiv.org/abs/2506.12903
GenFlowRL: Shaping Rewards with Generative Object-Centric Flow in Visual Reinforcement Learning
Kelin Yu, Sheng Zhang, Harshit Soora, Furong Huang, Heng Huang, Pratap Tokekar, Ruohan Gao
https://arxiv.org/abs/2508.11049
Convergence Rate of Generalized Nash Equilibrium Learning in Strongly Monotone Games with Linear Constraints
Tatiana Tatarenko, Maryam Kamgarpour
https://arxiv.org/abs/2507.12112 …
Human-in-the-Loop Systems for Adaptive Learning Using Generative AI
Bhavishya Tarun, Haoze Du, Dinesh Kannan, Edward F. Gehringer
https://arxiv.org/abs/2508.11062 https://
A Gradient Meta-Learning Joint Optimization for Beamforming and Antenna Position in Pinching-Antenna Systems
Kang Zhou, Weixi Zhou, Donghong Cai, Xianfu Lei, Yanqing Xu, Zhiguo Ding, Pingzhi Fan
https://arxiv.org/abs/2506.12583
Quantum-Inspired Differentiable Integral Neural Networks (QIDINNs): A Feynman-Based Architecture for Continuous Learning Over Streaming Data
Oscar Boullosa Dapena
https://arxiv.org/abs/2506.12111
Replaced article(s) found for cs.LG. https://arxiv.org/list/cs.LG/new
[3/5]:
- Uncertainty Quantification for Motor Imagery BCI -- Machine Learning vs. Deep Learning
Joris Suurmeijer, Ivo Pascal de Jong, Matias Valdenegro-Toro, Andreea Ioana Sburlea
Privacy-Preserving Federated Learning against Malicious Clients Based on Verifiable Functional Encryption
Nina Cai, Jinguang Han
https://arxiv.org/abs/2506.12846
Comparative Analysis of Deep Learning Strategies for Hypertensive Retinopathy Detection from Fundus Images: From Scratch and Pre-trained Models
Yanqiao Zhu
https://arxiv.org/abs/2506.12492
Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs
Ring Team, Bin Hu, Cai Chen, Deng Zhao, Ding Liu, Dingnan Jin, Feng Zhu, Hao Dai, Hongzhi Luan, Jia Guo, Jiaming Liu, Jiewei Wu, Jun Mei, Jun Zhou, Junbo Zhao, Junwu Xiong, Kaihong Zhang, Kuan Xu, Lei Liang, Liang Jiang, Liangcheng Fu, Longfei Zheng, Qiang Gao, Qing Cui, Quan Wan, Shaomian Zheng, Shuaicheng Li, Tongkai Yang, Wang Ren, Xiaodong Yan, Xiaopei Wan, Xiaoyun Feng, Xin Zhao, Xinxing Yang, Xinyu …
Causality in the human niche: lessons for machine learning
Richard D. Lange, Konrad P. Kording
https://arxiv.org/abs/2506.13803 https://
Quantum Machine Learning in Multi-Qubit Phase-Space Part I: Foundations
Timothy Heightman, Edward Jiang, Ruth Mora-Soto, Maciej Lewenstein, Marcin P{\l}odzie\'n
https://arxiv.org/abs/2507.12117
General and Estimable Learning Bound Unifying Covariate and Concept Shifts
Hongbo Chen, Li Charlie Xia
https://arxiv.org/abs/2506.12829 https://
Apple seems to be working on adding CUDA support to open-source ML framework MLX, which may mean that code developed using MLX would work with CUDA (Malcolm Owen/AppleInsider)
https://appleinsider.com/articles/25/0
A Bayesian Incentive Mechanism for Poison-Resilient Federated Learning
Daniel Commey, Rebecca A. Sarpong, Griffith S. Klogo, Winful Bagyl-Bac, Garth V. Crosby
https://arxiv.org/abs/2507.12439
Autonomous Resource Management in Microservice Systems via Reinforcement Learning
Yujun Zou, Nia Qi, Yingnan Deng, Zhihao Xue, Ming Gong, Wuyang Zhang
https://arxiv.org/abs/2507.12879
EKPC: Elastic Knowledge Preservation and Compensation for Class-Incremental Learning
Huaijie Wang, De Cheng, Lingfeng He, Yan Li, Jie Li, Nannan Wang, Xinbo Gao
https://arxiv.org/abs/2506.12351
Research on Optimal Control Problem Based on Reinforcement Learning under Knightian Uncertainty
Ziyu Li, Chen Fei, Weiyin Fei
https://arxiv.org/abs/2506.13207
Findings of MEGA: Maths Explanation with LLMs using the Socratic Method for Active Learning
Tosin Adewumi, Foteini Simistira Liwicki, Marcus Liwicki, Viktor Gardelli, Lama Alkhaled, Hamam Mokayed
https://arxiv.org/abs/2507.12079
Conversational AI as a Catalyst for Informal Learning: An Empirical Large-Scale Study on LLM Use in Everyday Learning
Na{\dj}a Terzimehi\'c, Babette B\"uhler, Enkelejda Kasneci
https://arxiv.org/abs/2506.11789
EgoVLA: Learning Vision-Language-Action Models from Egocentric Human Videos
Ruihan Yang, Qinxi Yu, Yecheng Wu, Rui Yan, Borui Li, An-Chieh Cheng, Xueyan Zou, Yunhao Fang, Hongxu Yin, Sifei Liu, Song Han, Yao Lu, Xiaolong Wang
https://arxiv.org/abs/2507.12440
BenchRL-QAS: Benchmarking reinforcement learning algorithms for quantum architecture search
Azhar Ikhtiarudin, Aditi Das, Param Thakkar, Akash Kundu
https://arxiv.org/abs/2507.12189
Activate Me!: Designing Efficient Activation Functions for Privacy-Preserving Machine Learning with Fully Homomorphic Encryption
Nges Brian Njungle, Michel A. Kinsy
https://arxiv.org/abs/2508.11575
The CAISAR Platform: Extending the Reach of Machine Learning Specification and Verification
Michele Alberti (LSL), Fran\c{c}ois Bobot (LSL), Julien Girard-Satabin (LSL), Alban Grastien (LSL), Aymeric Varasse (LSL), Zakaria Chihani (LSL)
https://arxiv.org/abs/2506.12084
Information-Theoretic Generalization Bounds of Replay-based Continual Learning
Wen Wen, Tieliang Gong, Yunjiao Zhang, Zeyu Gao, Weizhan Zhang, Yong-Jin Liu
https://arxiv.org/abs/2507.12043
CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion
Jiahua Ma, Yiran Qin, Yixiong Li, Xuanqi Liao, Yulan Guo, Ruimao Zhang
https://arxiv.org/abs/2506.14769
Rademacher learning rates for iterated random functions
Nikola Sandri\'c
https://arxiv.org/abs/2506.13946 https://arxiv.org/pdf/2…
Learning to Predict Mobile Robot Stability in Off-Road Environments
Nathaniel Rose, Arif Ahmed, Emanuel Gutierrez-Cornejo, Parikshit Maini
https://arxiv.org/abs/2507.12731
FourCastNet 3: A geometric approach to probabilistic machine-learning weather forecasting at scale
Boris Bonev, Thorsten Kurth, Ankur Mahesh, Mauro Bisson, Jean Kossaifi, Karthik Kashinath, Anima Anandkumar, William D. Collins, Michael S. Pritchard, Alexander Keller
https://arxiv.org/abs/2507.12144…
Inside Knowledge: Graph-based Path Generation with Explainable Data Augmentation and Curriculum Learning for Visual Indoor Navigation
Daniel Airinei, Elena Burceanu, Marius Leordeanu
https://arxiv.org/abs/2508.11446
Leveraging Quantum Layers in Classical Neural Networks
Silvie Ill\'esov\'a
https://arxiv.org/abs/2507.12505 https://arxiv.org…
Safeguarding Federated Learning-based Road Condition Classification
Sheng Liu, Panos Papadimitratos
https://arxiv.org/abs/2507.12568 https://
Understanding Learning Invariance in Deep Linear Networks
Hao Duan, Guido Mont\'ufar
https://arxiv.org/abs/2506.13714 https://arx…
Multi-Group Equivariant Augmentation for Reinforcement Learning in Robot Manipulation
Hongbin Lin, Juan Rojas, Kwok Wai Samuel Au
https://arxiv.org/abs/2508.11204 https://
Improving Reinforcement Learning Sample-Efficiency using Local Approximation
Mohit Prashant, Arvind Easwaran
https://arxiv.org/abs/2507.12383 https://
Actor-Critic for Continuous Action Chunks: A Reinforcement Learning Framework for Long-Horizon Robotic Manipulation with Sparse Reward
Jiarui Yang, Bin Zhu, Jingjing Chen, Yu-Gang Jiang
https://arxiv.org/abs/2508.11143
Random Matrix Theory for Deep Learning: Beyond Eigenvalues of Linear Models
Zhenyu Liao, Michael W. Mahoney
https://arxiv.org/abs/2506.13139 https://
A Crowdsensing Intrusion Detection Dataset For Decentralized Federated Learning Models
Chao Feng, Alberto Huertas Celdran, Jing Han, Heqing Ren, Xi Cheng, Zien Zeng, Lucas Krauter, Gerome Bovet, Burkhard Stiller
https://arxiv.org/abs/2507.13313
Evaluating Reinforcement Learning Algorithms for Navigation in Simulated Robotic Quadrupeds: A Comparative Study Inspired by Guide Dog Behaviour
Emma M. A. Harrison
https://arxiv.org/abs/2507.13277
InverTune: Removing Backdoors from Multimodal Contrastive Learning Models via Trigger Inversion and Activation Tuning
Mengyuan Sun, Yu Li, Yuchen Liu, Bo Du, Yunjie Ge
https://arxiv.org/abs/2506.12411
A Transfer Learning Framework for Multilayer Networks via Model Averaging
Yongqin Qiu, Xinyu Zhang
https://arxiv.org/abs/2506.12455 https://
CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models
Xiaoxue Wu, Bingjie Gao, Yu Qiao, Yaohui Wang, Xinyuan Chen
https://arxiv.org/abs/2508.11484
Replaced article(s) found for cs.LG. https://arxiv.org/list/cs.LG/new
[2/5]:
- Learning Universal Human Mobility Patterns with a Foundation Model for Cross-domain Data Fusion
Haoxuan Ma, Xishun Liao, Yifan Liu, Qinhua Jiang, Chris Stanford, Shangqing Cao, Jiaqi Ma
Quadrotor Morpho-Transition: Learning vs Model-Based Control Strategies
Ioannis Mandralis, Richard M. Murray, Morteza Gharib
https://arxiv.org/abs/2506.14039
A Privacy-Preserving Framework for Advertising Personalization Incorporating Federated Learning and Differential Privacy
Xiang Li, Yifan Lin, Yuanzhe Zhang
https://arxiv.org/abs/2507.12098
Beyond Shapley Values: Cooperative Games for the Interpretation of Machine Learning Models
Marouane Il Idrissi, Agathe Fernandes Machado, Arthur Charpentier
https://arxiv.org/abs/2506.13900
$\pi^3$: Scalable Permutation-Equivariant Visual Geometry Learning
Yifan Wang, Jianjun Zhou, Haoyi Zhu, Wenzheng Chang, Yang Zhou, Zizun Li, Junyi Chen, Jiangmiao Pang, Chunhua Shen, Tong He
https://arxiv.org/abs/2507.13347
Replaced article(s) found for cs.LG. https://arxiv.org/list/cs.LG/new
[5/5]:
- Machine Learning-Driven Compensation for Non-Ideal Channels in AWG-Based FBG Interrogator
Kazakov, Kulichenko, Kovalev, Treskova, Barma, Malakhov, Oseledets, Shipulin
ZipMPC: Compressed Context-Dependent MPC Cost via Imitation Learning
Rahel Rickenbach, Alan A. Lahoud, Erik Schaffernicht, Melanie N. Zeilinger, Johannes A. Stork
https://arxiv.org/abs/2507.13088
DEMONSTRATE: Zero-shot Language to Robotic Control via Multi-task Demonstration Learning
Rahel Rickenbach, Bruce Lee, Ren\'e Zurbr\"ugg, Carmen Amo Alonso, Melanie N. Zeilinger
https://arxiv.org/abs/2507.12855
Replaced article(s) found for cs.LG. https://arxiv.org/list/cs.LG/new
[5/5]:
- EgoVLA: Learning Vision-Language-Action Models from Egocentric Human Videos
Yang, Yu, Wu, Yan, Li, Cheng, Zou, Fang, Yin, Liu, Han, Lu, Wang
ILCL: Inverse Logic-Constraint Learning from Temporally Constrained Demonstrations
Minwoo Cho, Jaehwi Jang, Daehyung Park
https://arxiv.org/abs/2507.11000 …
Replaced article(s) found for cs.LG. https://arxiv.org/list/cs.LG/new
[1/8]:
Boosting Resource-Constrained Federated Learning Systems with Guessed Updates
Replaced article(s) found for cs.LG. https://arxiv.org/list/cs.LG/new
[7/8]:
Achieving Collective Welfare in Multi-Agent Reinforcement Learning via Suggestion Sharing
Trustworthy Tree-based Machine Learning by $MoS_2$ Flash-based Analog CAM with Inherent Soft Boundaries
Bo Wen, Guoyun Gao, Zhicheng Xu, Ruibin Mao, Xiaojuan Qi, X. Sharon Hu, Xunzhao Yin, Can Li
https://arxiv.org/abs/2507.12384