Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csHC_bot@mastoxiv.page
2025-06-17 10:52:09

Can you see how I learn? Human observers' inferences about Reinforcement Learning agents' learning processes
Bernhard Hilpert, Muhan Hou, Kim Baraka, Joost Broekens
arxiv.org/abs/2506.13583

@arXiv_csLG_bot@mastoxiv.page
2025-07-17 10:24:20

PROL : Rehearsal Free Continual Learning in Streaming Data via Prompt Online Learning
M. Anwar Ma'sum, Mahardhika Pratama, Savitha Ramasamy, Lin Liu, Habibullah Habibullah, Ryszard Kowalczyk
arxiv.org/abs/2507.12305

@arXiv_csCL_bot@mastoxiv.page
2025-07-17 08:05:40

Cross-lingual Few-shot Learning for Persian Sentiment Analysis with Incremental Adaptation
Farideh Majidi, Ziaeddin Beheshtifard
arxiv.org/abs/2507.11634

@HeidiSeibold@fosstodon.org
2025-07-18 14:06:06

Does using machine learning solve our problem of p-hacking and HARKing or do we have the same problems as with statistical tests and models?
digiresacademy.kit.com/posts/i

An elf pushing over the letter "P"

@arXiv_eessIV_bot@mastoxiv.page
2025-08-18 08:38:00

Semi-Supervised Learning with Online Knowledge Distillation for Skin Lesion Classification
Siyamalan Manivannan
arxiv.org/abs/2508.11511 ar…

@arXiv_statML_bot@mastoxiv.page
2025-06-18 10:28:23

Universal Rates of ERM for Agnostic Learning
Steve Hanneke, Mingyue Xu
arxiv.org/abs/2506.14110 arxiv.org/pdf/2506.14…

@arXiv_csCR_bot@mastoxiv.page
2025-06-17 11:31:46

EBS-CFL: Efficient and Byzantine-robust Secure Clustered Federated Learning
Zhiqiang Li, Haiyong Bao, Menghong Guan, Hao Pan, Cheng Huang, Hong-Ning Dai
arxiv.org/abs/2506.13612

@arXiv_csNE_bot@mastoxiv.page
2025-07-17 07:42:50

Emergent Heterogeneous Swarm Control Through Hebbian Learning
Fuda van Diggelen, Tugay Alperen Karagüzel, Andres Garcia Rincon, A. E. Eiben, Dario Floreano, Eliseo Ferrante
arxiv.org/abs/2507.11566

@arXiv_csCV_bot@mastoxiv.page
2025-06-17 09:51:45

Branch, or Layer? Zeroth-Order Optimization for Continual Learning of Vision-Language Models
Ziwei Liu, Borui Kang, Wei Li, Hangjie Yuan, Yanbing Yang, Wenbin Li, Jun Luo, Yifan Zhu, Tao Feng
arxiv.org/abs/2506.12409

@arXiv_csAI_bot@mastoxiv.page
2025-06-18 08:03:26

Discovering Temporal Structure: An Overview of Hierarchical Reinforcement Learning
Martin Klissarov, Akhil Bagaria, Ziyan Luo, George Konidaris, Doina Precup, Marlos C. Machado
arxiv.org/abs/2506.14045

@arXiv_quantph_bot@mastoxiv.page
2025-07-18 08:33:32

Sporadic Federated Learning Approach in Quantum Environment to Tackle Quantum Noise
Ratun Rahman, Atit Pokharel, Dinh C. Nguyen
arxiv.org/abs/2507.12492

@pbloem@sigmoid.social
2025-07-18 09:25:22

Now out in #TMLR:
🍇 GRAPES: Learning to Sample Graphs for Scalable Graph Neural Networks 🍇
There's lots of work on sampling subgraphs for GNNs, but relatively little on making this sampling process _adaptive_. That is, learning to select the data from the graph that is relevant for your task.
We introduce an RL-based and a GFlowNet-based sampler and show that the approach perf…

A diagram of the GRAPES pipeline. It shows a subgraph being sampled in two steps and being fed to a GNN, with a blue line showing the learning signal. The caption reads Figure 1: Overview of GRAPES. First, GRAPES processes a target node (green) by computing node inclusion probabilities on its 1-hop neighbors (shown by node color shade) with a sampling GNN. Given these probabilities, GRAPES samples k nodes. Then, GRAPES repeats this process over nodes in the 2-hop neighborhood. We pass the sampl…
A results table for node classification on heterophilous graphs. Table 2: F1-scores (%) for different sampling methods trained on heterophilous graphs for a batch size of 256, and a sample size of 256 per layer. We report the mean and standard deviation over 10 runs. The best values among the sampling baselines (all except GAS) are in bold, and the second best are underlined. MC stands for multi-class and ML stands for multi-label classification. OOM indicates out of memory.
Performance of samples vs sampling size showing that GRAPES generally performs well across sample sizes, while other samplers often show more variance across sample sizes. The caption reads Figure 4: Comparative analysis of classification accuracy across different sampling sizes for sampling baseline and GRAPES. We repeated each experiment five times: The shaded regions show the 95% confidence intervals.
A diagrammatic illustration of a graph classification task used in one of the theorems. The caption reads Figure 9: An example of a graph for Theorem 1 with eight nodes. Red edges belong to E1, features xi and labels yi are shown beside every node. For nodes v1 and v2 we show the edge e12 as an example. As shown, the label of each node is the second feature of its neighbor, where a red edge connects them. The edge homophily ratio is h=12/28 = 0.43.

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-17 10:49:45

Bio-inspired learning algorithm for time series using Loewner equation
Yusuke Shibasaki
arxiv.org/abs/2506.12372 arxi…

@arXiv_csSE_bot@mastoxiv.page
2025-07-18 09:11:32

A Survey of Reinforcement Learning for Software Engineering
Dong Wang, Hanmo You, Lingwei Zhu, Kaiwei Lin, Zheng Chen, Chen Yang, Junji Yu, Zan Wang, Junjie Chen
arxiv.org/abs/2507.12483

@arXiv_csRO_bot@mastoxiv.page
2025-07-18 09:49:52

Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback
Suzie Kim, Hye-Bin Shin, Seong-Whan Lee
arxiv.org/abs/2507.13171

@arXiv_statME_bot@mastoxiv.page
2025-06-17 12:18:41

Bayesian inference for the learning rate in Generalised Bayesian inference
Jeong Eun Lee, Sitong Liu, Geoff K. Nicholls
arxiv.org/abs/2506.12532

@arXiv_csDC_bot@mastoxiv.page
2025-06-17 09:27:27

Optimizing Federated Learning using Remote Embeddings for Graph Neural Networks
Pranjal Naman, Yogesh Simmhan
arxiv.org/abs/2506.12425

@arXiv_csNI_bot@mastoxiv.page
2025-06-17 10:00:49

Learning Best Paths in Quantum Networks
Xuchuang Wang, Maoli Liu, Xutong Liu, Zhuohua Li, Mohammad Hajiesmaili, John C. S. Lui, Don Towsley
arxiv.org/abs/2506.12462

@arXiv_csIR_bot@mastoxiv.page
2025-07-18 08:46:22

SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation
Weizhi Zhang, Liangwei Yang, Zihe Song, Henrry Peng Zou, Ke Xu, Yuanjie Zhu, Philip S. Yu
arxiv.org/abs/2507.13336

@Techmeme@techhub.social
2025-08-14 17:30:58

Anthropic expands Claude's Learning Mode, available only to Education users since an April launch, to all users, including two learning variants for Claude Code (Igor Bonifacic/Engadget)
engadget.com/ai/anthropic-brin

@arXiv_csCE_bot@mastoxiv.page
2025-07-18 07:39:22

Quantum-Enhanced Reinforcement Learning with LSTM Forecasting Signals for Optimizing Fintech Trading Decisions
Yen-Ku Liu, Yun-Huei Pan, Pei-Fan Lu, Yun-Cheng Tsai, Samuel Yen-Chi Chen
arxiv.org/abs/2507.12835

@arXiv_condmatstrel_bot@mastoxiv.page
2025-07-18 08:17:12

Self-learning Monte Carlo Method: A Review
Gaopei Pan, Chuang Chen, Zi Yang Meng
arxiv.org/abs/2507.12554 arxiv.org/p…

@cowboys@darktundra.xyz
2025-06-16 23:16:22

Cowboys' 1st-round rookie working to flatten learning curve of life in NFL cowboyswire.usatoday.com/story

How does the #brain transfer #MotorSkills between hands?
This study reveals that transfer relies on re-expressing the neural patterns established during initial learning in distributed higher-order brain areas,
offering new insights into learning

@arXiv_astrophGA_bot@mastoxiv.page
2025-06-18 09:38:38

Multiple machine-learning as a powerful tool for the star clusters analysis
Denilso Camargo
arxiv.org/abs/2506.13951

@arXiv_mathOC_bot@mastoxiv.page
2025-07-18 09:29:22

Unsupervised Ground Metric Learning
Janis Auffenberg, Jonas Bresch, Oleh Melnyk, Gabriele Steidl
arxiv.org/abs/2507.13094

@arXiv_physicsoptics_bot@mastoxiv.page
2025-06-18 10:00:55

High computational density nanophotonic media for machine learning inference
Zhenyu Zhao, Yichen Pan, Jinlong Xiang, Yujia Zhang, An He, Yaotian Zhao, Youlve Chen, Yu He, Xinyuan Fang, Yikai Su, Min Gu, Xuhan Guo
arxiv.org/abs/2506.14269

@arXiv_csCR_bot@mastoxiv.page
2025-06-18 09:06:19

EBS-CFL: Efficient and Byzantine-robust Secure Clustered Federated Learning
Zhiqiang Li, Haiyong Bao, Menghong Guan, Hao Pan, Cheng Huang, Hong-Ning Dai
arxiv.org/abs/2506.13612

@arXiv_csHC_bot@mastoxiv.page
2025-08-18 09:23:20

From Misunderstandings to Learning Opportunities: Leveraging Generative AI in Discussion Forums to Support Student Learning
Stanislav Pozdniakov, Jonathan Brazil, Oleksandra Poquet, Stephan Krusche, Santiago Berrezueta-Guzman, Shazia Sadiq, Hassan Khosravi
arxiv.org/abs/2508.11150

@arXiv_csAR_bot@mastoxiv.page
2025-07-18 08:42:52

WIP: Turning Fake Chips into Learning Opportunities
Haniye Mehraban, Saad Azmeen-ur-Rahman, John Hu
arxiv.org/abs/2507.13281

@arXiv_statML_bot@mastoxiv.page
2025-08-18 08:24:40

Counterfactual Survival Q Learning for Longitudinal Randomized Trials via Buckley James Boosting
Jeongjin Lee, Jong-Min Kim
arxiv.org/abs/2508.11060

@arXiv_csLG_bot@mastoxiv.page
2025-07-17 10:13:20

Online Training and Pruning of Deep Reinforcement Learning Networks
Valentin Frank Ingmar Guenter, Athanasios Sideris
arxiv.org/abs/2507.11975

@arXiv_csCV_bot@mastoxiv.page
2025-06-17 09:40:47

Hierarchical Deep Feature Fusion and Ensemble Learning for Enhanced Brain Tumor MRI Classification
Zahid Ullah, Jihie Kim
arxiv.org/abs/2506.12363

@arXiv_csCL_bot@mastoxiv.page
2025-07-17 08:09:40

Partitioner Guided Modal Learning Framework
Guimin Hu, Yi Xin, Lijie Hu, Zhihong Zhu, Hasti Seifi
arxiv.org/abs/2507.11661

@arXiv_csAI_bot@mastoxiv.page
2025-06-18 08:11:22

Enhancing Symbolic Machine Learning by Subsymbolic Representations
Stephen Roth, Lennart Baur, Derian Boer, Stefan Kramer
arxiv.org/abs/2506.14569

@arXiv_quantph_bot@mastoxiv.page
2025-07-18 08:07:02

Quantum Transfer Learning to Boost Dementia Detection
Sounak Bhowmik, Talita Perciano, Himanshu Thapliyal
arxiv.org/abs/2507.12485

@arXiv_csRO_bot@mastoxiv.page
2025-06-18 09:23:35

SENIOR: Efficient Query Selection and Preference-Guided Exploration in Preference-based Reinforcement Learning
Hexian Ni, Tao Lu, Haoyuan Hu, Yinghao Cai, Shuo Wang
arxiv.org/abs/2506.14648

@arXiv_csSE_bot@mastoxiv.page
2025-06-17 10:59:37

Isolating Noisy Labelled Test Cases in Human-in-the-Loop Oracle Learning
Charaka Geethal Kapugama
arxiv.org/abs/2506.13273

@arXiv_eessIV_bot@mastoxiv.page
2025-06-18 09:04:55

Integrating Radiomics with Deep Learning Enhances Multiple Sclerosis Lesion Delineation
Nadezhda Alsahanova, Pavel Bartenev, Maksim Sharaev, Milos Ljubisavljevic, Taleb Al. Mansoori, Yauhen Statsenko
arxiv.org/abs/2506.14524

@arXiv_csNI_bot@mastoxiv.page
2025-06-17 09:51:29

Latency Optimization for Wireless Federated Learning in Multihop Networks
Shaba Shaon, Van-Dinh Nguyen, Dinh C. Nguyen
arxiv.org/abs/2506.12081

@arXiv_csCR_bot@mastoxiv.page
2025-06-17 09:52:25

Privacy-Preserving Federated Learning against Malicious Clients Based on Verifiable Functional Encryption
Nina Cai, Jinguang Han
arxiv.org/abs/2506.12846

@Techmeme@techhub.social
2025-06-15 21:30:34

Berlin-based Knowunity, an AI-powered learning platform with 20M users in 15 countries, raised a €27M Series B led by XAnge, bringing its total funding to €45M (Tamara Djurickovic/Tech.eu)
tech.eu/2025/06/13/knowunity-r

@arXiv_csAI_bot@mastoxiv.page
2025-06-18 08:06:21

Don't throw the baby out with the bathwater: How and why deep learning for ARC
Jack Cole, Mohamed Osman
arxiv.org/abs/2506.14276

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-18 09:29:18

Evolutionary chemical learning in dimerization networks
Alexei V. Tkachenko, Bortolo Matteo Mognetti, Sergei Maslov
arxiv.org/abs/2506.14006

@arXiv_csDC_bot@mastoxiv.page
2025-07-17 09:23:20

NineToothed: A Triton-Based High-Level Domain-Specific Language for Machine Learning
Jiacheng Huang, Zimin Li, Yinghui Li, Haojie Wang
arxiv.org/abs/2507.11978

@arXiv_statML_bot@mastoxiv.page
2025-06-17 12:16:33

Variational Learning Finds Flatter Solutions at the Edge of Stability
Avrajit Ghosh, Bai Cong, Rio Yokota, Saiprasad Ravishankar, Rongrong Wang, Molei Tao, Mohammad Emtiyaz Khan, Thomas Möllenhoff
arxiv.org/abs/2506.12903

@arXiv_csRO_bot@mastoxiv.page
2025-08-18 08:35:20

GenFlowRL: Shaping Rewards with Generative Object-Centric Flow in Visual Reinforcement Learning
Kelin Yu, Sheng Zhang, Harshit Soora, Furong Huang, Heng Huang, Pratap Tokekar, Ruohan Gao
arxiv.org/abs/2508.11049

@arXiv_mathOC_bot@mastoxiv.page
2025-07-17 08:48:40

Convergence Rate of Generalized Nash Equilibrium Learning in Strongly Monotone Games with Linear Constraints
Tatiana Tatarenko, Maryam Kamgarpour
arxiv.org/abs/2507.12112

@arXiv_csHC_bot@mastoxiv.page
2025-08-18 09:15:50

Human-in-the-Loop Systems for Adaptive Learning Using Generative AI
Bhavishya Tarun, Haoze Du, Dinesh Kannan, Edward F. Gehringer
arxiv.org/abs/2508.11062

@arXiv_csIR_bot@mastoxiv.page
2025-06-17 09:43:44

A Gradient Meta-Learning Joint Optimization for Beamforming and Antenna Position in Pinching-Antenna Systems
Kang Zhou, Weixi Zhou, Donghong Cai, Xianfu Lei, Yanqing Xu, Zhiguo Ding, Pingzhi Fan
arxiv.org/abs/2506.12583

@arXiv_csSE_bot@mastoxiv.page
2025-06-17 09:57:37

Quantum-Inspired Differentiable Integral Neural Networks (QIDINNs): A Feynman-Based Architecture for Continuous Learning Over Streaming Data
Oscar Boullosa Dapena
arxiv.org/abs/2506.12111

@arXiv_csLG_bot@mastoxiv.page
2025-07-17 13:51:58

Replaced article(s) found for cs.LG. arxiv.org/list/cs.LG/new
[3/5]:
- Uncertainty Quantification for Motor Imagery BCI -- Machine Learning vs. Deep Learning
Joris Suurmeijer, Ivo Pascal de Jong, Matias Valdenegro-Toro, Andreea Ioana Sburlea

@arXiv_csCR_bot@mastoxiv.page
2025-06-18 08:28:45

Privacy-Preserving Federated Learning against Malicious Clients Based on Verifiable Functional Encryption
Nina Cai, Jinguang Han
arxiv.org/abs/2506.12846

@arXiv_csCV_bot@mastoxiv.page
2025-06-17 10:19:29

Comparative Analysis of Deep Learning Strategies for Hypertensive Retinopathy Detection from Fundus Images: From Scratch and Pre-trained Models
Yanqiao Zhu
arxiv.org/abs/2506.12492

@arXiv_csCL_bot@mastoxiv.page
2025-06-18 09:12:51

Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs
Ring Team, Bin Hu, Cai Chen, Deng Zhao, Ding Liu, Dingnan Jin, Feng Zhu, Hao Dai, Hongzhi Luan, Jia Guo, Jiaming Liu, Jiewei Wu, Jun Mei, Jun Zhou, Junbo Zhao, Junwu Xiong, Kaihong Zhang, Kuan Xu, Lei Liang, Liang Jiang, Liangcheng Fu, Longfei Zheng, Qiang Gao, Qing Cui, Quan Wan, Shaomian Zheng, Shuaicheng Li, Tongkai Yang, Wang Ren, Xiaodong Yan, Xiaopei Wan, Xiaoyun Feng, Xin Zhao, Xinxing Yang, Xinyu …

@arXiv_csAI_bot@mastoxiv.page
2025-06-18 08:01:27

Causality in the human niche: lessons for machine learning
Richard D. Lange, Konrad P. Kording
arxiv.org/abs/2506.13803

@arXiv_quantph_bot@mastoxiv.page
2025-07-17 10:00:40

Quantum Machine Learning in Multi-Qubit Phase-Space Part I: Foundations
Timothy Heightman, Edward Jiang, Ruth Mora-Soto, Maciej Lewenstein, Marcin Płodzień
arxiv.org/abs/2507.12117

@arXiv_statML_bot@mastoxiv.page
2025-06-17 12:12:49

General and Estimable Learning Bound Unifying Covariate and Concept Shifts
Hongbo Chen, Li Charlie Xia
arxiv.org/abs/2506.12829

@Techmeme@techhub.social
2025-07-15 16:40:52

Apple seems to be working on adding CUDA support to open-source ML framework MLX, which may mean that code developed using MLX would work with CUDA (Malcolm Owen/AppleInsider)
appleinsider.com/articles/25/0

@arXiv_csLG_bot@mastoxiv.page
2025-07-17 10:28:10

A Bayesian Incentive Mechanism for Poison-Resilient Federated Learning
Daniel Commey, Rebecca A. Sarpong, Griffith S. Klogo, Winful Bagyl-Bac, Garth V. Crosby
arxiv.org/abs/2507.12439

@arXiv_csDC_bot@mastoxiv.page
2025-07-18 08:00:52

Autonomous Resource Management in Microservice Systems via Reinforcement Learning
Yujun Zou, Nia Qi, Yingnan Deng, Zhihao Xue, Ming Gong, Wuyang Zhang
arxiv.org/abs/2507.12879

@arXiv_csCV_bot@mastoxiv.page
2025-06-17 09:40:32

EKPC: Elastic Knowledge Preservation and Compensation for Class-Incremental Learning
Huaijie Wang, De Cheng, Lingfeng He, Yan Li, Jie Li, Nannan Wang, Xinbo Gao
arxiv.org/abs/2506.12351

@arXiv_mathOC_bot@mastoxiv.page
2025-06-17 12:24:17

Research on Optimal Control Problem Based on Reinforcement Learning under Knightian Uncertainty
Ziyu Li, Chen Fei, Weiyin Fei
arxiv.org/abs/2506.13207

@arXiv_csCL_bot@mastoxiv.page
2025-07-17 10:05:10

Findings of MEGA: Maths Explanation with LLMs using the Socratic Method for Active Learning
Tosin Adewumi, Foteini Simistira Liwicki, Marcus Liwicki, Viktor Gardelli, Lama Alkhaled, Hamam Mokayed
arxiv.org/abs/2507.12079

@arXiv_csHC_bot@mastoxiv.page
2025-06-16 08:01:09

Conversational AI as a Catalyst for Informal Learning: An Empirical Large-Scale Study on LLM Use in Everyday Learning
Nađa Terzimehić, Babette Bühler, Enkelejda Kasneci
arxiv.org/abs/2506.11789

@arXiv_csRO_bot@mastoxiv.page
2025-07-17 10:00:30

EgoVLA: Learning Vision-Language-Action Models from Egocentric Human Videos
Ruihan Yang, Qinxi Yu, Yecheng Wu, Rui Yan, Borui Li, An-Chieh Cheng, Xueyan Zou, Yunhao Fang, Hongxu Yin, Sifei Liu, Song Han, Yao Lu, Xiaolong Wang
arxiv.org/abs/2507.12440

@arXiv_quantph_bot@mastoxiv.page
2025-07-17 10:02:20

BenchRL-QAS: Benchmarking reinforcement learning algorithms for quantum architecture search
Azhar Ikhtiarudin, Aditi Das, Param Thakkar, Akash Kundu
arxiv.org/abs/2507.12189

@arXiv_csCR_bot@mastoxiv.page
2025-08-18 08:24:50

Activate Me!: Designing Efficient Activation Functions for Privacy-Preserving Machine Learning with Fully Homomorphic Encryption
Nges Brian Njungle, Michel A. Kinsy
arxiv.org/abs/2508.11575

@arXiv_csSE_bot@mastoxiv.page
2025-06-17 09:55:37

The CAISAR Platform: Extending the Reach of Machine Learning Specification and Verification
Michele Alberti (LSL), François Bobot (LSL), Julien Girard-Satabin (LSL), Alban Grastien (LSL), Aymeric Varasse (LSL), Zakaria Chihani (LSL)
arxiv.org/abs/2506.12084

@arXiv_csLG_bot@mastoxiv.page
2025-07-17 10:16:00

Information-Theoretic Generalization Bounds of Replay-based Continual Learning
Wen Wen, Tieliang Gong, Yunjiao Zhang, Zeyu Gao, Weizhan Zhang, Yong-Jin Liu
arxiv.org/abs/2507.12043

@arXiv_csCV_bot@mastoxiv.page
2025-06-18 09:37:21

CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion
Jiahua Ma, Yiran Qin, Yixiong Li, Xuanqi Liao, Yulan Guo, Ruimao Zhang
arxiv.org/abs/2506.14769

@arXiv_statML_bot@mastoxiv.page
2025-06-18 10:27:41

Rademacher learning rates for iterated random functions
Nikola Sandrić
arxiv.org/abs/2506.13946 arxiv.org/pdf/2…

@arXiv_csRO_bot@mastoxiv.page
2025-07-18 08:35:12

Learning to Predict Mobile Robot Stability in Off-Road Environments
Nathaniel Rose, Arif Ahmed, Emanuel Gutierrez-Cornejo, Parikshit Maini
arxiv.org/abs/2507.12731

@arXiv_csLG_bot@mastoxiv.page
2025-07-17 10:20:50

FourCastNet 3: A geometric approach to probabilistic machine-learning weather forecasting at scale
Boris Bonev, Thorsten Kurth, Ankur Mahesh, Mauro Bisson, Jean Kossaifi, Karthik Kashinath, Anima Anandkumar, William D. Collins, Michael S. Pritchard, Alexander Keller
arxiv.org/abs/2507.12144

@arXiv_csCV_bot@mastoxiv.page
2025-08-18 09:52:30

Inside Knowledge: Graph-based Path Generation with Explainable Data Augmentation and Curriculum Learning for Visual Indoor Navigation
Daniel Airinei, Elena Burceanu, Marius Leordeanu
arxiv.org/abs/2508.11446

@arXiv_quantph_bot@mastoxiv.page
2025-07-18 08:38:22

Leveraging Quantum Layers in Classical Neural Networks
Silvie Illésová
arxiv.org/abs/2507.12505 arxiv.org…

@arXiv_csCR_bot@mastoxiv.page
2025-07-18 07:31:52

Safeguarding Federated Learning-based Road Condition Classification
Sheng Liu, Panos Papadimitratos
arxiv.org/abs/2507.12568

@arXiv_statML_bot@mastoxiv.page
2025-06-17 12:30:01

Understanding Learning Invariance in Deep Linear Networks
Hao Duan, Guido Montúfar
arxiv.org/abs/2506.13714 arx…

@arXiv_csRO_bot@mastoxiv.page
2025-08-18 09:09:00

Multi-Group Equivariant Augmentation for Reinforcement Learning in Robot Manipulation
Hongbin Lin, Juan Rojas, Kwok Wai Samuel Au
arxiv.org/abs/2508.11204

@arXiv_csLG_bot@mastoxiv.page
2025-07-17 10:25:50

Improving Reinforcement Learning Sample-Efficiency using Local Approximation
Mohit Prashant, Arvind Easwaran
arxiv.org/abs/2507.12383

@arXiv_csRO_bot@mastoxiv.page
2025-08-18 09:07:50

Actor-Critic for Continuous Action Chunks: A Reinforcement Learning Framework for Long-Horizon Robotic Manipulation with Sparse Reward
Jiarui Yang, Bin Zhu, Jingjing Chen, Yu-Gang Jiang
arxiv.org/abs/2508.11143

@arXiv_statML_bot@mastoxiv.page
2025-06-17 12:19:53

Random Matrix Theory for Deep Learning: Beyond Eigenvalues of Linear Models
Zhenyu Liao, Michael W. Mahoney
arxiv.org/abs/2506.13139

@arXiv_csCR_bot@mastoxiv.page
2025-07-18 09:07:02

A Crowdsensing Intrusion Detection Dataset For Decentralized Federated Learning Models
Chao Feng, Alberto Huertas Celdran, Jing Han, Heqing Ren, Xi Cheng, Zien Zeng, Lucas Krauter, Gerome Bovet, Burkhard Stiller
arxiv.org/abs/2507.13313

@arXiv_csRO_bot@mastoxiv.page
2025-07-18 09:51:32

Evaluating Reinforcement Learning Algorithms for Navigation in Simulated Robotic Quadrupeds: A Comparative Study Inspired by Guide Dog Behaviour
Emma M. A. Harrison
arxiv.org/abs/2507.13277

@arXiv_csCR_bot@mastoxiv.page
2025-06-17 09:32:15

InverTune: Removing Backdoors from Multimodal Contrastive Learning Models via Trigger Inversion and Activation Tuning
Mengyuan Sun, Yu Li, Yuchen Liu, Bo Du, Yunjie Ge
arxiv.org/abs/2506.12411

@arXiv_statML_bot@mastoxiv.page
2025-06-17 12:10:25

A Transfer Learning Framework for Multilayer Networks via Model Averaging
Yongqin Qiu, Xinyu Zhang
arxiv.org/abs/2506.12455

@arXiv_csCV_bot@mastoxiv.page
2025-08-18 09:53:20

CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models
Xiaoxue Wu, Bingjie Gao, Yu Qiao, Yaohui Wang, Xinyuan Chen
arxiv.org/abs/2508.11484

@arXiv_csLG_bot@mastoxiv.page
2025-07-18 13:38:26

Replaced article(s) found for cs.LG. arxiv.org/list/cs.LG/new
[2/5]:
- Learning Universal Human Mobility Patterns with a Foundation Model for Cross-domain Data Fusion
Haoxuan Ma, Xishun Liao, Yifan Liu, Qinhua Jiang, Chris Stanford, Shangqing Cao, Jiaqi Ma

@arXiv_csRO_bot@mastoxiv.page
2025-06-18 08:40:11

Quadrotor Morpho-Transition: Learning vs Model-Based Control Strategies
Ioannis Mandralis, Richard M. Murray, Morteza Gharib
arxiv.org/abs/2506.14039

@arXiv_csCR_bot@mastoxiv.page
2025-07-17 09:01:20

A Privacy-Preserving Framework for Advertising Personalization Incorporating Federated Learning and Differential Privacy
Xiang Li, Yifan Lin, Yuanzhe Zhang
arxiv.org/abs/2507.12098

@arXiv_statML_bot@mastoxiv.page
2025-06-18 10:27:34

Beyond Shapley Values: Cooperative Games for the Interpretation of Machine Learning Models
Marouane Il Idrissi, Agathe Fernandes Machado, Arthur Charpentier
arxiv.org/abs/2506.13900

@arXiv_csCV_bot@mastoxiv.page
2025-07-18 10:22:02

$\pi^3$: Scalable Permutation-Equivariant Visual Geometry Learning
Yifan Wang, Jianjun Zhou, Haoyi Zhu, Wenzheng Chang, Yang Zhou, Zizun Li, Junyi Chen, Jiangmiao Pang, Chunhua Shen, Tong He
arxiv.org/abs/2507.13347

@arXiv_csLG_bot@mastoxiv.page
2025-07-17 13:52:20

Replaced article(s) found for cs.LG. arxiv.org/list/cs.LG/new
[5/5]:
- Machine Learning-Driven Compensation for Non-Ideal Channels in AWG-Based FBG Interrogator
Kazakov, Kulichenko, Kovalev, Treskova, Barma, Malakhov, Oseledets, Shipulin

@arXiv_csRO_bot@mastoxiv.page
2025-07-18 09:44:02

ZipMPC: Compressed Context-Dependent MPC Cost via Imitation Learning
Rahel Rickenbach, Alan A. Lahoud, Erik Schaffernicht, Melanie N. Zeilinger, Johannes A. Stork
arxiv.org/abs/2507.13088

@arXiv_csRO_bot@mastoxiv.page
2025-07-18 09:21:42

DEMONSTRATE: Zero-shot Language to Robotic Control via Multi-task Demonstration Learning
Rahel Rickenbach, Bruce Lee, René Zurbrügg, Carmen Amo Alonso, Melanie N. Zeilinger
arxiv.org/abs/2507.12855

@arXiv_csLG_bot@mastoxiv.page
2025-07-18 13:38:56

Replaced article(s) found for cs.LG. arxiv.org/list/cs.LG/new
[5/5]:
- EgoVLA: Learning Vision-Language-Action Models from Egocentric Human Videos
Yang, Yu, Wu, Yan, Li, Cheng, Zou, Fang, Yin, Liu, Han, Lu, Wang

@arXiv_csRO_bot@mastoxiv.page
2025-07-16 10:04:41

ILCL: Inverse Logic-Constraint Learning from Temporally Constrained Demonstrations
Minwoo Cho, Jaehwi Jang, Daehyung Park
arxiv.org/abs/2507.11000

@arXiv_csLG_bot@mastoxiv.page
2025-06-17 19:04:33

Replaced article(s) found for cs.LG. arxiv.org/list/cs.LG/new
[1/8]:
Boosting Resource-Constrained Federated Learning Systems with Guessed Updates

@arXiv_csLG_bot@mastoxiv.page
2025-06-17 19:05:35

Replaced article(s) found for cs.LG. arxiv.org/list/cs.LG/new
[7/8]:
Achieving Collective Welfare in Multi-Agent Reinforcement Learning via Suggestion Sharing

@arXiv_csLG_bot@mastoxiv.page
2025-07-17 10:26:00

Trustworthy Tree-based Machine Learning by $MoS_2$ Flash-based Analog CAM with Inherent Soft Boundaries
Bo Wen, Guoyun Gao, Zhicheng Xu, Ruibin Mao, Xiaojuan Qi, X. Sharon Hu, Xunzhao Yin, Can Li
arxiv.org/abs/2507.12384