Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csLG_bot@mastoxiv.page
2025-06-10 19:19:21

This arxiv.org/abs/2505.11862 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_quantph_bot@mastoxiv.page
2025-06-10 18:12:00

This arxiv.org/abs/2204.04198 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_eessSY_bot@mastoxiv.page
2025-06-11 09:03:25

Q-learning-based Hierarchical Cooperative Local Search for Steelmaking-continuous Casting Scheduling Problem
Yang Lv, Rong Hu, Bin Qian, Jian-Bo Yang
arxiv.org/abs/2506.08608

@arXiv_qfinST_bot@mastoxiv.page
2025-06-10 18:24:00

This arxiv.org/abs/2506.03780 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_csSE_bot@mastoxiv.page
2025-06-10 16:46:19

This arxiv.org/abs/2207.04285 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_csLG_bot@mastoxiv.page
2025-07-11 10:23:31

Reinforcement Learning with Action Chunking
Qiyang Li, Zhiyuan Zhou, Sergey Levine
arxiv.org/abs/2507.07969 arxiv.org/pdf/2507.07969 arxiv.org/html/2507.07969
arXiv:2507.07969v1 Announce Type: new
Abstract: We present Q-chunking, a simple yet effective recipe for improving reinforcement learning (RL) algorithms for long-horizon, sparse-reward tasks. Our recipe is designed for the offline-to-online RL setting, where the goal is to leverage an offline prior dataset to maximize the sample-efficiency of online learning. Effective exploration and sample-efficient learning remain central challenges in this setting, as it is not obvious how the offline data should be utilized to acquire a good exploratory policy. Our key insight is that action chunking, a technique popularized in imitation learning where sequences of future actions are predicted rather than a single action at each timestep, can be applied to temporal difference (TD)-based RL methods to mitigate the exploration challenge. Q-chunking adopts action chunking by directly running RL in a 'chunked' action space, enabling the agent to (1) leverage temporally consistent behaviors from offline data for more effective online exploration and (2) use unbiased $n$-step backups for more stable and efficient TD learning. Our experimental results demonstrate that Q-chunking exhibits strong offline performance and online sample efficiency, outperforming prior best offline-to-online methods on a range of long-horizon, sparse-reward manipulation tasks.
toXiv_bot_toot

@arXiv_qbioGN_bot@mastoxiv.page
2025-07-10 13:12:28

Replaced article(s) found for q-bio.GN. arxiv.org/list/q-bio.GN/new
[1/1]:
- Terrier: A Deep Learning Repeat Classifier
Robert Turnbull, Neil D. Young, Edoardo Tescari, Lee F. Skerratt, Tiffany A. Kosch

@Mediagazer@mstdn.social
2025-06-10 08:01:15

Q&A with Maggie Haberman about covering Trump's second term, how the media is separating "the signal from the noise", Trump's threats, Biden's decline, and more (Natalie Korach/Vanity Fair)
vanityfair.com/news/story/magg

@arXiv_csCR_bot@mastoxiv.page
2025-07-10 09:09:51

Q-Detection: A Quantum-Classical Hybrid Poisoning Attack Detection Method
Haoqi He, Xiaokai Lin, Jiancai Chen, Yan Xiao
arxiv.org/abs/2507.06262

@arXiv_statML_bot@mastoxiv.page
2025-06-06 07:39:46

Regret-Optimal Q-Learning with Low Cost for Single-Agent and Federated Reinforcement Learning
Haochen Zhang, Zhong Zheng, Lingzhou Xue
arxiv.org/abs/2506.04626

@arXiv_csAI_bot@mastoxiv.page
2025-07-04 09:21:51

Dilution, Diffusion and Symbiosis in Spatial Prisoner's Dilemma with Reinforcement Learning
Gustavo C. Mangold, Heitor C. M. Fernandes, Mendeli H. Vainstein
arxiv.org/abs/2507.02211

@arXiv_eessSP_bot@mastoxiv.page
2025-07-08 11:45:30

Deep Learning Based Antenna Selection Technique for RIS-Empowered RQSM System
Burak Ahmet Ozden, Fatih Cogen, Erdogan Aydin
arxiv.org/abs/2507.05071

@arXiv_csLG_bot@mastoxiv.page
2025-07-11 10:23:41

EXPO: Stable Reinforcement Learning with Expressive Policies
Perry Dong, Qiyang Li, Dorsa Sadigh, Chelsea Finn
arxiv.org/abs/2507.07986 arxiv.org/pdf/2507.07986 arxiv.org/html/2507.07986
arXiv:2507.07986v1 Announce Type: new
Abstract: We study the problem of training and fine-tuning expressive policies with online reinforcement learning (RL) given an offline dataset. Training expressive policy classes with online RL present a unique challenge of stable value maximization. Unlike simpler Gaussian policies commonly used in online RL, expressive policies like diffusion and flow-matching policies are parameterized by a long denoising chain, which hinders stable gradient propagation from actions to policy parameters when optimizing against some value function. Our key insight is that we can address stable value maximization by avoiding direct optimization over value with the expressive policy and instead construct an on-the-fly RL policy to maximize Q-value. We propose Expressive Policy Optimization (EXPO), a sample-efficient online RL algorithm that utilizes an on-the-fly policy to maximize value with two parameterized policies -- a larger expressive base policy trained with a stable imitation learning objective and a light-weight Gaussian edit policy that edits the actions sampled from the base policy toward a higher value distribution. The on-the-fly policy optimizes the actions from the base policy with the learned edit policy and chooses the value maximizing action from the base and edited actions for both sampling and temporal-difference (TD) backup. Our approach yields up to 2-3x improvement in sample efficiency on average over prior methods both in the setting of fine-tuning a pretrained policy given offline data and in leveraging offline data to train online.
toXiv_bot_toot

@arXiv_qbiobm_bot@mastoxiv.page
2025-08-06 13:28:45

Replaced article(s) found for q-bio.BM. arxiv.org/list/q-bio.BM/new
[1/1]:
- Topological Learning Prediction of Virus-like Particle Stoichiometry and Stability
Xiang Liu, Xuefei Huang, Guo-Wei Wei

@arXiv_qfinPM_bot@mastoxiv.page
2025-08-07 13:17:24

Replaced article(s) found for q-fin.PM. arxiv.org/list/q-fin.PM/new
[1/1]:
- Evaluation of Deep Reinforcement Learning Algorithms for Portfolio Optimisation
Chung I Lu

@arXiv_mathOC_bot@mastoxiv.page
2025-06-04 07:43:39

Learning-based primal-dual optimal control of discrete-time stochastic systems with multiplicative noise
Xiushan Jiang, Weihai Zhang
arxiv.org/abs/2506.02613

@arXiv_eessSY_bot@mastoxiv.page
2025-08-06 09:41:20

Improving Q-Learning for Real-World Control: A Case Study in Series Hybrid Agricultural Tractors
Hend Abououf, Sidra Ghayour Bhatti, Qadeer Ahmed
arxiv.org/abs/2508.03647

@arXiv_csDC_bot@mastoxiv.page
2025-08-04 08:10:21

Integrated user scheduling and beam steering in over-the-air federated learning for mobile IoT
Shengheng Liu, Ningning Fu, Zhonghao Zhang, Yongming Huang, Tony Q. S. Quek
arxiv.org/abs/2508.00341

@arXiv_qfinCP_bot@mastoxiv.page
2025-08-04 12:37:04

Replaced article(s) found for q-fin.CP. arxiv.org/list/q-fin.CP/new
[1/1]:
- Decision by Supervised Learning with Deep Ensembles: A Practical Framework for Robust Portfolio O...
Juhyeong Kim, Sungyoon Choi, Youngbin Lee, Yejin Kim, Yongmin Choi, Yongjae Lee

@arXiv_csNI_bot@mastoxiv.page
2025-08-05 07:36:50

A Deep Reinforcement Learning-Based TCP Congestion Control Algorithm: Design, Simulation, and Evaluation
Efe A\u{g}lamazlar, Emirhan Eken, Harun Batur Ge\c{c}ici
arxiv.org/abs/2508.01047

@arXiv_econGN_bot@mastoxiv.page
2025-05-30 07:21:55

Learning to Charge More: A Theoretical Study of Collusion by Q-Learning Agents
Cristian Chica, Yinglong Guo, Gilad Lerman
arxiv.org/abs/2505.22909

@arXiv_eessIV_bot@mastoxiv.page
2025-06-05 09:46:01

This arxiv.org/abs/2506.00605 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_qbioQM_bot@mastoxiv.page
2025-07-30 13:19:41

Replaced article(s) found for q-bio.QM. arxiv.org/list/q-bio.QM/new
[1/1]:
- Machine learning-based multimodal prognostic models integrating pathology images and high-through...
Charlotte Jennings, Andrew Broad, Lucy Godson, Emily Clarke, David Westhead, Darren Treanor

@arXiv_csIR_bot@mastoxiv.page
2025-06-05 09:39:21

This arxiv.org/abs/2308.03734 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_quantph_bot@mastoxiv.page
2025-06-06 10:11:26

This arxiv.org/abs/2412.20925 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_qbiobm_bot@mastoxiv.page
2025-08-05 17:22:50

Replaced article(s) found for q-bio.BM. arxiv.org/list/q-bio.BM/new
[1/1]:
- Hierarchical Multi-Label Contrastive Learning for Protein-Protein Interaction Prediction Across O...
Shiyi Liu, Buwen Liang, Yuetong Fang, Zixuan Jiang, Renjing Xu

@arXiv_qbioNC_bot@mastoxiv.page
2025-07-29 16:19:02

Replaced article(s) found for q-bio.NC. arxiv.org/list/q-bio.NC/new
[1/1]:
- Advantageous and disadvantageous inequality aversion can be taught through vicarious learning of ...
Shen Zhang, Oriel FeldmanHall, S\'ebastien H\'etu, A. Ross Otto

@arXiv_csGR_bot@mastoxiv.page
2025-06-06 09:34:53

This arxiv.org/abs/2502.17327 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_statME_bot@mastoxiv.page
2025-06-03 16:55:00

This arxiv.org/abs/2112.13738 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_csGT_bot@mastoxiv.page
2025-05-29 10:10:54

This arxiv.org/abs/2208.09407 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_econTH_bot@mastoxiv.page
2025-06-06 09:38:56

This arxiv.org/abs/2202.12453 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_qfinTR_bot@mastoxiv.page
2025-06-06 07:39:03

Can Artificial Intelligence Trade the Stock Market?
J\k{e}drzej Maskiewicz, Pawe{\l} Sakowski
arxiv.org/abs/2506.04658

@arXiv_csCR_bot@mastoxiv.page
2025-06-03 16:18:49

This arxiv.org/abs/2402.02160 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_mathNA_bot@mastoxiv.page
2025-06-03 16:43:06

This arxiv.org/abs/2111.06931 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_condmatmeshall_bot@mastoxiv.page
2025-06-04 13:47:02

This arxiv.org/abs/2209.09443 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_csAI_bot@mastoxiv.page
2025-06-05 09:45:09

This arxiv.org/abs/2506.02139 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_csMA_bot@mastoxiv.page
2025-06-02 07:19:33

Distributed Neural Policy Gradient Algorithm for Global Convergence of Networked Multi-Agent Reinforcement Learning
Pengcheng Dai, Yuanqiu Mo, Wenwu Yu, Wei Ren
arxiv.org/abs/2505.24113

@arXiv_csLG_bot@mastoxiv.page
2025-06-03 21:54:47

This arxiv.org/abs/2505.19946 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_eessSP_bot@mastoxiv.page
2025-08-04 08:39:40

When Vision-Language Model (VLM) Meets Beam Prediction: A Multimodal Contrastive Learning Framework
Ji Wang, Bin Tang, Jian Xiao, Qimei Cui, Xingwang Li, Tony Q. S. Quek
arxiv.org/abs/2508.00456

@arXiv_qfinPM_bot@mastoxiv.page
2025-08-04 12:37:21

Replaced article(s) found for q-fin.PM. arxiv.org/list/q-fin.PM/new
[1/1]:
- Decision by Supervised Learning with Deep Ensembles: A Practical Framework for Robust Portfolio O...
Juhyeong Kim, Sungyoon Choi, Youngbin Lee, Yejin Kim, Yongmin Choi, Yongjae Lee

@arXiv_csNI_bot@mastoxiv.page
2025-07-01 10:03:43

Offline Reinforcement Learning for Mobility Robustness Optimization
Pegah Alizadeh, Anastasios Giovanidis, Pradeepa Ramachandra, Vasileios Koutsoukis, Osama Arouk
arxiv.org/abs/2506.22793

@arXiv_qbiobm_bot@mastoxiv.page
2025-08-04 12:37:57

Replaced article(s) found for q-bio.BM. arxiv.org/list/q-bio.BM/new
[1/1]:
- Hierarchical Multi-Label Contrastive Learning for Protein-Protein Interaction Prediction Across O...
Shiyi Liu, Buwen Liang, Yuetong Fang, Zixuan Jiang, Renjing Xu

@arXiv_csLG_bot@mastoxiv.page
2025-06-03 16:45:26

This arxiv.org/abs/2205.09337 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_csDS_bot@mastoxiv.page
2025-05-29 10:10:17

This arxiv.org/abs/2406.03674 has been replaced.
initial toot: mastoxiv.page/@arXiv_csDS_…

@arXiv_eessIV_bot@mastoxiv.page
2025-06-03 16:16:00

This arxiv.org/abs/2207.06418 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_qfinGN_bot@mastoxiv.page
2025-07-24 12:51:24

Replaced article(s) found for q-fin.GN. arxiv.org/list/q-fin.GN/new
[1/1]:
- Machine Learning Classification and Portfolio Allocation: with Implications from Machine Uncertainty
Yang Bai, Kuntara Pukthuanthong

@arXiv_qfinCP_bot@mastoxiv.page
2025-08-01 12:56:29

Replaced article(s) found for q-fin.CP. arxiv.org/list/q-fin.CP/new
[1/1]:
- Decision by Supervised Learning with Deep Ensembles: A Practical Framework for Robust Portfolio O...
Juhyeong Kim, Sungyoon Choi, Youngbin Lee, Yejin Kim, Yongmin Choi, Yongjae Lee

@arXiv_qfinST_bot@mastoxiv.page
2025-07-31 12:36:39

Replaced article(s) found for q-fin.ST. arxiv.org/list/q-fin.ST/new
[1/1]:
- Year-over-Year Developments in Financial Fraud Detection via Deep Learning: A Systematic Literatu...
Yisong Chen, Chuqing Zhao, Yixin Xu, Chuanhao Nie, Yixin Zhang

@arXiv_qbioNC_bot@mastoxiv.page
2025-06-27 12:54:15

Replaced article(s) found for q-bio.NC. arxiv.org/list/q-bio.NC/new
[1/1]:
- Inverse Reinforcement Learning via Convex Optimization
Hao Zhu, Yuan Zhang, Joschka Boedecker

@arXiv_qfinPR_bot@mastoxiv.page
2025-07-21 12:10:56

Replaced article(s) found for q-fin.PR. arxiv.org/list/q-fin.PR/new
[1/1]:
- Machine-learning regression methods for American-style path-dependent contracts
Matteo Gambara, Giulia Livieri, Andrea Pallavicini

@arXiv_csOS_bot@mastoxiv.page
2025-05-22 09:47:36

This arxiv.org/abs/2409.19434 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_econEM_bot@mastoxiv.page
2025-05-29 10:13:38

This arxiv.org/abs/2401.17909 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_statME_bot@mastoxiv.page
2025-07-01 10:35:53

Robust estimation of optimal dynamic treatment regimes with nonignorable missing covariates
Jian Sun, Bo Fu, Li Su
arxiv.org/abs/2506.22892

@arXiv_csCR_bot@mastoxiv.page
2025-06-02 10:06:53

This arxiv.org/abs/2503.00140 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_csLG_bot@mastoxiv.page
2025-06-03 17:19:46

This arxiv.org/abs/2205.10538 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_mathRA_bot@mastoxiv.page
2025-05-27 13:40:10

This arxiv.org/abs/2210.07383 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_physicsedph_bot@mastoxiv.page
2025-05-30 10:09:21

This arxiv.org/abs/2211.00694 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_econTH_bot@mastoxiv.page
2025-05-29 10:14:01

This arxiv.org/abs/2304.12647 has been replaced.
initial toot: mastoxiv.page/@arXiv_eco…

@arXiv_eessSP_bot@mastoxiv.page
2025-07-31 07:49:41

Deep Learning for Gradient and BCG Artifacts Removal in EEG During Simultaneous fMRI
K. A. Shahriar, E. H. Bhuiyan, Q. Luo, M. E. H. Chowdhury, X. J. Zhou
arxiv.org/abs/2507.22263

@arXiv_qfinPM_bot@mastoxiv.page
2025-08-01 12:56:17

Replaced article(s) found for q-fin.PM. arxiv.org/list/q-fin.PM/new
[1/1]:
- Decision by Supervised Learning with Deep Ensembles: A Practical Framework for Robust Portfolio O...
Juhyeong Kim, Sungyoon Choi, Youngbin Lee, Yejin Kim, Yongmin Choi, Yongjae Lee

@arXiv_statML_bot@mastoxiv.page
2025-06-26 08:38:40

A Principled Path to Fitted Distributional Evaluation
Sungee Hong, Jiayi Wang, Zhengling Qi, Raymond Ka Wai Wong
arxiv.org/abs/2506.20048

@arXiv_quantph_bot@mastoxiv.page
2025-07-23 10:27:32

Hybrid Reward-Driven Reinforcement Learning for Efficient Quantum Circuit Synthesis
Sara Giordano, Kornikar Sen, Miguel A. Martin-Delgado
arxiv.org/abs/2507.16641

@arXiv_qbioQM_bot@mastoxiv.page
2025-06-16 14:59:44

Replaced article(s) found for q-bio.QM. arxiv.org/list/q-bio.QM/new
[1/1]:
DeepGDel: Deep Learning-based Gene Deletion Prediction Framework for Growth-Coupled Production in...

@arXiv_qfinCP_bot@mastoxiv.page
2025-07-28 12:26:51

Replaced article(s) found for q-fin.CP. arxiv.org/list/q-fin.CP/new
[1/1]:
- Decision by Supervised Learning with Deep Ensembles: A Practical Framework for Robust Portfolio O...
Juhyeong Kim, Sungyoon Choi, Youngbin Lee, Yejin Kim, Yongmin Choi, Yongjae Lee

@arXiv_hepph_bot@mastoxiv.page
2025-07-21 08:35:00

Theory-informed neural networks for particle physics
Barry M. Dillon, Michael Spannowsky
arxiv.org/abs/2507.13447 arx…

@arXiv_qfinRM_bot@mastoxiv.page
2025-05-19 09:47:19

This arxiv.org/abs/2203.01664 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_csAI_bot@mastoxiv.page
2025-07-30 14:03:38

Replaced article(s) found for cs.AI. arxiv.org/list/cs.AI/new
[1/5]:
- A finite time analysis of distributed Q-learning
Han-Dong Lim, Donghwan Lee

@arXiv_mathOC_bot@mastoxiv.page
2025-07-28 08:15:31

Multi-Year Maintenance Planning for Large-Scale Infrastructure Systems: A Novel Network Deep Q-Learning Approach
Amir Fard, Arnold X. -X. Yuan
arxiv.org/abs/2507.18732

@arXiv_qbioNC_bot@mastoxiv.page
2025-06-18 15:22:27

Replaced article(s) found for q-bio.NC. arxiv.org/list/q-bio.NC/new
[1/1]:
Spatiotemporal Learning of Brain Dynamics from fMRI Using Frequency-Specific Multi-Band Attention...

@arXiv_eessSY_bot@mastoxiv.page
2025-07-14 07:52:32

Deep Reinforcement Learning in Applied Control: Challenges, Analysis, and Insights
Klinsmann Agyei, Pouria Sarhadi, Daniel Polani
arxiv.org/abs/2507.08196

@arXiv_qbioGN_bot@mastoxiv.page
2025-06-17 18:16:28

Replaced article(s) found for q-bio.GN. arxiv.org/list/q-bio.GN/new
[1/1]:
MLOmics: Cancer Multi-Omics Database for Machine Learning

@arXiv_qfinPM_bot@mastoxiv.page
2025-07-28 12:26:10

Replaced article(s) found for q-fin.PM. arxiv.org/list/q-fin.PM/new
[1/1]:
- Decision by Supervised Learning with Deep Ensembles: A Practical Framework for Robust Portfolio O...
Juhyeong Kim, Sungyoon Choi, Youngbin Lee, Yejin Kim, Yongmin Choi, Yongjae Lee

@arXiv_csMA_bot@mastoxiv.page
2025-07-30 12:54:04

Replaced article(s) found for cs.MA. arxiv.org/list/cs.MA/new
[1/1]:
- A finite time analysis of distributed Q-learning
Han-Dong Lim, Donghwan Lee

@arXiv_physicsgenph_bot@mastoxiv.page
2025-05-21 10:01:55

This arxiv.org/abs/2501.13881 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_qbiobm_bot@mastoxiv.page
2025-07-22 16:43:33

Replaced article(s) found for q-bio.BM. arxiv.org/list/q-bio.BM/new
[1/1]:
- JAMUN: Bridging Smoothed Molecular Dynamics and Score-Based Learning for Conformational Ensembles
Ameya Daigavane, Bodhi P. Vani, Darcy Davidson, Saeed Saremi, Joshua Rackers, Joseph Kleinhenz

@arXiv_qfinCP_bot@mastoxiv.page
2025-07-24 12:51:49

Replaced article(s) found for q-fin.CP. arxiv.org/list/q-fin.CP/new
[1/1]:
- Machine Learning Classification and Portfolio Allocation: with Implications from Machine Uncertainty
Yang Bai, Kuntara Pukthuanthong

@arXiv_csNI_bot@mastoxiv.page
2025-06-19 08:28:34

GCN-Driven Reinforcement Learning for Probabilistic Real-Time Guarantees in Industrial URLLC
Eman Alqudah, Ashfaq Khokhar
arxiv.org/abs/2506.15011

@arXiv_qbioNC_bot@mastoxiv.page
2025-07-16 13:15:13

Replaced article(s) found for q-bio.NC. arxiv.org/list/q-bio.NC/new
[1/1]:
- Summary statistics of learning link changing neural representations to behavior
Jacob A. Zavatone-Veth, Blake Bordelon, Cengiz Pehlevan

@arXiv_statML_bot@mastoxiv.page
2025-07-24 09:02:39

To Trust or Not to Trust: On Calibration in ML-based Resource Allocation for Wireless Networks
Rashika Raina, Nidhi Simmons, David E. Simmons, Michel Daoud Yacoub, Trung Q. Duong
arxiv.org/abs/2507.17494

@arXiv_qbioQM_bot@mastoxiv.page
2025-06-13 14:40:57

Replaced article(s) found for q-bio.QM. arxiv.org/list/q-bio.QM/new/
[1/1]:
DeepGDel: Deep Learning-based Gene Deletion Prediction Framework for Growth-Coupled Production in...

@arXiv_csLG_bot@mastoxiv.page
2025-06-23 18:27:11

Replaced article(s) found for cs.LG. arxiv.org/list/cs.LG/new
[4/9]:
- Eau De $Q$-Network: Adaptive Distillation of Neural Networks in Deep Reinforcement Learning
Th\'eo Vincent, Tim Faust, Yogesh Tripathi, Jan Peters, Carlo D'Eramo

@arXiv_econGN_bot@mastoxiv.page
2025-05-28 10:14:11

This arxiv.org/abs/2502.15084 has been replaced.
initial toot: mastoxiv.page/@arXiv_eco…

@arXiv_qfinPM_bot@mastoxiv.page
2025-07-24 12:52:27

Replaced article(s) found for q-fin.PM. arxiv.org/list/q-fin.PM/new
[1/1]:
- Machine Learning Classification and Portfolio Allocation: with Implications from Machine Uncertainty
Yang Bai, Kuntara Pukthuanthong

@arXiv_qbiobm_bot@mastoxiv.page
2025-06-17 18:14:03

Replaced article(s) found for q-bio.BM. arxiv.org/list/q-bio.BM/new
[1/1]:
Interpretable Multimodal Learning for Tumor Protein-Metal Binding: Progress, Challenges, and Pers...

@arXiv_qbioGN_bot@mastoxiv.page
2025-07-14 12:17:54

Replaced article(s) found for q-bio.GN. arxiv.org/list/q-bio.GN/new
[1/1]:
- On learning functions over biological sequence space: relating Gaussian process priors, regulariz...
Samantha Petti, Carlos Mart\'i-G\'omez, Justin B. Kinney, Juannan Zhou, David M. Mc…

@arXiv_mathOC_bot@mastoxiv.page
2025-07-21 12:22:04

Replaced article(s) found for math.OC. arxiv.org/list/math.OC/new
[1/1]:
- Stochastic Primal-Dual Q-Learning
Narim Jeong, Donghwan Lee, Niao He

@arXiv_mathOC_bot@mastoxiv.page
2025-07-18 13:02:07

Replaced article(s) found for math.OC. arxiv.org/list/math.OC/new
[1/1]:
- Stochastic Primal-Dual Q-Learning
Donghwan Lee, Niao He