Tootfinder

Opt-in global Mastodon full text search.

@arXiv_csLG_bot@mastoxiv.page
2025-08-29 10:18:31

Unbiased Stochastic Optimization for Gaussian Processes on Finite Dimensional RKHS
Neta Shoham, Haim Avron
arxiv.org/abs/2508.20588 arxiv.o…

@arXiv_csPF_bot@mastoxiv.page
2025-07-29 07:52:01

Towards Generalized Parameter Tuning in Coherent Ising Machines: A Portfolio-Based Approach
Tatsuro Hanyu, Takahiro Katagiri, Daichi Mukunoki, Tetsuya Hoshino
arxiv.org/abs/2507.20295

@arXiv_csSE_bot@mastoxiv.page
2025-07-22 10:56:00

Investigating the Role of LLMs Hyperparameter Tuning and Prompt Engineering to Support Domain Modeling
Vladyslav Bulhakov, Giordano d'Aloisio, Claudio Di Sipio, Antinisca Di Marco, Davide Di Ruscio
arxiv.org/abs/2507.14735

@arXiv_csCR_bot@mastoxiv.page
2025-08-22 09:33:41

Private Hyperparameter Tuning with Ex-Post Guarantee
Badih Ghazi, Pritish Kamath, Alexander Knop, Ravi Kumar, Pasin Manurangsi, Chiyuan Zhang
arxiv.org/abs/2508.15183

@arXiv_csCL_bot@mastoxiv.page
2025-06-27 09:58:19

Bridging Offline and Online Reinforcement Learning for LLMs
Jack Lanchantin, Angelica Chen, Janice Lan, Xian Li, Swarnadeep Saha, Tianlu Wang, Jing Xu, Ping Yu, Weizhe Yuan, Jason E Weston, Sainbayar Sukhbaatar, Ilia Kulikov
arxiv.org/abs/2506.21495 arxiv.org/pdf/2506.21495 arxiv.org/html/2506.21495
arXiv:2506.21495v1 Announce Type: new
Abstract: We investigate the effectiveness of reinforcement learning methods for finetuning large language models when transitioning from offline to semi-online to fully online regimes for both verifiable and non-verifiable tasks. Our experiments cover training on verifiable math as well as non-verifiable instruction following with a set of benchmark evaluations for both. Across these settings, we extensively compare online and semi-online Direct Preference Optimization and Group Reward Policy Optimization objectives, and surprisingly find similar performance and convergence between these variants, which all strongly outperform offline methods. We provide a detailed analysis of the training dynamics and hyperparameter selection strategies to achieve optimal results. Finally, we show that multi-tasking with verifiable and non-verifiable rewards jointly yields improved performance across both task types.

@arXiv_physicsoptics_bot@mastoxiv.page
2025-07-23 08:03:32

Hyperparameter-free minimum-lengthscale constraints for topology optimization
Rodrigo Arrieta, Giuseppe Romano, Steven G. Johnson
arxiv.org/abs/2507.16108

@arXiv_csET_bot@mastoxiv.page
2025-07-23 08:04:12

Quantum Annealing Hyperparameter Analysis for Optimal Sensor Placement in Production Environments
Nico Kraus, Marvin Erdmann, Alexander Kuzmany, Daniel Porawski, Jonas Stein
arxiv.org/abs/2507.16584

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-07-25 08:52:22

Black-box optimization using factorization and Ising machines
Ryo Tamura, Yuya Seki, Yuki Minamoto, Koki Kitai, Yoshiki Matsuda, Shu Tanaka, Koji Tsuda
arxiv.org/abs/2507.18003

@arXiv_csCV_bot@mastoxiv.page
2025-08-22 10:08:21

From Linearity to Non-Linearity: How Masked Autoencoders Capture Spatial Correlations
Anthony Bisulco, Rahul Ramesh, Randall Balestriero, Pratik Chaudhari
arxiv.org/abs/2508.15404

@arXiv_csLG_bot@mastoxiv.page
2025-08-25 10:01:50

Sparse but Wrong: Incorrect L0 Leads to Incorrect Features in Sparse Autoencoders
David Chanin, Adrià Garriga-Alonso
arxiv.org/abs/2508.16560

@arXiv_mathOC_bot@mastoxiv.page
2025-07-23 09:12:02

Learning Acceleration Algorithms for Fast Parametric Convex Optimization with Certified Robustness
Rajiv Sambharya, Jinho Bok, Nikolai Matni, George Pappas
arxiv.org/abs/2507.16264

@arXiv_statML_bot@mastoxiv.page
2025-08-21 08:10:59

Evaluation and Optimization of Leave-one-out Cross-validation for the Lasso
Ryan Burn
arxiv.org/abs/2508.14368 arxiv.org/pdf/2508.14368

@arXiv_csGR_bot@mastoxiv.page
2025-08-07 07:33:23

RLGS: Reinforcement Learning-Based Adaptive Hyperparameter Tuning for Gaussian Splatting
Zhan Li, Huangying Zhan, Changyang Li, Qingan Yan, Yi Xu
arxiv.org/abs/2508.04078

@arXiv_csNE_bot@mastoxiv.page
2025-07-22 07:49:50

Analyzing Internal Activity and Robustness of SNNs Across Neuron Parameter Space
Szymon Mazurek, Jakub Caputa, Maciej Wielgosz
arxiv.org/abs/2507.14757

@arXiv_eessIV_bot@mastoxiv.page
2025-07-09 07:44:22

Dual-Attention U-Net with Class-Specific Ensembles and Bayesian Hyperparameter Optimization for Precise Wound and Scale Marker Segmentation
Daniel Cieślak, Miriam Reca, Olena Onyshchenko, Jacek Rumiński
arxiv.org/abs/2507.05314

@arXiv_csLG_bot@mastoxiv.page
2025-08-20 10:05:00

In-Context Decision Making for Optimizing Complex AutoML Pipelines
Amir Rezaei Balef, Katharina Eggensperger
arxiv.org/abs/2508.13657 arxiv…

@arXiv_physicscompph_bot@mastoxiv.page
2025-06-23 09:13:00

Great Restraining Wall in Multidimensional Collective Variable Space
Zhijun Pan, Maodong Li, Dechin Chen, Yi Isaac Yang
arxiv.org/abs/2506.17043

@arXiv_csHC_bot@mastoxiv.page
2025-07-17 09:16:00

Dataset-Adaptive Dimensionality Reduction
Hyeon Jeon, Jeongin Park, Soohyun Lee, Dae Hyun Kim, Sungbok Shin, Jinwook Seo
arxiv.org/abs/2507.11984

@arXiv_hepph_bot@mastoxiv.page
2025-08-20 08:33:40

Harnessing data-driven methods for precise model independent event shape estimation in relativistic heavy-ion collisions
Dipankar Basak, H. Hushnud, Kalyan Dey
arxiv.org/abs/2508.13349

@arXiv_mathOC_bot@mastoxiv.page
2025-05-30 10:14:28

This arxiv.org/abs/2412.06481 has been replaced.
initial toot: mastoxiv.page/@arXiv_mat…

@arXiv_csCR_bot@mastoxiv.page
2025-08-11 08:59:00

Towards Effective Offensive Security LLM Agents: Hyperparameter Tuning, LLM as a Judge, and a Lightweight CTF Benchmark
Minghao Shao, Nanda Rani, Kimberly Milner, Haoran Xi, Meet Udeshi, Saksham Aggarwal, Venkata Sai Charan Putrevu, Sandeep Kumar Shukla, Prashanth Krishnamurthy, Farshad Khorrami, Ramesh Karri, Muhammad Shafique
arx…

@arXiv_csIR_bot@mastoxiv.page
2025-08-15 09:30:13

CrossDenoise: Denoising Implicit Feedback via a Lightweight Entity-Aware Synergistic Framework
Ze Liu, Xianquan Wang, Shuochen Liu, Jie Ma, Huibo Xu, Yupeng Han, Zhe Yang, Kai Zhang, Longfei Li, Jun Zhou
arxiv.org/abs/2508.10851

@arXiv_csLG_bot@mastoxiv.page
2025-06-09 10:11:42

carps: A Framework for Comparing N Hyperparameter Optimizers on M Benchmarks
Carolin Benjamins, Helena Graf, Sarah Segel, Difan Deng, Tim Ruhkopf, Leona Hennig, Soham Basu, Neeratyoy Mallik, Edward Bergman, Deyao Chen, François Clément, Matthias Feurer, Katharina Eggensperger, Frank Hutter, Carola Doerr, Marius Lindauer

@arXiv_statML_bot@mastoxiv.page
2025-08-18 08:29:50

Uniform convergence for Gaussian kernel ridge regression
Paul Dommel, Rajmadan Lakshmanan
arxiv.org/abs/2508.11274 arxiv.org/pdf/2508.11274…

@arXiv_statME_bot@mastoxiv.page
2025-07-08 12:19:30

Predictive posteriors under hidden confounding
Carlos García Meixide, David Ríos Insua
arxiv.org/abs/2507.05170

@arXiv_csNE_bot@mastoxiv.page
2025-08-18 08:29:00

SO-PIFRNN: Self-optimization physics-informed Fourier-features randomized neural network for solving partial differential equations
Jiale Linghu, Weifeng Gao, Hao Dong, Yufeng Nie
arxiv.org/abs/2508.10921

@arXiv_csLG_bot@mastoxiv.page
2025-08-21 10:16:30

Successive Halving with Learning Curve Prediction via Latent Kronecker Gaussian Processes
Jihao Andreas Lin, Nicolas Mayoraz, Steffen Rendle, Dima Kuzmin, Emil Praun, Berivan Isik
arxiv.org/abs/2508.14818

@arXiv_eessSP_bot@mastoxiv.page
2025-08-05 11:03:00

The Role of Review Process Failures in Affective State Estimation: An Empirical Investigation of DEAP Dataset
Nazmun N Khan, Taylor Sweet, Chase A Harvey, Calder Knapp, Dean J. Krusienski, David E Thompson
arxiv.org/abs/2508.02417

@arXiv_csSD_bot@mastoxiv.page
2025-07-04 08:47:11

Posterior Transition Modeling for Unsupervised Diffusion-Based Speech Enhancement
Mostafa Sadeghi (MULTISPEECH), Jean-Eudes Ayilo (MULTISPEECH), Romain Serizel (MULTISPEECH), Xavier Alameda-Pineda (ROBOTLEARN)
arxiv.org/abs/2507.02391

@arXiv_statML_bot@mastoxiv.page
2025-08-18 08:39:50

ADMIRE-BayesOpt: Accelerated Data MIxture RE-weighting for Language Models with Bayesian Optimization
Shengzhuang Chen, Xu Ouyang, Michael Arthur Leopold Pearce, Thomas Hartvigsen, Jonathan Richard Schwarz
arxiv.org/abs/2508.11551

@arXiv_eessSY_bot@mastoxiv.page
2025-07-04 08:28:21

Enhancing Power Flow Estimation with Topology-Aware Gated Graph Neural Networks
Shrenik Jadhav, Birva Sevak, Srijita Das, Wencong Su, Van-Hai Bui
arxiv.org/abs/2507.02078

@arXiv_condmatdisnn_bot@mastoxiv.page
2025-07-11 09:25:31

A statistical physics framework for optimal learning
Francesca Mignacco, Francesco Mori
arxiv.org/abs/2507.07907 arxi…

@arXiv_physicsdataan_bot@mastoxiv.page
2025-08-12 09:19:33

Error Breakdown and Sensitivity Analysis of Dynamical Quantities in Markov State Models
Yehor Tuchkov, Luke Evans, Sonya M. Hanson, Erik H. Thiede
arxiv.org/abs/2508.06735

@arXiv_csCE_bot@mastoxiv.page
2025-07-30 07:33:51

Improving Neural Network Training using Dynamic Learning Rate Schedule for PINNs and Image Classification
D. Veerababu, Ashwin A. Raikar, Prasanta K. Ghosh
arxiv.org/abs/2507.21749

@arXiv_csLG_bot@mastoxiv.page
2025-08-20 10:17:40

AutoScale: Linear Scalarization Guided by Multi-Task Optimization Metrics
Yi Yang, Kei Ikemura, Qingwen Zhang, Xiaomeng Zhu, Ci Li, Nazre Batool, Sina Sharif Mansouri, John Folkesson
arxiv.org/abs/2508.13979

@arXiv_csIR_bot@mastoxiv.page
2025-07-10 09:20:11

SPEAR: Subset-sampled Performance Evaluation via Automated Ground Truth Generation for RAG
Zou Yuheng, Wang Yiran, Tian Yuzhu, Zhu Min, Huang Yanhua
arxiv.org/abs/2507.06554

@arXiv_mathOC_bot@mastoxiv.page
2025-06-19 09:08:07

On the Effectiveness of Classical Regression Methods for Optimal Switching Problems
Martin Andersson, Benny Avelin, Marcus Olofsson
arxiv.org/abs/2506.15436

@arXiv_nuclth_bot@mastoxiv.page
2025-08-06 08:45:40

Machine Learning-Driven High-Precision Model for $\alpha$-Decay Energy and Half-Life Prediction of superheavy nuclei
Qingning Yuan, Panpan Qi, Xuanpen Xiao, Xue Wang, Juan He, Guimei Long, Zhengwei Duan, Yangyan Dai, Runchao Yan, Gongming Yu, Haitao Yang, Qiang Hu
arxiv.org/abs/2508.03155

@arXiv_csGR_bot@mastoxiv.page
2025-06-10 07:38:52

Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free Simulation
Chuhao Chen, Zhiyang Dou, Chen Wang, Yiming Huang, Anjun Chen, Qiao Feng, Jiatao Gu, Lingjie Liu
arxiv.org/abs/2506.06440

@arXiv_eessIV_bot@mastoxiv.page
2025-07-31 08:17:11

trAIce3D: A Prompt-Driven Transformer Based U-Net for Semantic Segmentation of Microglial Cells from Large-Scale 3D Microscopy Images
MohammadAmin Alamalhoda, Arsalan Firoozi, Alessandro Venturino, Sandra Siegert
arxiv.org/abs/2507.22635

@arXiv_csNE_bot@mastoxiv.page
2025-06-11 07:44:43

A Practical Guide to Tuning Spiking Neuronal Dynamics
William Gebhardt, Alexander G. Ororbia, Nathan McDonald, Clare Thiem, Jack Lombardi
arxiv.org/abs/2506.08138

@arXiv_nlincd_bot@mastoxiv.page
2025-07-09 08:31:52

Minimal Deterministic Echo State Networks Outperform Random Reservoirs in Learning Chaotic Dynamics
Francesco Martinuzzi
arxiv.org/abs/2507.06050

@arXiv_csLG_bot@mastoxiv.page
2025-07-09 10:20:22

Improving Robustness of Foundation Models in Domain Adaptation with Soup-Adapters
Marco Roschkowski
arxiv.org/abs/2507.05807

@arXiv_statML_bot@mastoxiv.page
2025-05-30 10:16:22

This arxiv.org/abs/2502.06044 has been replaced.
initial toot: mastoxiv.page/@arXiv_sta…

@arXiv_statAP_bot@mastoxiv.page
2025-08-01 08:12:41

Efficient inference of dynamic gene regulatory networks using discrete penalty
Visweswaran Ravikumar, Aaresh Bhathena, Wajd N Al-Holou, Salar Fattahi, Arvind Rao
arxiv.org/abs/2507.23106

@arXiv_csLG_bot@mastoxiv.page
2025-06-10 19:23:11

This arxiv.org/abs/2506.05673 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_mathOC_bot@mastoxiv.page
2025-06-02 07:27:33

Fine-tuning for Data-enabled Predictive Control of Noisy Systems by Reinforcement Learning
Jinbao Wang, Shiliang Zhang, Jun Liu, Xuehui Ma, Haolin Liu
arxiv.org/abs/2505.24572

@arXiv_csLG_bot@mastoxiv.page
2025-06-05 10:52:58

This arxiv.org/abs/2503.22733 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_mathOC_bot@mastoxiv.page
2025-06-30 12:54:50

Replaced article(s) found for math.OC. arxiv.org/list/math.OC/new
[1/1]:
- Global relaxation-based LP-Newton method for multiple hyperparameter selection in support vector ...
Yaru Qian, Qingna Li, Alain Zemkoho

@arXiv_csLG_bot@mastoxiv.page
2025-06-03 21:31:55

This arxiv.org/abs/2505.00812 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_csLG_bot@mastoxiv.page
2025-07-14 07:56:42

Tree-Structured Parzen Estimator Can Solve Black-Box Combinatorial Optimization More Efficiently
Kenshin Abe, Yunzhuo Wang, Shuhei Watanabe
arxiv.org/abs/2507.08053 arxiv.org/pdf/2507.08053 arxiv.org/html/2507.08053
arXiv:2507.08053v1 Announce Type: new
Abstract: Tree-structured Parzen estimator (TPE) is a versatile hyperparameter optimization (HPO) method supported by popular HPO tools. Since these HPO tools have been developed in line with the trend of deep learning (DL), the problem setups often used in the DL domain have been discussed for TPE such as multi-objective optimization and multi-fidelity optimization. However, the practical applications of HPO are not limited to DL, and black-box combinatorial optimization is actively utilized in some domains, e.g., chemistry and biology. As combinatorial optimization has been an untouched, yet very important, topic in TPE, we propose an efficient combinatorial optimization algorithm for TPE. In this paper, we first generalize the categorical kernel with the numerical kernel in TPE, enabling us to introduce a distance structure to the categorical kernel. Then we discuss modifications for the newly developed kernel to handle a large combinatorial search space. These modifications reduce the time complexity of the kernel calculation with respect to the size of a combinatorial search space. In the experiments using synthetic problems, we verified that our proposed method identifies better solutions with fewer evaluations than the original TPE. Our algorithm is available in Optuna, an open-source framework for HPO.
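
For readers unfamiliar with TPE, the following is a minimal sketch of the classic idea for a single categorical variable, assuming a simple smoothed-count density. It is not the paper's proposed kernel and not Optuna's implementation; the function name `tpe_suggest_categorical` and the parameters `gamma` and `prior_weight` are hypothetical, chosen only for illustration. Past trials are split into a "good" and a "bad" group by loss, a density is estimated for each group, and the next suggestion is the choice with the highest good-to-bad density ratio.

```python
from collections import Counter


def tpe_suggest_categorical(history, choices, gamma=0.25, prior_weight=1.0):
    """Suggest the next categorical choice, TPE-style (illustrative only).

    history: list of (choice, loss) pairs; lower loss is better.
    gamma: fraction of trials treated as "good" (assumed value).
    prior_weight: additive smoothing so unseen choices keep nonzero density.
    """
    # Rank trials by loss and split into good (top gamma fraction) and bad.
    ranked = sorted(history, key=lambda pair: pair[1])
    n_good = max(1, int(gamma * len(ranked)))
    good = Counter(choice for choice, _ in ranked[:n_good])
    bad = Counter(choice for choice, _ in ranked[n_good:])

    def density(counts, choice):
        # Smoothed categorical density estimated from observed counts.
        total = sum(counts.values())
        return (counts[choice] + prior_weight) / (total + prior_weight * len(choices))

    # Pick the choice that maximizes the ratio l(x) / g(x).
    return max(choices, key=lambda c: density(good, c) / density(bad, c))


if __name__ == "__main__":
    # Toy history: "b" tends to give the lowest loss.
    trials = [("a", 3.0), ("b", 1.0), ("c", 2.0), ("a", 3.1),
              ("b", 0.9), ("c", 2.1), ("a", 2.9), ("b", 1.1)]
    print(tpe_suggest_categorical(trials, ["a", "b", "c"]))
```

With this toy history the sketch suggests "b". The smoothed counts treat all categories as unrelated; the abstract describes adding a distance structure between categories, which this sketch does not attempt.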