
2025-09-16 11:04:06
Learning Representations in Video Game Agents with Supervised Contrastive Imitation Learning
Carlos Celemin, Joseph Brennan, Pierluigi Vito Amadori, Tim Bradley
https://arxiv.org/abs/2509.11880
Learning Representations in Video Game Agents with Supervised Contrastive Imitation Learning
Carlos Celemin, Joseph Brennan, Pierluigi Vito Amadori, Tim Bradley
https://arxiv.org/abs/2509.11880
GTA: Supervised-Guided Reinforcement Learning for Text Classification with Large Language Models
Min Zeng, Jinfei Sun, Xueyou Luo, Caiquan Liu, Shiqi Zhang, Li Xie, Xiaoxin Chen
https://arxiv.org/abs/2509.12108
Enhancing Dual Network Based Semi-Supervised Medical Image Segmentation with Uncertainty-Guided Pseudo-Labeling
Yunyao Lu, Yihang Wu, Ahmad Chaddad, Tareef Daqqaq, Reem Kateb
https://arxiv.org/abs/2509.13084
Learning from Uncertain Similarity and Unlabeled Data
Meng Wei, Zhongnian Li, Peng Ying, Xinzheng Xu
https://arxiv.org/abs/2509.11984 https://arxiv.org/pdf…
Data-Efficient Psychiatric Disorder Detection via Self-supervised Learning on Frequency-enhanced Brain Networks
Mujie Liu, Mengchu Zhu, Qichao Dong, Ting Dang, Jiangang Ma, Jing Ren, Feng Xia
https://arxiv.org/abs/2509.10524
Learning kernels with quantum optical circuits
A. Mandilara, A. D. Papadopoulos, D. Syvridis
https://arxiv.org/abs/2509.12072 https://arxiv.org/pdf/2509.12…
Weakly Supervised Vulnerability Localization via Multiple Instance Learning
Wenchao Gu, Yupan Chen, Yanlin Wang, Hongyu Zhang, Cuiyun Gao, Michael R. Lyu
https://arxiv.org/abs/2509.11312
Accurate Trust Evaluation for Effective Operation of Social IoT Systems via Hypergraph-Enabled Self-Supervised Contrastive Learning
Botao Zhu, Xianbin Wang
https://arxiv.org/abs/2509.12240
Learning Majority-to-Minority Transformations with MMD and Triplet Loss for Imbalanced Classification
Suman Cha, Hyunjoong Kim
https://arxiv.org/abs/2509.11511 https://
Online Training and Pruning of Deep Reinforcement Learning Networks
Valentin Frank Ingmar Guenter, Athanasios Sideris
https://arxiv.org/abs/2507.11975 http…
PPL: Point Cloud Supervised Proprioceptive Locomotion Reinforcement Learning for Legged Robots in Crawl Spaces
Bida Ma, Nuo Xu, Chenkun Qi, Xin Liu, Yule Mo, Jinkai Wang, Chunpeng Lu
https://arxiv.org/abs/2508.09950
Adapting Medical Vision Foundation Models for Volumetric Medical Image Segmentation via Active Learning and Selective Semi-supervised Fine-tuning
Jin Yang, Daniel S. Marcus, Aristeidis Sotiras
https://arxiv.org/abs/2509.10784
Self-Supervised Stereo Matching with Multi-Baseline Contrastive Learning
Peng Xu, Zhiyu Xiang, Jingyun Fu, Tianyu Pu, Kai Wang, Chaojie Ji, Tingming Bai, Eryun Liu
https://arxiv.org/abs/2508.10838
Loss Behavior in Supervised Learning with Entangled States
Alexander Mandl, Johanna Barzen, Marvin Bechtold, Frank Leymann, Lavinia Stiliadou
https://arxiv.org/abs/2509.10141 ht…
Unsupervised Deep Equilibrium Model Learning for Large-Scale Channel Estimation with Performance Guarantees
Haotian Tian, Lixiang Lian
https://arxiv.org/abs/2508.10546 https://
Prototypical Contrastive Learning For Improved Few-Shot Audio Classification
Christos Sgouropoulos, Christos Nikou, Stefanos Vlachos, Vasileios Theiou, Christos Foukanelis, Theodoros Giannakopoulos
https://arxiv.org/abs/2509.10074
NeuroStrike: Neuron-Level Attacks on Aligned LLMs
Lichao Wu, Sasha Behrouzi, Mohamadreza Rostami, Maximilian Thang, Stjepan Picek, Ahmad-Reza Sadeghi
https://arxiv.org/abs/2509.11864
DeePAQ: A Perceptual Audio Quality Metric Based On Foundational Models and Weakly Supervised Learning
Guanxin Jiang, Andreas Brendel, Pablo M. Delgado, J\"urgen Herre
https://arxiv.org/abs/2510.12326
Supervised and unsupervised learning with numerical computation for the Wolfram cellular automata
Kui Tuo, Shengfeng Deng, Yuxiang Yang, Yanyang Wang, Qiuping A. Wang, Wei Li, Wenjun Zhang
https://arxiv.org/abs/2509.10209
Self-Supervised Representation Learning with ID-Content Modality Alignment for Sequential Recommendation
Donglin Zhou, Weike Pan, Zhong Ming
https://arxiv.org/abs/2510.10556 htt…
Elemental Frequency-Based Supervised Classification Approach for the Search of Novel Topological Materials
Zodinpuia Ralte, Ramesh Kumar, Mukhtiyar Singh
https://arxiv.org/abs/2509.09978
SSL-AD: Spatiotemporal Self-Supervised Learning for Generalizability and Adaptability Across Alzheimer's Prediction Tasks and Datasets
Emily Kaczmarek, Justin Szeto, Brennan Nichyporuk, Tal Arbel
https://arxiv.org/abs/2509.10453
Towards Robust Artificial Intelligence: Self-Supervised Learning Approach for Out-of-Distribution Detection
Wissam Salhab, Darine Ameyed, Hamid Mcheick, Fehmi Jaafar
https://arxiv.org/abs/2510.12713
Few Shot Semi-Supervised Learning for Abnormal Stop Detection from Sparse GPS Trajectories
Muhammad Ayub Sabir, Junbiao Pang, Jiaqi Wu, Fatima Ashraf
https://arxiv.org/abs/2510.12686
Reasoning Pattern Matters: Learning to Reason without Human Rationales
Chaoxu Pang, Yixuan Cao, Ping Luo
https://arxiv.org/abs/2510.12643 https://arxiv.org…
Self-supervised Learning Of Visual Pose Estimation Without Pose Labels By Classifying LED States
Nicholas Carlotti, Mirko Nava, Alessandro Giusti
https://arxiv.org/abs/2509.10405
Detection of quantum information masking via machine learning
Sheng-Ao Mao, Lin Zhang, Bo Li
https://arxiv.org/abs/2510.12507 https://arxiv.org/pdf/2510.12…
A Systematic Evaluation of Self-Supervised Learning for Label-Efficient Sleep Staging with Wearable EEG
Emilio Estevan, Mar\'ia Sierra-Torralba, Eduardo L\'opez-Larraz, Luis Montesano
https://arxiv.org/abs/2510.07960
Machine Learning for Exoplanet Detection: A Comparative Analysis Using Kepler Data
Reihaneh Karimi, Mahdiyar Mousavi-Sadr, Mohammad H. Zhoolideh Haghighi, Fatemeh S. Tabatabaei
https://arxiv.org/abs/2508.09689
Fast and Simple Multiclass Data Segmentation: An Eigendecomposition and Projection-Free Approach
Chiara Faccio, Margherita Porcelli, Francesco Rinaldi, Martin Stoll
https://arxiv.org/abs/2508.09738
SS-DPPN: A self-supervised dual-path foundation model for the generalizable cardiac audio representation
Ummy Maria Muna, Md Mehedi Hasan Shawon, Md Jobayer, Sumaiya Akter, Md Rakibul Hasan, Md. Golam Rabiul Alam
https://arxiv.org/abs/2510.10719
Identification of Gamma Ray Pulsar Candidates in the \emph{Fermi}-LAT 4FGL-DR4 Unassociated Sources Using Supervised Machine Learning
A. Pathania, K. K. Singh, S. K. Singh, A. Tolamatti, B. B. Singh, K. K. Yadav
https://arxiv.org/abs/2510.08654
PET Head Motion Estimation Using Supervised Deep Learning with Attention
Zhuotong Cai, Tianyi Zeng, Jiazhen Zhang, El\'eonore V. Lieffrig, Kathryn Fontaine, Chenyu You, Enette Mae Revilla, James S. Duncan, Jingmin Xin, Yihuan Lu, John A. Onofrey
https://arxiv.org/abs/2510.12758
BERTector: Intrusion Detection Based on Joint-Dataset Learning
Haoyang Hu, Xun Huang, Chenyu Wu, Shiwen Liu, Zhichao Lian, Shuangquan Zhang
https://arxiv.org/abs/2508.10327 http…
Making Qwen3 Think in Korean with Reinforcement Learning
Jungyup Lee, Jemin Kim, Sang Park, SeungJae Lee
https://arxiv.org/abs/2508.10355 https://arxiv.org…
Replaced article(s) found for cs.LG. https://arxiv.org/list/cs.LG/new
[1/5]:
- A Computational Theory and Semi-Supervised Algorithm for Clustering
Nassir Mohammad
MAE-SAM2: Mask Autoencoder-Enhanced SAM2 for Clinical Retinal Vascular Leakage Segmentation
Xin Xing, Irmak Karaca, Samira Badrloo, Quan Dong Nguyen, Mahadevan Subramaniam
https://arxiv.org/abs/2509.10554
Kernel VICReg for Self-Supervised Learning in Reproducing Kernel Hilbert Space
M. Hadi Sepanj, Benyamin Ghojogh, Paul Fieguth
https://arxiv.org/abs/2509.07289 https://
IF-D: A High-Frequency, General-Purpose Inertial Foundation Dataset for Self-Supervised Learning
Patrick Ferreira, Paula Costa
https://arxiv.org/abs/2510.09539 https://
Value Function Approximation for Nonlinear MPC: Learning a Terminal Cost Function with a Descent Property
T. M. J. T. Baltussen, C. A. Orrico, A. Katriniok, W. P. M. H. Heemels, D. Krishnamoorthy
https://arxiv.org/abs/2508.05804
Anti-Money Laundering Machine Learning Pipelines; A Technical Analysis on Identifying High-risk Bank Clients with Supervised Learning
Khashayar Namdar, Pin-Chien Wang, Tushar Raju, Steven Zheng, Fiona Li, Safwat Tahmin Khan
https://arxiv.org/abs/2509.09127
Learning complexity of many-body quantum sign structures through the lens of Boolean Fourier analysis
Ilya Schurov, Anna Kravchenko, Mikhail I. Katsnelson, Andrey A. Bagrov, Tom Westerhout
https://arxiv.org/abs/2508.09870
Understanding Ice Crystal Habit Diversity with Self-Supervised Learning
Joseph Ko, Hariprasath Govindarajan, Fredrik Lindsten, Vanessa Przybylo, Kara Sulia, Marcus van Lier-Walqui, Kara Lamb
https://arxiv.org/abs/2509.07688
Few-shot Molecular Property Prediction: A Survey
Zeyu Wang, Tianyi Jiang, Huanchang Ma, Yao Lu, Xiaoze Bao, Shanqing Yu, Qi Xuan, Shirui Pan, Xin Zheng
https://arxiv.org/abs/2510.08900
Improving Out-of-Domain Audio Deepfake Detection via Layer Selection and Fusion of SSL-Based Countermeasures
Pierre Serrano, Rapha\"el Duroselle, Florian Angulo, Jean-Fran\c{c}ois Bonastre, Olivier Boeffard
https://arxiv.org/abs/2509.12003
LayerLock: Non-collapsing Representation Learning with Progressive Freezing
Goker Erdogan, Nikhil Parthasarathy, Catalin Ionescu, Drew Hudson, Alexander Lerchner, Andrew Zisserman, Mehdi Sajjadi, Joao Carreira
https://arxiv.org/abs/2509.10156
Data distribution impacts the performance and generalisability of contrastive learning-based foundation models of electrocardiograms
Gul Rukh Khattak, Konstantinos Patlatzoglou, Joseph Barker, Libor Pastika, Boroumand Zeidaabadi, Ahmed El-Medany, Hesham Aggour, Yixiu Liang, Antonio H. Ribeiro, Jeffrey Annis, Antonio Luiz Pinho Ribeiro, Junbo Ge, Daniel B. Kramer, Jonathan W. Waks, Evan Brittain, Nicholas Peters, Fu Siong Ng, Arunashis Sau
Normalization-equivariant Diffusion Models: Learning Posterior Samplers From Noisy And Partial Measurements
Brett Levac, Jon Tamir, Marcelo Pereyra, Julian Tachella
https://arxiv.org/abs/2510.11964
Layer-Wise Analysis of Self-Supervised Representations for Age and Gender Classification in Children's Speech
Abhijit Sinha, Harishankar Kumar, Mohit Joshi, Hemant Kumar Kathania, Shrikanth Narayanan, Sudarsana Reddy Kadiri
https://arxiv.org/abs/2508.10332
SG-XDEAT: Sparsity-Guided Cross-Dimensional and Cross-Encoding Attention with Target-Aware Conditioning in Tabular Learning
Chih-Chuan Cheng, Yi-Ju Tseng
https://arxiv.org/abs/2510.12659
Acoustic Overspecification in Electronic Dance Music Taxonomy
Weilun Xu, Tianhao Dai, Oscar Goudet, Xiaoxuan Wang
https://arxiv.org/abs/2509.11474 https://…
Towards Real-World Rumor Detection: Anomaly Detection Framework with Graph Supervised Contrastive Learning
Chaoqun Cui, Caiyan Jia
https://arxiv.org/abs/2508.07205 https://
Multi Anatomy X-Ray Foundation Model
Nishank Singla, Krisztian Koos, Farzin Haddadpour, Amin Honarmandi Shandiz, Lovish Chum, Xiaojian Xu, Qing Jin, Erhan Bas
https://arxiv.org/abs/2509.12146
PhishSSL: Self-Supervised Contrastive Learning for Phishing Website Detection
Wenhao Li, Selvakumar Manickam, Yung-Wey Chong, Shankar Karuppayah, Priyadarsi Nanda, Binyong Li
https://arxiv.org/abs/2510.05900
Interpretable Generative and Discriminative Learning for Multimodal and Incomplete Clinical Data
Albert Belenguer-Llorens, Carlos Sevilla-Salcedo, Janaina Mourao-Miranda, Vanessa G\'omez-Verdejo
https://arxiv.org/abs/2510.09513
ICDAR 2025 Competition on FEw-Shot Text line segmentation of ancient handwritten documents (FEST)
Silvia Zottin, Axel De Nardin, Giuseppe Branca, Claudio Piciarelli, Gian Luca Foresti
https://arxiv.org/abs/2509.12965
DPO-Tuned Large Language Models for Segmentation in Simultaneous Speech Translation
Zeyu Yang, Satoshi Nakamura
https://arxiv.org/abs/2510.12195 https://ar…
Selection of Layers from Self-supervised Learning Models for Predicting Mean-Opinion-Score of Speech
Xinyu Liang, Fredrik Cumlin, Victor Ungureanu, Chandan K. A. Reddy, Christian Schuldt, Saikat Chatterjee
https://arxiv.org/abs/2508.08962
MAESTRO: Masked AutoEncoders for Multimodal, Multitemporal, and Multispectral Earth Observation Data
Antoine Labatie, Michael Vaccaro, Nina Lardiere, Anatol Garioud, Nicolas Gonthier
https://arxiv.org/abs/2508.10894
Enhancement of Quantum Semi-Supervised Learning via Improved Laplacian and Poisson Methods
Hamed Gholipour, Farid Bozorgnia, Hamzeh Mohammadigheymasi, Kailash Hambarde, Javier Mancilla, Hugo Proenca, Joao Neves, Moharram Challenger
https://arxiv.org/abs/2508.02054
Automatic Music Sample Identification with Multi-Track Contrastive Learning
Alain Riou, Joan Serr\`a, Yuki Mitsufuji
https://arxiv.org/abs/2510.11507 https://
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
Haozhan Li, Yuxin Zuo, Jiale Yu, Yuhao Zhang, Zhaohui Yang, Kaiyan Zhang, Xuekai Zhu, Yuchen Zhang, Tianxing Chen, Ganqu Cui, Dehui Wang, Dingxiang Luo, Yuchen Fan, Youbang Sun, Jia Zeng, Jiangmiao Pang, Shanghang Zhang, Yu Wang, Yao Mu, Bowen Zhou, Ning Ding
https://arxiv.org/a…
An Investigation into the Performance of Non-Contrastive Self-Supervised Learning Methods for Network Intrusion Detection
Hamed Fard, Tobias Schalau, Gerhard Wunder
https://arxiv.org/abs/2510.02349
MammoDINO: Anatomically Aware Self-Supervision for Mammographic Images
Sicheng Zhou, Lei Wu, Cao Xiao, Parminder Bhatia, Taha Kass-Hout
https://arxiv.org/abs/2510.11883 https://…
From <Answer> to <Think>: Multidimensional Supervision of Reasoning Process for LLM Optimization
Beining Wang, Weihang Su, Hongtao Tian, Tao Yang, Yujia Zhou, Ting Yao, Qingyao Ai, Yiqun Liu
https://arxiv.org/abs/2510.11457
Self-supervised Radio Representation Learning: Can we Learn Multiple Tasks?
Ogechukwu Kanu, Ashkan Eshaghbeigi, Hatem Abou-Zeid
https://arxiv.org/abs/2509.03077 https://
Contrastive Self-Supervised Learning at the Edge: An Energy Perspective
Fernanda Fam\'a, Roberto Pereira, Charalampos Kalalas, Paolo Dini, Lorena Qendro, Fahim Kawsar, Mohammad Malekzadeh
https://arxiv.org/abs/2510.08374
Semantic Concentration for Self-Supervised Dense Representations Learning
Peisong Wen, Qianqian Xu, Siran Dai, Runmin Cong, Qingming Huang
https://arxiv.org/abs/2509.09429 https…
Residual-Informed Learning of Solutions to Algebraic Loops
Felix Brandt, Andreas Heuermann, Philip Hannebohm, Bernhard Bachmann
https://arxiv.org/abs/2510.09317 https://
Adapting HFMCA to Graph Data: Self-Supervised Learning for Generalizable fMRI Representations
Jakub Frac, Alexander Schmatz, Qiang Li, Guido Van Wingen, Shujian Yu
https://arxiv.org/abs/2510.05177
VasoMIM: Vascular Anatomy-Aware Masked Image Modeling for Vessel Segmentation
De-Xing Huang, Xiao-Hu Zhou, Mei-Jiang Gui, Xiao-Liang Xie, Shi-Qi Liu, Shuang-Yi Wang, Tian-Yu Xiang, Rui-Ze Ma, Nu-Fang Xiao, Zeng-Guang Hou
https://arxiv.org/abs/2508.10794
Agent Learning via Early Experience
Kai Zhang, Xiangchao Chen, Bo Liu, Tianci Xue, Zeyi Liao, Zhihan Liu, Xiyao Wang, Yuting Ning, Zhaorun Chen, Xiaohan Fu, Jian Xie, Yuxuan Sun, Boyu Gou, Qi Qi, Zihang Meng, Jianwei Yang, Ning Zhang, Xian Li, Ashish Shah, Dat Huynh, Hengduo Li, Zi Yang, Sara Cao, Lawrence Jang, Shuyan Zhou, Jiacheng Zhu, Huan Sun, Jason Weston, Yu Su, Yifan Wu
Unsupervised operator learning approach for dissipative equations via Onsager principle
Zhipeng Chang, Zhenye Wen, Xiaofei Zhao
https://arxiv.org/abs/2508.07440 https://
Uncertainty-aware Cross-training for Semi-supervised Medical Image Segmentation
Kaiwen Huang, Tao Zhou, Huazhu Fu, Yizhe Zhang, Yi Zhou, Xiao-Jun Wu
https://arxiv.org/abs/2508.09014
From scratch to silver: Creating trustworthy training data for patent-SDG classification using Large Language Models
Grazia Sveva Ascione, Nicol\`o Tamagnone
https://arxiv.org/abs/2509.09303
Autonomous Learning From Success and Failure: Goal-Conditioned Supervised Learning with Negative Feedback
Zeqiang Zhang, Fabian Wurzberger, Gerrit Schmid, Sebastian Gottwald, Daniel A. Braun
https://arxiv.org/abs/2509.03206
MoLEx: Mixture of LoRA Experts in Speech Self-Supervised Models for Audio Deepfake Detection
Zihan Pan, Sailor Hardik Bhupendra, Jinyang Wu
https://arxiv.org/abs/2509.09175 http…
SPARSE Data, Rich Results: Few-Shot Semi-Supervised Learning via Class-Conditioned Image Translation
Guido Manni, Clemente Lauretti, Loredana Zollo, Paolo Soda
https://arxiv.org/abs/2508.06429
Exploring Expert Specialization through Unsupervised Training in Sparse Mixture of Experts
Strahinja Nikolic, Ilker Oguz, Demetri Psaltis
https://arxiv.org/abs/2509.10025 https:…
Maximally Useful and Minimally Redundant: The Key to Self Supervised Learning for Imbalanced Data
Yash Kumar Sharma, Vineet Nair, Wilson Naik
https://arxiv.org/abs/2509.08469 ht…
Multitask finetuning and acceleration of chemical pretrained models for small molecule drug property prediction
Matthew Adrian, Yunsie Chung, Kevin Boyd, Saee Paliwal, Srimukh Prasad Veccham, Alan C. Cheng
https://arxiv.org/abs/2510.12719
ALFred: An Active Learning Framework for Real-world Semi-supervised Anomaly Detection with Adaptive Thresholds
Shanle Yao, Ghazal Alinezhad Noghre, Armin Danesh Pazho, Hamed Tabkhi
https://arxiv.org/abs/2508.09058
Hybrid-grained Feature Aggregation with Coarse-to-fine Language Guidance for Self-supervised Monocular Depth Estimation
Wenyao Zhang, Hongsi Liu, Bohan Li, Jiawei He, Zekun Qi, Yunnan Wang, Shengyang Zhao, Xinqiang Yu, Wenjun Zeng, Xin Jin
https://arxiv.org/abs/2510.09320
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/5]:
- Enhancing Representations through Heterogeneous Self-Supervised Learning
Zhong-Yu Li, Bo-Wen Yin, Yongxiang Liu, Li Liu, Ming-Ming Cheng
Bridged Clustering for Representation Learning: Semi-Supervised Sparse Bridging
Patrick Peixuan Ye, Chen Shani, Ellen Vitercik
https://arxiv.org/abs/2510.07182 https://
EVDI : Event-based Video Deblurring and Interpolation via Self-Supervised Learning
Chi Zhang, Xiang Zhang, Chenxu Jiang, Gui-Song Xia, Lei Yu
https://arxiv.org/abs/2509.08260 h…
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
Zhiheng Xi, Jixuan Huang, Chenyang Liao, Baodai Huang, Honglin Guo, Jiaqi Liu, Rui Zheng, Junjie Ye, Jiazheng Zhang, Wenxiang Chen, Wei He, Yiwen Ding, Guanyu Li, Zehui Chen, Zhengyin Du, Xuesong Yao, Yufei Xu, Jiecao Chen, Tao Gui, Zuxuan Wu, Qi Zhang, Xuanjing Huang, Yu-Gang Jiang
Frequency Prior Guided Matching: A Data Augmentation Approach for Generalizable Semi-Supervised Polyp Segmentation
Haoran Xi, Chen Liu, Xiaolin Li
https://arxiv.org/abs/2508.06517
Discriminative Feature Feedback with General Teacher Classes
Omri Bar Oz, Tosca Lechner, Sivan Sabato
https://arxiv.org/abs/2510.07245 https://arxiv.org/pd…
Investigating Location-Regularised Self-Supervised Feature Learning for Seafloor Visual Imagery
Cailei Liang, Adrian Bodenmann, Emma J Curtis, Samuel Simmons, Kazunori Nagano, Stan Brown, Adam Riese, Blair Thornton
https://arxiv.org/abs/2509.06660
When Is Prior Knowledge Helpful? Exploring the Evaluation and Selection of Unsupervised Pretext Tasks from a Neuro-Symbolic Perspective
Lin-Han Jia, Si-Yu Han, Wen-Chao Hu, Jie-Jing Shao, Wen-Da Wei, Zhi Zhou, Lan-Zhe Guo, Yu-Feng Li
https://arxiv.org/abs/2508.07299
SL-SLR: Self-Supervised Representation Learning for Sign Language Recognition
Ariel Basso Madjoukeng, J\'er\^ome Fink, Pierre Poitier, Edith Belise Kenmogne, Benoit Frenay
https://arxiv.org/abs/2509.05188
HINT: Helping Ineffective Rollouts Navigate Towards Effectiveness
Xinyi Wang, Jinyi Han, Zishang Jiang, Tingyun Li, Jiaqing Liang, Sihang Jiang, Zhaoqian Dai, Shuguang Ma, Fei Yu, Yanghua Xiao
https://arxiv.org/abs/2510.09388
MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision
Zhonghao Yan, Muxi Diao, Yuxuan Yang, Jiayuan Xu, Kaizhou Zhang, Ruoyan Jing, Lele Yang, Yanxi Liu, Kongming Liang, Zhanyu Ma
https://arxiv.org/abs/2508.08177
Self-supervised Physics-guided Model with Implicit Representation Regularization for Fast MRI Reconstruction
Jingran Xu, Yuanyuan Liu, Yanjie Zhu
https://arxiv.org/abs/2510.06611
Replaced article(s) found for cs.LG. https://arxiv.org/list/cs.LG/new
[4/5]:
- SCIZOR: A Self-Supervised Approach to Data Curation for Large-Scale Imitation Learning
Yu Zhang, Yuqi Xie, Huihan Liu, Rutav Shah, Michael Wan, Linxi Fan, Yuke Zhu
Equivariant Splitting: Self-supervised learning from incomplete data
Victor Sechaud, J\'er\'emy Scanvic, Quentin Barth\'elemy, Patrice Abry, Juli\'an Tachella
https://arxiv.org/abs/2510.00929
RedDino: A foundation model for red blood cell analysis
Luca Zedda, Andrea Loddo, Cecilia Di Ruberto, Carsten Marr
https://arxiv.org/abs/2508.08180 https://
Resolution scaling governs DINOv3 transfer performance in chest radiograph classification
Soroosh Tayebi Arasteh, Mina Shaigan, Christiane Kuhl, Jakob Nikolas Kather, Sven Nebelung, Daniel Truhn
https://arxiv.org/abs/2510.07191
Test-Time Reinforcement Learning for GUI Grounding via Region Consistency
Yong Du, Yuchen Yan, Fei Tang, Zhengxi Lu, Chang Zong, Weiming Lu, Shengpei Jiang, Yongliang Shen
https://arxiv.org/abs/2508.05615