Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csLG_bot@mastoxiv.page
2025-09-15 09:48:01

FedBiF: Communication-Efficient Federated Learning via Bits Freezing
Shiwei Li, Qunwei Li, Haozhao Wang, Ruixuan Li, Jianbin Lin, Wenliang Zhong
arxiv.org/abs/2509.10161

@arXiv_csRO_bot@mastoxiv.page
2025-10-15 10:12:01

Residual MPC: Blending Reinforcement Learning with GPU-Parallelized Model Predictive Control
Se Hwan Jeon, Ho Jae Lee, Seungwoo Hong, Sangbae Kim
arxiv.org/abs/2510.12717

@arXiv_csCV_bot@mastoxiv.page
2025-10-15 10:54:31

ViCO: A Training Strategy towards Semantic Aware Dynamic High-Resolution
Long Cui, Weiyun Wang, Jie Shao, Zichen Wen, Gen Luo, Linfeng Zhang, Yanting Zhang, Yu Qiao, Wenhai Wang
arxiv.org/abs/2510.12793

@arXiv_csSD_bot@mastoxiv.page
2025-09-15 08:21:21

DiTReducio: A Training-Free Acceleration for DiT-Based TTS via Progressive Calibration
Yanru Huo, Ziyue Jiang, Zuoli Tang, Qingyang Hong, Zhou Zhao
arxiv.org/abs/2509.09748

@arXiv_csDC_bot@mastoxiv.page
2025-10-14 09:05:28

FLAMMABLE: A Multi-Model Federated Learning Framework with Multi-Model Engagement and Adaptive Batch Sizes
Shouxu Lin, Zimeng Pan, Yuhang Yao, Haeyoung Noh, Pei Zhang, Carlee Joe-Wong
arxiv.org/abs/2510.10380

@arXiv_csAI_bot@mastoxiv.page
2025-09-16 08:29:56

LLM Enhancement with Domain Expert Mental Model to Reduce LLM Hallucination with Causal Prompt Engineering
Boris Kovalerchuk, Brent D. Fegley
arxiv.org/abs/2509.10818

@arXiv_csCL_bot@mastoxiv.page
2025-10-13 10:32:00

Logit Arithmetic Elicits Long Reasoning Capabilities Without Training
Yunxiang Zhang, Muhammad Khalifa, Lechen Zhang, Xin Liu, Ayoung Lee, Xinliang Frederick Zhang, Farima Fatahi Bayat, Lu Wang
arxiv.org/abs/2510.09354

@arXiv_csIR_bot@mastoxiv.page
2025-10-14 11:13:28

Next Interest Flow: A Generative Pre-training Paradigm for Recommender Systems by Modeling All-domain Movelines
Chen Gao, Zixin Zhao, Lv Shao, Tong Liu
arxiv.org/abs/2510.11317

@arXiv_csLG_bot@mastoxiv.page
2025-10-15 10:46:41

Laminar: A Scalable Asynchronous RL Post-Training Framework
Guangming Sheng, Yuxuan Tong, Borui Wan, Wang Zhang, Chaobo Jia, Xibin Wu, Yuqi Wu, Xiang Li, Chi Zhang, Yanghua Peng, Haibin Lin, Xin Liu, Chuan Wu
arxiv.org/abs/2510.12633

@arXiv_physicsaoph_bot@mastoxiv.page
2025-09-16 08:49:36

How does an AI Weather Model Learn to Forecast Extreme Weather?
Rebecca Baiman, Elizabeth A. Barnes, Ankur Mahesh
arxiv.org/abs/2509.10639

@arXiv_csCV_bot@mastoxiv.page
2025-10-15 07:59:31

Task-Specific Dual-Model Framework for Comprehensive Traffic Safety Video Description and Analysis
Blessing Agyei Kyem, Neema Jakisa Owor, Andrews Danyo, Joshua Kofi Asamoah, Eugene Denteh, Tanner Muturi, Anthony Dontoh, Yaw Adu-Gyamfi, Armstrong Aboah
arxiv.org/abs/2510.11907

@arXiv_statME_bot@mastoxiv.page
2025-09-15 09:15:01

A sampling method based on highest density regions: Applications to surrogate models for rare events estimation
Jocelyn Minini, Micha Wasem
arxiv.org/abs/2509.10149

@arXiv_csCR_bot@mastoxiv.page
2025-10-14 11:53:08

SASER: Stego attacks on open-source LLMs
Ming Tan, Wei Li, Hu Tao, Hailong Ma, Aodi Liu, Qian Chen, Zilong Wang
arxiv.org/abs/2510.10486 ar…

@arXiv_statML_bot@mastoxiv.page
2025-10-15 10:16:51

Universal Adaptive Environment Discovery
Madi Matymov, Ba-Hien Tran, Maurizio Filippone
arxiv.org/abs/2510.12547 arxiv.org/pdf/2510.12547…

@arXiv_quantph_bot@mastoxiv.page
2025-09-15 09:44:51

Loss Behavior in Supervised Learning with Entangled States
Alexander Mandl, Johanna Barzen, Marvin Bechtold, Frank Leymann, Lavinia Stiliadou
arxiv.org/abs/2509.10141

@arXiv_eessIV_bot@mastoxiv.page
2025-09-15 08:25:31

Soft Tissue Simulation and Force Estimation from Heterogeneous Structures using Equivariant Graph Neural Networks
Madina Kojanazarova, Sidady El Hadramy, Jack Wilkie, Georg Rauter, Philippe C. Cattin
arxiv.org/abs/2509.10125

@arXiv_csCV_bot@mastoxiv.page
2025-10-14 13:47:38

DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training
Haoran Feng, Dizhe Zhang, Xiangtai Li, Bo Du, Lu Qi
arxiv.org/abs/2510.11712

@arXiv_csLG_bot@mastoxiv.page
2025-10-14 13:41:38

Representation-Based Exploration for Language Models: From Test-Time to Post-Training
Jens Tuyls, Dylan J. Foster, Akshay Krishnamurthy, Jordan T. Ash
arxiv.org/abs/2510.11686

@arXiv_csIR_bot@mastoxiv.page
2025-10-15 09:56:41

SAIL-Embedding Technical Report: Omni-modal Embedding Foundation Model
Lin Lin, Jiefeng Long, Zhihe Wan, Yuchi Wang, Dingkang Yang, Shuang Yang, Yueyang Yao, Xu Chen, Zirui Guo, Shengqiang Li, Weiran Li, Hanyu Li, Yaling Mou, Yan Qiu, Haiyang Yu, Xiao Liang, Hongsheng Li, Chao Feng
arxiv.org/abs/2510.12709

@arXiv_csNI_bot@mastoxiv.page
2025-09-15 07:51:11

Taming Volatility: Stable and Private QUIC Classification with Federated Learning
Richard Jozsa, Karel Hynek, Adrian Pekar
arxiv.org/abs/2509.09997

@arXiv_eessAS_bot@mastoxiv.page
2025-09-15 08:19:11

Spectral Bottleneck in Deep Neural Networks: Noise is All You Need
Hemanth Chandravamsi, Dhanush V. Shenoy, Itay Zinn, Shimon Pisnoy, Steven H. Frankel
arxiv.org/abs/2509.09719

@arXiv_mathDS_bot@mastoxiv.page
2025-10-13 08:17:40

Architecture Induces Structural Invariant Manifolds of Neural Network Training Dynamics
Jiajie Zhao, Tao Luo, Yaoyu Zhang
arxiv.org/abs/2510.09564

@arXiv_csCL_bot@mastoxiv.page
2025-09-15 09:59:01

Is In-Context Learning Learning?
Adrian de Wynter
arxiv.org/abs/2509.10414 arxiv.org/pdf/2509.10414

@arXiv_csAI_bot@mastoxiv.page
2025-10-15 08:48:32

AI Agents as Universal Task Solvers
Alessandro Achille, Stefano Soatto
arxiv.org/abs/2510.12066 arxiv.org/pdf/2510.12066

@arXiv_csRO_bot@mastoxiv.page
2025-09-15 09:37:51

Self-supervised Learning Of Visual Pose Estimation Without Pose Labels By Classifying LED States
Nicholas Carlotti, Mirko Nava, Alessandro Giusti
arxiv.org/abs/2509.10405

@Techmeme@techhub.social
2025-09-06 00:01:42

OpenAI is merging its Model Behavior team with its Post Training group to bring the work of the Model Behavior team closer to core model development (Maxwell Zeff/TechCrunch)
techcrunch.com/2025/09/05/open

@arXiv_csLG_bot@mastoxiv.page
2025-09-15 09:42:11

Exploring Expert Specialization through Unsupervised Training in Sparse Mixture of Experts
Strahinja Nikolic, Ilker Oguz, Demetri Psaltis
arxiv.org/abs/2509.10025

@arXiv_csCV_bot@mastoxiv.page
2025-10-14 13:44:48

How many samples to label for an application given a foundation model? Chest X-ray classification study
Nikolay Nechaev, Evgenia Przhezdzetskaya, Viktor Gombolevskiy, Dmitry Umerenkov, Dmitry Dylov
arxiv.org/abs/2510.11553

@arXiv_mathNA_bot@mastoxiv.page
2025-10-13 08:11:10

Augmented data and neural networks for robust epidemic forecasting: application to COVID-19 in Italy
Giacomo Dimarco, Federica Ferrarese, Lorenzo Pareschi
arxiv.org/abs/2510.09192

@arXiv_csSD_bot@mastoxiv.page
2025-09-15 08:46:01

Improving Audio Event Recognition with Consistency Regularization
Shanmuka Sadhu, Weiran Wang
arxiv.org/abs/2509.10391 arxiv.org/pdf/2509.1…

@arXiv_statME_bot@mastoxiv.page
2025-10-14 11:02:28

Iterative Data Curation with Theoretical Guarantees
V\"ain\"o Yrj\"an\"ainen Johan Jonasson, M{\aa}ns Magnusson
arxiv.org/abs/2510.11428

@arXiv_csGT_bot@mastoxiv.page
2025-10-13 08:46:10

Measuring the Hidden Cost of Data Valuation through Collective Disclosure
Patrick Mesana, Gilles Caporossi, Sebastien Gambs
arxiv.org/abs/2510.08869

@arXiv_econTH_bot@mastoxiv.page
2025-10-14 08:30:48

Token is All You Price
Weijie Zhong
arxiv.org/abs/2510.09859 arxiv.org/pdf/2510.09859

@arXiv_csCL_bot@mastoxiv.page
2025-10-15 10:41:01

VISaGE: Understanding Visual Generics and Exceptions
Stella Frank, Emily Allaway
arxiv.org/abs/2510.12548 arxiv.org/pdf/2510.12548

@arXiv_csCV_bot@mastoxiv.page
2025-10-15 10:46:21

LayerSync: Self-aligning Intermediate Layers
Yasaman Haghighi, Bastien van Delft, Mariam Hassan, Alexandre Alahi
arxiv.org/abs/2510.12581 a…

@arXiv_csAR_bot@mastoxiv.page
2025-10-09 07:51:00

Cocoon: A System Architecture for Differentially Private Training with Correlated Noises
Donghwan Kim, Xin Gu, Jinho Baek, Timothy Lo, Younghoon Min, Kwangsik Shin, Jongryool Kim, Jongse Park, Kiwan Maeng
arxiv.org/abs/2510.07304

@arXiv_csAI_bot@mastoxiv.page
2025-09-12 07:31:39

ForTIFAI: Fending Off Recursive Training Induced Failure for AI Models
Soheil Zibakhsh Shabgahi, Pedram Aghazadeh, Azalia Mirhosseini, Farinaz Koushanfar
arxiv.org/abs/2509.08972

@arXiv_qbioOT_bot@mastoxiv.page
2025-09-15 08:29:21

Standards in the Preparation of Biomedical Research Metadata: A Bridge2AI Perspective
Harry Caufield, Satrajit Ghosh, Sek Wong Kong, Jillian Parker, Nathan Sheffield, Bhavesh Patel, Andrew Williams, Timothy Clark, Monica C. Munoz-Torres
arxiv.org/abs/2509.10432

@arXiv_csCL_bot@mastoxiv.page
2025-10-15 14:19:02

Replaced article(s) found for cs.CL. arxiv.org/list/cs.CL/new
[1/5]:
- MLRIP: Pre-training a military language representation model with informative factual knowledge a...
Hui Li, Xuekang Yang

@arXiv_quantph_bot@mastoxiv.page
2025-10-10 11:18:09

Universality and kernel-adaptive training for classically trained, quantum-deployed generative models
Andrii Kurkin, Kevin Shen, Susanne Pielawa, Hao Wang, Vedran Dunjko
arxiv.org/abs/2510.08476

@arXiv_csCR_bot@mastoxiv.page
2025-09-09 12:06:22

Imitative Membership Inference Attack
Yuntao Du, Yuetian Chen, Hanshen Xiao, Bruno Ribeiro, Ninghui Li
arxiv.org/abs/2509.06796 arxiv.org/p…

@arXiv_statML_bot@mastoxiv.page
2025-10-10 08:29:48

A Honest Cross-Validation Estimator for Prediction Performance
Tianyu Pan, Vincent Z. Yu, Viswanath Devanarayan, Lu Tian
arxiv.org/abs/2510.07649

@arXiv_csLG_bot@mastoxiv.page
2025-09-15 09:44:01

FedRP: A Communication-Efficient Approach for Differentially Private Federated Learning Using Random Projection
Mohammad Hasan Narimani, Mostafa Tavassolipour
arxiv.org/abs/2509.10041

@arXiv_csRO_bot@mastoxiv.page
2025-09-12 08:52:59

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
Yihao Wang, Pengxiang Ding, Lingxiao Li, Can Cui, Zirui Ge, Xinyang Tong, Wenxuan Song, Han Zhao, Wei Zhao, Pengxu Hou, Siteng Huang, Yifan Tang, Wenhui Wang, Ru Zhang, Jianyi Liu, Donglin Wang
arxiv.org/abs/2509.09372

@arXiv_eessIV_bot@mastoxiv.page
2025-09-11 09:24:23

CNN-ViT Hybrid for Pneumonia Detection: Theory and Empiric on Limited Data without Pretraining
Prashant Singh Basnet, Roshan Chitrakar
arxiv.org/abs/2509.08586

@arXiv_csSD_bot@mastoxiv.page
2025-10-14 09:30:58

Improving Speech Emotion Recognition with Mutual Information Regularized Generative Model
Chung-Soo Ahn, Rajib Rana, Sunil Sivadas, Carlos Busso, Jagath C. Rajapakse
arxiv.org/abs/2510.10078

@arXiv_csCL_bot@mastoxiv.page
2025-09-15 09:54:21

Scaling Arabic Medical Chatbots Using Synthetic Data: Enhancing Generative AI with Synthetic Patient Records
Abdulrahman Allam, Seif Ahmed, Ali Hamdi, Khaled Shaban
arxiv.org/abs/2509.10108

@Techmeme@techhub.social
2025-10-01 11:25:55

Peloton unveils its Cross Training Series, including a $1,695 Bike and $6,695 Tread Plus, and AI-based Peloton IQ to track workouts on new and old machines (Victoria Song/The Verge)
theverge.com/tech/789282/pelot

@arXiv_csLG_bot@mastoxiv.page
2025-10-14 13:40:08

Diffusion-DFL: Decision-focused Diffusion Models for Stochastic Optimization
Zihao Zhao, Christopher Yeh, Lingkai Kong, Kai Wang
arxiv.org/abs/2510.11590

@arXiv_csCV_bot@mastoxiv.page
2025-10-15 10:54:51

DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
Yingyan Li, Shuyao Shang, Weisong Liu, Bing Zhan, Haochen Wang, Yuqi Wang, Yuntao Chen, Xiaoman Wang, Yasong An, Chufeng Tang, Lu Hou, Lue Fan, Zhaoxiang Zhang
arxiv.org/abs/2510.12796

@arXiv_csLG_bot@mastoxiv.page
2025-10-15 08:21:22

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving
Ruida Wang, Jiarui Yao, Rui Pan, Shizhe Diao, Tong Zhang
arxiv.org/abs/2510.11769

@arXiv_csSD_bot@mastoxiv.page
2025-10-13 08:32:30

VM-UNSSOR: Unsupervised Neural Speech Separation Enhanced by Higher-SNR Virtual Microphone Arrays
Shulin He, Zhong-Qiu Wang
arxiv.org/abs/2510.08914

@arXiv_csCV_bot@mastoxiv.page
2025-09-10 10:43:41

Active Membership Inference Test (aMINT): Enhancing Model Auditability with Multi-Task Learning
Daniel DeAlcala, Aythami Morales, Julian Fierrez, Gonzalo Mancera, Ruben Tolosana, Javier Ortega-Garcia
arxiv.org/abs/2509.07879

@arXiv_csAI_bot@mastoxiv.page
2025-10-13 10:02:20

Localist LLMs -- A Mathematical Framework for Dynamic Locality Control
Joachim Diederich
arxiv.org/abs/2510.09338 arxiv.org/pdf/2510.09338

@arXiv_csCR_bot@mastoxiv.page
2025-09-11 09:39:33

Prototype-Guided Robust Learning against Backdoor Attacks
Wei Guo, Maura Pintor, Ambra Demontis, Battista Biggio
arxiv.org/abs/2509.08748 a…

@Techmeme@techhub.social
2025-10-09 17:40:43

A study finds that as few as 250 malicious documents can produce a "backdoor" vulnerability in an LLM, regardless of model size or training data volume (Anthropic)
anthropic.com/research/small-s

@arXiv_csRO_bot@mastoxiv.page
2025-10-13 10:14:30

Zero-shot Structure Learning and Planning for Autonomous Robot Navigation using Active Inference
Daria de tinguy, Tim Verbelen, Emilio Gamba, Bart Dhoedt
arxiv.org/abs/2510.09574

@arXiv_eessIV_bot@mastoxiv.page
2025-09-12 08:06:29

Generalized User-Oriented Image Semantic Coding Empowered by Large Vision-Language Model
Sin-Yu Huang, Vincent W. S. Wong
arxiv.org/abs/2509.08913

@arXiv_csLG_bot@mastoxiv.page
2025-10-08 11:00:59

Training Dynamics Impact Post-Training Quantization Robustness
Albert Catalan-Tatjer, Niccol\`o Ajroldi, Jonas Geiping
arxiv.org/abs/2510.06213

@arXiv_csCL_bot@mastoxiv.page
2025-09-12 09:37:59

Improving Synthetic Data Training for Contextual Biasing Models with a Keyword-Aware Cost Function
Chin Yuen Kwok, Jia Qi Yip, Eng Siong Chng
arxiv.org/abs/2509.09197

@arXiv_csCV_bot@mastoxiv.page
2025-09-12 10:09:09

Exploring Pre-training Across Domains for Few-Shot Surgical Skill Assessment
Dimitrios Anastasiou, Razvan Caramalau, Nazir Sirajudeen, Matthew Boal, Philip Edwards, Justin Collins, John Kelly, Ashwin Sridhar, Maxine Tran, Faiz Mumtaz, Nevil Pavithran, Nader Francis, Danail Stoyanov, Evangelos B. Mazomenos
arxiv.org/abs/2509.09327…

@arXiv_csRO_bot@mastoxiv.page
2025-10-10 09:44:29

GM3: A General Physical Model for Micro-Mobility Vehicles
Grace Cai, Nithin Parepally, Laura Zheng, Ming C. Lin
arxiv.org/abs/2510.07807 ar…

@arXiv_csCL_bot@mastoxiv.page
2025-09-09 11:58:42

SLiNT: Structure-aware Language Model with Injection and Contrastive Training for Knowledge Graph Completion
Mengxue Yang, Chun Yang, Jiaqi Zhu, Jiafan Li, Jingqi Zhang, Yuyang Li, Ying Li
arxiv.org/abs/2509.06531

@arXiv_csLG_bot@mastoxiv.page
2025-09-12 09:22:29

Open-sci-ref-0.01: open and reproducible reference baselines for language model and dataset comparison
Marianna Nezhurina, Taishi Nakamura, Timur Carstensen, Niccol\`o Ajroldi, Ville Komulainen, David Salinas, Jenia Jitsev
arxiv.org/abs/2509.09009

@arXiv_csCR_bot@mastoxiv.page
2025-09-11 09:16:43

DSFL: A Dual-Server Byzantine-Resilient Federated Learning Framework via Group-Based Secure Aggregation
Charuka Herath, Yogachandran Rahulamathavan, Varuna De Silva, Sangarapillai Lambotharan
arxiv.org/abs/2509.08449

@arXiv_csAI_bot@mastoxiv.page
2025-10-13 09:22:20

RADAR: Mechanistic Pathways for Detecting Data Contamination in LLM Evaluation
Ashish Kattamuri, Harshwardhan Fartale, Arpita Vats, Rahul Raja, Ishita Prasad
arxiv.org/abs/2510.08931

@arXiv_csCL_bot@mastoxiv.page
2025-10-13 10:30:00

Verifying Chain-of-Thought Reasoning via Its Computational Graph
Zheng Zhao, Yeskendir Koishekenov, Xianjun Yang, Naila Murray, Nicola Cancedda
arxiv.org/abs/2510.09312

@arXiv_csLG_bot@mastoxiv.page
2025-10-13 10:42:40

CHUCKLE -- When Humans Teach AI To Learn Emotions The Easy Way
Ankush Pratap Singh, Houwei Cao, Yong Liu
arxiv.org/abs/2510.09382 arxiv.org…

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 10:30:10

Boosting Multi-modal Keyphrase Prediction with Dynamic Chain-of-Thought in Vision-Language Models
Qihang Ma, Shengyu Li, Jie Tang, Dingkang Yang, Shaodong Chen, Yingyi Zhang, Chao Feng, Jiao Ran
arxiv.org/abs/2510.09358

@arXiv_csCL_bot@mastoxiv.page
2025-10-09 10:27:51

Mid-Training of Large Language Models: A Survey
Kaixiang Mo, Yuxin Shi, Weiwei Weng, Zhiqiang Zhou, Shuman Liu, Haibo Zhang, Anxiang Zeng
arxiv.org/abs/2510.06826

@arXiv_csLG_bot@mastoxiv.page
2025-10-08 10:54:49

Out-of-Distribution Detection from Small Training Sets using Bayesian Neural Network Classifiers
Kevin Raina, Tanya Schmah
arxiv.org/abs/2510.06025

@arXiv_csCL_bot@mastoxiv.page
2025-09-10 10:04:21

AIxcellent Vibes at GermEval 2025 Shared Task on Candy Speech Detection: Improving Model Performance by Span-Level Training
Christian Rene Thelen, Patrick Gustav Blaneck, Tobias Bornheim, Niklas Grieger, Stephan Bialonski
arxiv.org/abs/2509.07459

@arXiv_csLG_bot@mastoxiv.page
2025-10-13 10:43:00

HINT: Helping Ineffective Rollouts Navigate Towards Effectiveness
Xinyi Wang, Jinyi Han, Zishang Jiang, Tingyun Li, Jiaqing Liang, Sihang Jiang, Zhaoqian Dai, Shuguang Ma, Fei Yu, Yanghua Xiao
arxiv.org/abs/2510.09388

@arXiv_csLG_bot@mastoxiv.page
2025-10-13 10:41:10

Safety Game: Balancing Safe and Informative Conversations with Blackbox Agentic AI using LP Solvers
Tuan Nguyen, Long Tran-Thanh
arxiv.org/abs/2510.09330

@arXiv_csCV_bot@mastoxiv.page
2025-09-09 12:30:52

BIR-Adapter: A Low-Complexity Diffusion Model Adapter for Blind Image Restoration
Cem Eteke, Alexander Griessel, Wolfgang Kellerer, Eckehard Steinbach
arxiv.org/abs/2509.06904

@arXiv_csLG_bot@mastoxiv.page
2025-10-13 10:44:40

On Uniformly Scaling Flows: A Density-Aligned Approach to Deep One-Class Classification
Faried Abu Zaid, Tim Katzke, Emmanuel M\"uller, Daniel Neider
arxiv.org/abs/2510.09452

@arXiv_csLG_bot@mastoxiv.page
2025-10-13 10:46:00

Geo-Aware Models for Stream Temperature Prediction across Different Spatial Regions and Scales
Shiyuan Luo, Runlong Yu, Shengyu Chen, Yingda Fan, Yiqun Xie, Yanhua Li, Xiaowei Jia
arxiv.org/abs/2510.09500

@arXiv_csCL_bot@mastoxiv.page
2025-09-10 09:53:51

Does This Look Familiar to You? Knowledge Analysis via Model Internal Representations
Sihyun Park
arxiv.org/abs/2509.07311 arxiv.org/pdf/25…

@arXiv_csCL_bot@mastoxiv.page
2025-10-10 10:55:19

Training-Free Group Relative Policy Optimization
Yuzheng Cai, Siqi Cai, Yuchen Shi, Zihan Xu, Lichao Chen, Yulei Qin, Xiaoyu Tan, Gang Li, Zongyi Li, Haojia Lin, Yong Mao, Ke Li, Xing Sun
arxiv.org/abs/2510.08191

@arXiv_csLG_bot@mastoxiv.page
2025-10-09 10:49:51

DPMM-CFL: Clustered Federated Learning via Dirichlet Process Mixture Model Nonparametric Clustering
Mariona Jaramillo-Civill, Peng Wu, Pau Closas
arxiv.org/abs/2510.07132

@arXiv_csLG_bot@mastoxiv.page
2025-09-05 10:29:11

Towards a Unified View of Large Language Model Post-Training
Xingtai Lv, Yuxin Zuo, Youbang Sun, Hongyi Liu, Yuntian Wei, Zhekai Chen, Lixuan He, Xuekai Zhu, Kaiyan Zhang, Bingning Wang, Ning Ding, Bowen Zhou
arxiv.org/abs/2509.04419

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 11:02:19

Hyperspectral data augmentation with transformer-based diffusion models
Mattia Ferrari, Lorenzo Bruzzone
arxiv.org/abs/2510.08363 arxiv.org…

@arXiv_csCL_bot@mastoxiv.page
2025-09-10 10:21:11

BALI: Enhancing Biomedical Language Representations through Knowledge Graph and Language Model Alignment
Andrey Sakhovskiy, Elena Tutubalina
arxiv.org/abs/2509.07588

@arXiv_csLG_bot@mastoxiv.page
2025-10-10 10:58:19

Long-tailed Recognition with Model Rebalancing
Jiaan Luo, Feng Hong, Qiang Hu, Xiaofeng Cao, Feng Liu, Jiangchao Yao
arxiv.org/abs/2510.08177

@arXiv_csLG_bot@mastoxiv.page
2025-10-09 10:40:41

Sharpness-Aware Data Generation for Zero-shot Quantization
Dung Hoang-Anh, Cuong Pham Trung Le, Jianfei Cai, Thanh-Toan Do
arxiv.org/abs/2510.07018

@arXiv_csCL_bot@mastoxiv.page
2025-09-08 10:11:50

Knowledge Collapse in LLMs: When Fluency Survives but Facts Fail under Recursive Synthetic Training
Figarri Keisha, Zekun Wu, Ze Wang, Adriano Koshiyama, Philip Treleaven
arxiv.org/abs/2509.04796

@arXiv_csCV_bot@mastoxiv.page
2025-09-08 09:40:40

SynGen-Vision: Synthetic Data Generation for training industrial vision models
Alpana Dubey, Suma Mani Kuriakose, Nitish Bhardwaj
arxiv.org/abs/2509.04894

@arXiv_csCL_bot@mastoxiv.page
2025-09-11 07:41:33

AntiDote: Bi-level Adversarial Training for Tamper-Resistant LLMs
Debdeep Sanyal, Manodeep Ray, Murari Mandal
arxiv.org/abs/2509.08000 arxi…

@arXiv_csCV_bot@mastoxiv.page
2025-09-08 09:08:30

Enhancing Self-Driving Segmentation in Adverse Weather Conditions: A Dual Uncertainty-Aware Training Approach to SAM Optimization
Dharsan Ravindran, Kevin Wang, Zhuoyuan Cao, Saleh Abdelrahman, Jeffery Wu
arxiv.org/abs/2509.04735

@arXiv_csLG_bot@mastoxiv.page
2025-09-11 10:11:03

Heart Disease Prediction: A Comparative Study of Optimisers Performance in Deep Neural Networks
Chisom Chibuike, Adeyinka Ogunsanya
arxiv.org/abs/2509.08499

@arXiv_csLG_bot@mastoxiv.page
2025-10-09 10:41:51

Unified Molecule Pre-training with Flexible 2D and 3D Modalities: Single and Paired Modality Integration
Tengwei Song, Min Wu, Yuan Fang
arxiv.org/abs/2510.07035

@arXiv_csCL_bot@mastoxiv.page
2025-10-09 10:21:51

AWM: Accurate Weight-Matrix Fingerprint for Large Language Models
Boyi Zeng, Lin Chen, Ziwei He, Xinbing Wang, Zhouhan Lin
arxiv.org/abs/2510.06738

@arXiv_csLG_bot@mastoxiv.page
2025-10-09 10:36:51

Fisher Information, Training and Bias in Fourier Regression Models
Lorenzo Pastori, Veronika Eyring, Mierk Schwabe
arxiv.org/abs/2510.06945

@arXiv_csCL_bot@mastoxiv.page
2025-09-09 12:08:52

The Majority is not always right: RL training for solution aggregation
Wenting Zhao, Pranjal Aggarwal, Swarnadeep Saha, Asli Celikyilmaz, Jason Weston, Ilia Kulikov
arxiv.org/abs/2509.06870

@arXiv_csLG_bot@mastoxiv.page
2025-09-04 10:32:21

DPQuant: Efficient and Differentially-Private Model Training via Dynamic Quantization Scheduling
Yubo Gao, Renbo Tu, Gennady Pekhimenko, Nandita Vijaykumar
arxiv.org/abs/2509.03472

@arXiv_csLG_bot@mastoxiv.page
2025-10-07 13:07:12

Boomerang Distillation Enables Zero-Shot Model Size Interpolation
Sara Kangaslahti, Nihal V. Nayak, Jonathan Geuter, Marco Fumero, Francesco Locatello, David Alvarez-Melis
arxiv.org/abs/2510.05064

@arXiv_csLG_bot@mastoxiv.page
2025-09-04 10:33:21

Warming Up for Zeroth-Order Federated Pre-Training with Low Resource Clients
Gwen Legate, Irina Rish, Eugene Belilovsky
arxiv.org/abs/2509.03503

@arXiv_csLG_bot@mastoxiv.page
2025-10-08 10:26:09

Correlating Cross-Iteration Noise for DP-SGD using Model Curvature
Xin Gu, Yingtai Xiao, Guanlin He, Jiamu Bai, Daniel Kifer, Kiwan Maeng
arxiv.org/abs/2510.05416

@arXiv_csLG_bot@mastoxiv.page
2025-09-10 10:38:21

K2-Think: A Parameter-Efficient Reasoning System
Zhoujun Cheng, Richard Fan, Shibo Hao, Taylor W. Killian, Haonan Li, Suqi Sun, Hector Ren, Alexander Moreno, Daqian Zhang, Tianjun Zhong, Yuxin Xiong, Yuanzhe Hu, Yutao Xie, Xudong Han, Yuqi Wang, Varad Pimpalkhute, Yonghao Zhuang, Aaryamonvikram Singh, Xuezhi Liang, Anze Xie, Jianshu She, Desai Fan, Chengqian Gao, Liqun Ma, Mikhail Yurochkin, John Maggs, Xuezhe Ma, Guowei He, Zhiting Hu, Zhengzhong Liu, Eric P. Xing

@arXiv_csLG_bot@mastoxiv.page
2025-10-06 10:27:09

Estimation of Resistance Training RPE using Inertial Sensors and Electromyography
James Thomas, Johan Walhstr\"om
arxiv.org/abs/2510.03197