Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csRO_bot@mastoxiv.page
2025-09-18 09:56:31

Dual-Actor Fine-Tuning of VLA Models: A Talk-and-Tweak Human-in-the-Loop Approach
Piaopiao Jin, Qi Wang, Guokang Sun, Ziwen Cai, Pinjia He, Yangwei You
arxiv.org/abs/2509.13774

@arXiv_csCL_bot@mastoxiv.page
2025-09-18 08:12:31

Latent Traits and Cross-Task Transfer: Deconstructing Dataset Interactions in LLM Fine-tuning
Shambhavi Krishna, Atharva Naik, Chaitali Agarwal, Sudharshan Govindan, Taesung Lee, Haw-Shiuan Chang
arxiv.org/abs/2509.13624

@arXiv_csCR_bot@mastoxiv.page
2025-10-14 11:46:48

Pharmacist: Safety Alignment Data Curation for Large Language Models against Harmful Fine-tuning
Guozhi Liu, Qi Mu, Tiansheng Huang, Xinhua Wang, Li Shen, Weiwei Lin, Zhang Li
arxiv.org/abs/2510.10085

@arXiv_csAI_bot@mastoxiv.page
2025-10-15 09:39:41

Evolution of meta's llama models and parameter-efficient fine-tuning of large language models: a survey
Abdulhady Abas Abdullah, Arkaitz Zubiaga, Seyedali Mirjalili, Amir H. Gandomi, Fatemeh Daneshfar, Mohammadsadra Amini, Alan Salam Mohammed, Hadi Veisi
arxiv.org/abs/2510.12178

@arXiv_csLG_bot@mastoxiv.page
2025-09-18 10:18:21

Language models' activations linearly encode training-order recency
Dmitrii Krasheninnikov, Richard E. Turner, David Krueger
arxiv.org/abs/2509.14223

@arXiv_csCV_bot@mastoxiv.page
2025-10-15 10:49:41

Personalized Federated Fine-Tuning of Vision Foundation Models for Healthcare
Adam Tupper, Christian Gagn\'e
arxiv.org/abs/2510.12741 a…

@arXiv_csIT_bot@mastoxiv.page
2025-10-15 07:42:01

FedLoDrop: Federated LoRA with Dropout for Generalized LLM Fine-tuning
Sijing Xie, Dingzhu Wen, Changsheng You, Qimei Chen, Mehdi Bennis, Kaibin Huang
arxiv.org/abs/2510.12078

@arXiv_csLG_bot@mastoxiv.page
2025-09-18 10:19:31

NIRVANA: Structured pruning reimagined for large language models compression
Mengting Ai, Tianxin Wei, Sirui Chen, Jingrui He
arxiv.org/abs/2509.14230

@Techmeme@techhub.social
2025-12-12 19:20:51

Mira Murati's Thinking Machines Lab makes Tinker, its API for fine-tuning language models, generally available, adds support for Kimi K2 Thinking, and more (Thinking Machines Lab)
thinkingmachines.ai/blog/tinke

@arXiv_csCL_bot@mastoxiv.page
2025-09-18 08:58:11

Improving Context Fidelity via Native Retrieval-Augmented Reasoning
Suyuchen Wang, Jinlin Wang, Xinyu Wang, Shiqi Li, Xiangru Tang, Sirui Hong, Xiao-Wen Chang, Chenglin Wu, Bang Liu
arxiv.org/abs/2509.13683

@arXiv_csCV_bot@mastoxiv.page
2025-09-18 10:23:11

Towards Rationale-Answer Alignment of LVLMs via Self-Rationale Calibration
Yuanchen Wu, Ke Yan, Shouhong Ding, Ziyin Zhou, Xiaoqiang Li
arxiv.org/abs/2509.13919

@arXiv_eessAS_bot@mastoxiv.page
2025-09-18 07:52:31

TICL: Text-Embedding KNN For Speech In-Context Learning Unlocks Speech Recognition Abilities of Large Multimodal Models
Haolong Zheng, Yekaterina Yegorova, Mark Hasegawa-Johnson
arxiv.org/abs/2509.13395

@arXiv_condmatmtrlsci_bot@mastoxiv.page
2025-10-14 09:32:48

Predicting Crystal Structures and Ionic Conductivity in Li$_{3}$YCl$_{6-x}$Br$_{x}$ Halide Solid Electrolytes Using a Fine-Tuned Machine Learning Interatomic Potential
Jonas B\"ohm, Aur\'elie Champagne
arxiv.org/abs/2510.09861

@arXiv_csCL_bot@mastoxiv.page
2025-09-18 10:16:51

Canary-1B-v2 & Parakeet-TDT-0.6B-v3: Efficient and High-Performance Models for Multilingual ASR and AST
Monica Sekoyan, Nithin Rao Koluguri, Nune Tadevosyan, Piotr Zelasko, Travis Bartley, Nick Karpov, Jagadeesh Balam, Boris Ginsburg
arxiv.org/abs/2509.14128

@arXiv_qbioQM_bot@mastoxiv.page
2025-10-07 08:47:32

InstructPLM-mu: 1-Hour Fine-Tuning of ESM2 Beats ESM3 in Protein Mutation Predictions
Junde Xu, Yapin Shi, Lijun Lang, Taoyong Cui, Zhiming Zhang, Guangyong Chen, Jiezhong Qiu, Pheng-Ann Heng
arxiv.org/abs/2510.03370

@arXiv_csHC_bot@mastoxiv.page
2025-10-15 10:02:51

Data-Model Co-Evolution: Growing Test Sets to Refine LLM Behavior
Minjae Lee, Minsuk Kahng
arxiv.org/abs/2510.12728 arxiv.org/pdf/2510.1272…

@arXiv_csSE_bot@mastoxiv.page
2025-10-13 09:59:30

TIT: A Tree-Structured Instruction Tuning Approach for LLM-Based Code Translation
He Jiang, Yufu Wang, Hao Lin, Peiyu Zou, Zhide Zhou, Ang Jia, Xiaochen Li, Zhilei Ren
arxiv.org/abs/2510.09400

@arXiv_eessIV_bot@mastoxiv.page
2025-10-09 08:04:51

Stacked Regression using Off-the-shelf, Stimulus-tuned and Fine-tuned Neural Networks for Predicting fMRI Brain Responses to Movies (Algonauts 2025 Report)
Robert Scholz, Kunal Bagga, Christine Ahrends, Carlo Alberto Barbano
arxiv.org/abs/2510.06235

@arXiv_csIR_bot@mastoxiv.page
2025-10-15 09:34:41

Leveraging Language Semantics for Collaborative Filtering with TextGCN and TextGCN-MLP: Zero-Shot vs In-Domain Performance
Andrei Chernov, Haroon Wahab, Oleg Novitskij
arxiv.org/abs/2510.12461

@arXiv_hepph_bot@mastoxiv.page
2025-10-15 09:46:12

A Solution to the Hierarchy Problem with Non-Linear Quantum Mechanics
David E. Kaplan, Surjeet Rajendran
arxiv.org/abs/2510.12030 arxiv.org…

@arXiv_csCL_bot@mastoxiv.page
2025-10-14 13:15:58

MeTA-LoRA: Data-Efficient Multi-Task Fine-Tuning for Large Language Models
Bo Cheng, Xu Wang, Jinda Liu, Yi Chang, Yuan Wu
arxiv.org/abs/2510.11598

@tinoeberl@mastodon.online
2025-10-11 05:07:02

Ein Artikel in Plos One beleuchtet den politischen #Bias von #Chatbots und zeigt, dass viele KI-Modelle eher linksgerichtete Antworten geben.
Dies könnte auf das Supervised Fine-Tuning zurückzuführen sein, bei dem KI durch menschliche Beispiele lernt. Interessanterweise weisen unterschiedliche

@arXiv_csSD_bot@mastoxiv.page
2025-10-14 10:45:18

Knowledge-Decoupled Functionally Invariant Path with Synthetic Personal Data for Personalized ASR
Yue Gu, Zhihao Du, Ying Shi, Jiqing Han, Yongjun He
arxiv.org/abs/2510.10401

@arXiv_csAR_bot@mastoxiv.page
2025-10-14 09:20:08

Efficient In-Memory Acceleration of Sparse Block Diagonal LLMs
Jo\~ao Paulo Cardoso de Lima, Marc Dietrich, Jeronimo Castrillon, Asif Ali Khan
arxiv.org/abs/2510.11192

@arXiv_csDB_bot@mastoxiv.page
2025-10-14 08:25:48

GrASP: A Generalizable Address-based Semantic Prefetcher for Scalable Transactional and Analytical Workloads
Farzaneh Zirak, Farhana Choudhury, Renata Borovica-Gajic
arxiv.org/abs/2510.11011

@arXiv_csCR_bot@mastoxiv.page
2025-10-06 09:47:19

Attack via Overfitting: 10-shot Benign Fine-tuning to Jailbreak LLMs
Zhixin Xie, Xurui Song, Jun Luo
arxiv.org/abs/2510.02833 arxiv.org/pdf…

@arXiv_csCL_bot@mastoxiv.page
2025-10-14 13:08:18

Early Detection and Reduction of Memorisation for Domain Adaptation and Instruction Tuning
Dean L. Slack, Noura Al Moubayed
arxiv.org/abs/2510.11372

@Techmeme@techhub.social
2025-10-01 18:19:29

Mira Murati's Thinking Machines Lab launches its first product, Tinker, which automates the creation of custom frontier AI models (Will Knight/Wired)
wired.com/story/thinking-machi

@arXiv_csCE_bot@mastoxiv.page
2025-10-02 07:49:51

Flow of Knowledge: Federated Fine-Tuning of LLMs in Healthcare under Non-IID Conditions
Zeyu Chen, Yun Ji, Bowen Wang, Liwen Shi, Zijie Zeng, Sheng Zhang
arxiv.org/abs/2510.00543

@arXiv_csAI_bot@mastoxiv.page
2025-10-10 10:45:19

How to Teach Large Multimodal Models New Skills
Zhen Zhu, Yiming Gong, Yao Xiao, Yaoyao Liu, Derek Hoiem
arxiv.org/abs/2510.08564 arxiv.org…

@arXiv_csCY_bot@mastoxiv.page
2025-09-30 11:01:31

Learning from Convenience Samples: A Case Study on Fine-Tuning LLMs for Survey Non-response in the German Longitudinal Election Study
Tobias Holtdirk, Dennis Assenmacher, Arnim Bleier, Claudia Wagner
arxiv.org/abs/2509.25063

@arXiv_astrophEP_bot@mastoxiv.page
2025-10-06 08:52:09

Self-supervised diffusion model fine-tuning for costate initialization using Markov chain Monte Carlo
Jannik Graebner, Ryne Beeson
arxiv.org/abs/2510.02527

@arXiv_csCL_bot@mastoxiv.page
2025-10-13 10:32:50

Understanding the Effects of Domain Finetuning on LLMs
Eshaan Tanwar, Deepak Nathani, William Yang Wang, Tanmoy Chakraborty
arxiv.org/abs/2510.09359

@arXiv_csCV_bot@mastoxiv.page
2025-10-15 10:44:21

CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving
Xiaoji Zheng, Ziyuan Yang, Yanhao Chen, Yuhang Peng, Yuanrong Tang, Gengyuan Liu, Bokui Chen, Jiangtao Gong
arxiv.org/abs/2510.12560

@arXiv_csLG_bot@mastoxiv.page
2025-10-09 10:43:11

Enhancing Speech Emotion Recognition via Fine-Tuning Pre-Trained Models and Hyper-Parameter Optimisation
Aryan Golbaghi, Shuo Zhou
arxiv.org/abs/2510.07052

@arXiv_csSE_bot@mastoxiv.page
2025-10-09 10:00:31

Prompt, Synthesize, Fine-Tune: A Secure Code Generation Recipe
Junjie Li, Fazle Rabbi, Bo Yang, Song Wang, Jinqiu Yang
arxiv.org/abs/2510.07189

@arXiv_csIR_bot@mastoxiv.page
2025-10-14 10:38:48

Does LLM Focus on the Right Words? Diagnosing Language Bias in LLM-based Recommenders
Bohao Wang, Jiawei Chen, Feng Liu, Changwang Zhang, Jun Wang, Canghong Jin, Chun Chen, Can Wang
arxiv.org/abs/2510.10978

@arXiv_csSD_bot@mastoxiv.page
2025-10-14 11:06:48

Proficiency-Aware Adaptation and Data Augmentation for Robust L2 ASR
Ling Sun, Charlotte Zhu, Shuju Shi
arxiv.org/abs/2510.10738 arxiv.org/…

@arXiv_csRO_bot@mastoxiv.page
2025-09-29 10:18:57

Actions as Language: Fine-Tuning VLMs into VLAs Without Catastrophic Forgetting
Asher J. Hancock, Xindi Wu, Lihan Zha, Olga Russakovsky, Anirudha Majumdar
arxiv.org/abs/2509.22195

@arXiv_csCL_bot@mastoxiv.page
2025-10-14 13:09:08

Valid Survey Simulations with Limited Human Data: The Roles of Prompting, Fine-Tuning, and Rectification
Stefan Krsteski, Giuseppe Russo, Serina Chang, Robert West, Kristina Gligori\'c
arxiv.org/abs/2510.11408

@arXiv_csCR_bot@mastoxiv.page
2025-10-14 11:47:28

MetaBreak: Jailbreaking Online LLM Services via Special Token Manipulation
Wentian Zhu, Zhen Xiang, Wei Niu, Le Guan
arxiv.org/abs/2510.10271

@arXiv_csDB_bot@mastoxiv.page
2025-10-13 07:45:10

HES-SQL: Hybrid Reasoning for Efficient Text-to-SQL with Structural Skeleton Guidance
Suming Qiu, Jing Li, Zhicheng Zhou, Junjie Huang, Linyuan Qiu, Zhijie Sun
arxiv.org/abs/2510.08896

@arXiv_csAI_bot@mastoxiv.page
2025-10-13 09:45:30

MEC$^3$O: Multi-Expert Consensus for Code Time Complexity Prediction
Joonghyuk Hahn, Soohan Lim, Yo-Sub Han
arxiv.org/abs/2510.09049 arxiv.…

@arXiv_condmatmtrlsci_bot@mastoxiv.page
2025-10-07 11:01:22

Comparing fine-tuning strategies of MACE machine learning force field for modeling Li-ion diffusion in LiF for batteries
Nada Alghamdi, Paolo de Angelis, Pietro Asinari, Eliodoro Chiavazzo
arxiv.org/abs/2510.05020

@arXiv_csIT_bot@mastoxiv.page
2025-09-26 07:36:11

A Deep Transfer Learning-Based Low-overhead Beam Prediction in Vehicle Communications
Zhiqiang Xiao, Yuwen Cao, Mondher Bouazizi, Tomoaki Ohtsuki, Shahid Mumtaz
arxiv.org/abs/2509.20659

@arXiv_csCV_bot@mastoxiv.page
2025-10-14 22:04:45

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[6/8]:
- GeoVLM-R1: Reinforcement Fine-Tuning for Improved Remote Sensing Reasoning
Mustansar Fiaz, Hiyam Debary, Paolo Fraccaro, Danda Paudel, Luc Van Gool, Fahad Khan, Salman Khan

@arXiv_eessAS_bot@mastoxiv.page
2025-10-15 09:09:02

DeePAQ: A Perceptual Audio Quality Metric Based On Foundational Models and Weakly Supervised Learning
Guanxin Jiang, Andreas Brendel, Pablo M. Delgado, J\"urgen Herre
arxiv.org/abs/2510.12326

@arXiv_csLG_bot@mastoxiv.page
2025-10-14 13:40:48

MATH-Beyond: A Benchmark for RL to Expand Beyond the Base Model
Prasanna Mayilvahanan, Ricardo Dominguez-Olmedo, Thadd\"aus Wiedemer, Wieland Brendel
arxiv.org/abs/2510.11653

@arXiv_csCE_bot@mastoxiv.page
2025-09-29 08:19:57

Sci2Pol: Evaluating and Fine-tuning LLMs on Scientific-to-Policy Brief Generation
Weimin Wu, Alexander C. Furnas, Eddie Yang, Gefei Liu, Akhil Pandey Akella, Xuefeng Song, Dashun Wang, Han Liu
arxiv.org/abs/2509.21493

@arXiv_csCL_bot@mastoxiv.page
2025-10-15 10:41:41

Teaching Language Models to Faithfully Express their Uncertainty
Bryan Eikema, Evgenia Ilia, Jos\'e G. C. de Souza, Chrysoula Zerva, Wilker Aziz
arxiv.org/abs/2510.12587

@arXiv_csCR_bot@mastoxiv.page
2025-10-03 07:34:30

Fine-Tuning Jailbreaks under Highly Constrained Black-Box Settings: A Three-Pronged Approach
Xiangfang Li, Yu Wang, Bo Li
arxiv.org/abs/2510.01342

@arXiv_csAI_bot@mastoxiv.page
2025-10-13 14:49:33

Replaced article(s) found for cs.AI. arxiv.org/list/cs.AI/new
[5/7]:
- On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learn...
Zhang, Xie, Sun, Chen, Wang, Li, Ding, Zhou

@arXiv_csCL_bot@mastoxiv.page
2025-10-07 12:18:02

Resource-Efficient Fine-Tuning of LLaMA-3.2-3B for Medical Chain-of-Thought Reasoning
Imran Mansha
arxiv.org/abs/2510.05003 arxiv.org/pdf/2…

@arXiv_csSD_bot@mastoxiv.page
2025-10-07 08:23:42

Lightweight and Generalizable Acoustic Scene Representations via Contrastive Fine-Tuning and Distillation
Kuang Yuan, Yang Gao, Xilin Li, Xinhao Mei, Syavosh Zadissa, Tarun Pruthi, Saeed Bagheri Sereshki
arxiv.org/abs/2510.03728

@arXiv_csLG_bot@mastoxiv.page
2025-10-08 10:29:39

AMAQ: Adaptive Mixed-bit Activation Quantization for Collaborative Parameter Efficient Fine-tuning
Yurun Song, Zhuoyi Yang, Ian G. Harris, Sangeetha Abdu Jyothi
arxiv.org/abs/2510.05468

@arXiv_csSE_bot@mastoxiv.page
2025-10-01 07:50:27

Devstral: Fine-tuning Language Models for Coding Agent Applications
Abhinav Rastogi, Adam Yang, Albert Q. Jiang, Alexander H. Liu, Alexandre Sablayrolles, Am\'elie H\'eliou, Am\'elie Martin, Anmol Agarwal, Andy Ehrenberg, Andy Lo, Antoine Roux, Arthur Darcet, Arthur Mensch, Baptiste Bout, Baptiste Rozi\`ere, Baudouin De Monicault, Chris Bamford, Christian Wallenwein, Christophe Renaudin, Cl\'emence Lanfranchi, Cl\'ement Denoix, Corentin Barreau, Darius Dabert Devon …

@arXiv_csAI_bot@mastoxiv.page
2025-10-08 09:59:59

MetaVLA: Unified Meta Co-training For Efficient Embodied Adaption
Chen Li, Zhantao Yang, Han Zhang, Fangyi Chen, Chenchen Zhu, Anudeepsekhar Bolimera, Marios Savvides
arxiv.org/abs/2510.05580

@arXiv_condmatmtrlsci_bot@mastoxiv.page
2025-10-01 09:42:38

Fine-Tuning Bulk-oriented Universal Interatomic Potentials for Surfaces: Accuracy, Efficiency, and Forgetting Control
Jaekyun Hwang, Taehun Lee, Yonghyuk Lee, Su-Hyun Yoo
arxiv.org/abs/2509.25807

@arXiv_csCV_bot@mastoxiv.page
2025-10-06 10:05:39

Multimodal Carotid Risk Stratification with Large Vision-Language Models: Benchmarking, Fine-Tuning, and Clinical Insights
Daphne Tsolissou, Theofanis Ganitidis, Konstantinos Mitsis, Stergios CHristodoulidis, Maria Vakalopoulou, Konstantina Nikita
arxiv.org/abs/2510.02922

@arXiv_csLG_bot@mastoxiv.page
2025-09-30 14:45:31

TR2-D2: Tree Search Guided Trajectory-Aware Fine-Tuning for Discrete Diffusion
Sophia Tang, Yuchen Zhu, Molei Tao, Pranam Chatterjee
arxiv.org/abs/2509.25171

@arXiv_csCL_bot@mastoxiv.page
2025-10-15 10:44:11

Reasoning Pattern Matters: Learning to Reason without Human Rationales
Chaoxu Pang, Yixuan Cao, Ping Luo
arxiv.org/abs/2510.12643 arxiv.org…

@arXiv_csLG_bot@mastoxiv.page
2025-10-02 11:10:31

Sample-Efficient Differentially Private Fine-Tuning via Gradient Matrix Denoising
Ali Dadsetan, Frank Rudzicz
arxiv.org/abs/2510.01137 arxi…

@arXiv_csAI_bot@mastoxiv.page
2025-09-24 10:31:04

Data Efficient Adaptation in Large Language Models via Continuous Low-Rank Fine-Tuning
Xiao Han, Zimo Zhao, Wanyu Wang, Maolin Wang, Zitao Liu, Yi Chang, Xiangyu Zhao
arxiv.org/abs/2509.18942

@arXiv_csSD_bot@mastoxiv.page
2025-09-22 08:16:11

Exploring Fine-Tuning of Large Audio Language Models for Spoken Language Understanding under Limited Speech data
Youngwon Choi, Jaeyoon Jung, Hyeonyu Kim, Huu-Kim Nguyen, Hwayeon Kim
arxiv.org/abs/2509.15389

@arXiv_csCV_bot@mastoxiv.page
2025-12-12 14:07:46

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/5]:
- Fairness-Aware Fine-Tuning of Vision-Language Models for Medical Glaucoma Diagnosis
Zijian Gu, Yuxi Liu, Zhenhao Zhang, Song Wang

@arXiv_csCL_bot@mastoxiv.page
2025-10-15 10:38:21

Probing Latent Knowledge Conflict for Faithful Retrieval-Augmented Generation
Linfeng Gao, Baolong Bi, Zheng Yuan, Le Wang, Zerui Chen, Zhimin Wei, Shenghua Liu, Qinggang Zhang, Jinsong Su
arxiv.org/abs/2510.12460

@arXiv_csLG_bot@mastoxiv.page
2025-10-13 10:41:10

Safety Game: Balancing Safe and Informative Conversations with Blackbox Agentic AI using LP Solvers
Tuan Nguyen, Long Tran-Thanh
arxiv.org/abs/2510.09330

@arXiv_csAI_bot@mastoxiv.page
2025-10-01 11:47:17

Fine-tuning Behavioral Cloning Policies with Preference-Based Reinforcement Learning
Ma\"el Macuglia, Paul Friedrich, Giorgia Ramponi
arxiv.org/abs/2509.26605

@arXiv_csLG_bot@mastoxiv.page
2025-10-10 11:10:19

FlyLoRA: Boosting Task Decoupling and Parameter Efficiency via Implicit Rank-Wise Mixture-of-Experts
Heming Zou, Yunliang Zang, Wutong Xu, Yao Zhu, Xiangyang Ji
arxiv.org/abs/2510.08396

@arXiv_csCL_bot@mastoxiv.page
2025-09-22 10:24:11

BEFT: Bias-Efficient Fine-Tuning of Language Models
Baichuan Huang, Ananth Balashankar, Amir Aminifar
arxiv.org/abs/2509.15974 arxiv.org/pd…

@arXiv_csCV_bot@mastoxiv.page
2025-10-02 10:54:41

AI-CNet3D: An Anatomically-Informed Cross-Attention Network with Multi-Task Consistency Fine-tuning for 3D Glaucoma Classification
Roshan Kenia, Anfei Li, Rishabh Srivastava, Kaveri A. Thakoor
arxiv.org/abs/2510.00882

@arXiv_csAI_bot@mastoxiv.page
2025-10-03 09:10:01

DualTune: Decoupled Fine-Tuning for On-Device Agentic Systems
Rohan Kadekodi, Zhan Jin, Keisuke Kamahori, Yile Gu, Sean Khatiri, Noah H. Bayindirli, Sergey Gorbunov, Baris Kasikci
arxiv.org/abs/2510.00229

@arXiv_csLG_bot@mastoxiv.page
2025-10-13 10:43:00

HINT: Helping Ineffective Rollouts Navigate Towards Effectiveness
Xinyi Wang, Jinyi Han, Zishang Jiang, Tingyun Li, Jiaqing Liang, Sihang Jiang, Zhaoqian Dai, Shuguang Ma, Fei Yu, Yanghua Xiao
arxiv.org/abs/2510.09388

@arXiv_csLG_bot@mastoxiv.page
2025-09-30 14:43:51

BALF: Budgeted Activation-Aware Low-Rank Factorization for Fine-Tuning-Free Model Compression
David Gonz\'alez Mart\'inez
arxiv.org/abs/2509.25136

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 11:15:59

SliceFine: The Universal Winning-Slice Hypothesis for Pretrained Networks
Md Kowsher, Ali O. Polat, Ehsan Mohammady Ardehaly, Mehrdad Salehi, Zia Ghiasi, Prasanth Murali, Chen Chen
arxiv.org/abs/2510.08513

@arXiv_csCL_bot@mastoxiv.page
2025-10-02 10:29:21

Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum
Gaotang Li, Ruizhong Qiu, Xiusi Chen, Heng Ji, Hanghang Tong
arxiv.org/abs/2510.00526

@arXiv_csLG_bot@mastoxiv.page
2025-09-29 11:37:57

IA2: Alignment with ICL Activations Improves Supervised Fine-Tuning
Aayush Mishra, Daniel Khashabi, Anqi Liu
arxiv.org/abs/2509.22621 arxiv…

@arXiv_csCV_bot@mastoxiv.page
2025-09-23 13:11:11

TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMs
Yunheng Li, Jing Cheng, Shaoyong Jia, Hangyi Kuang, Shaohui Jiao, Qibin Hou, Ming-Ming Cheng
arxiv.org/abs/2509.18056

@arXiv_csCL_bot@mastoxiv.page
2025-10-13 14:30:10

Replaced article(s) found for cs.CL. arxiv.org/list/cs.CL/new
[1/4]:
- Privacy-Preserving Parameter-Efficient Fine-Tuning for Large Language Model Services
Yansong Li, Zhixing Tan, Paula Branco, Yang Liu

@arXiv_csAI_bot@mastoxiv.page
2025-10-10 10:37:29

Symmetry-Aware Fully-Amortized Optimization with Scale Equivariant Graph Metanetworks
Bart Kuipers, Freek Byrman, Daniel Uyterlinde, Alejandro Garc\'ia-Castellanos
arxiv.org/abs/2510.08300

@arXiv_csLG_bot@mastoxiv.page
2025-10-09 10:50:21

A Multi-Agent Framework for Stateful Inference-Time Search
Arshika Lalan, Rajat Ghosh, Aditya Kolsur, Debojyoti Dutta
arxiv.org/abs/2510.07147

@arXiv_csCL_bot@mastoxiv.page
2025-10-01 11:16:37

One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy Gradient
Rui Ming, Haoyuan Wu, Shoubo Hu, Zhuolun He, Bei Yu
arxiv.org/abs/2509.26313

@arXiv_csCV_bot@mastoxiv.page
2025-09-25 10:04:22

Robust RGB-T Tracking via Learnable Visual Fourier Prompt Fine-tuning and Modality Fusion Prompt Generation
Hongtao Yang, Bineng Zhong, Qihua Liang, Zhiruo Zhu, Yaozong Zheng, Ning Li
arxiv.org/abs/2509.19733

@arXiv_csAI_bot@mastoxiv.page
2025-10-10 10:44:59

Agent Learning via Early Experience
Kai Zhang, Xiangchao Chen, Bo Liu, Tianci Xue, Zeyi Liao, Zhihan Liu, Xiyao Wang, Yuting Ning, Zhaorun Chen, Xiaohan Fu, Jian Xie, Yuxuan Sun, Boyu Gou, Qi Qi, Zihang Meng, Jianwei Yang, Ning Zhang, Xian Li, Ashish Shah, Dat Huynh, Hengduo Li, Zi Yang, Sara Cao, Lawrence Jang, Shuyan Zhou, Jiacheng Zhu, Huan Sun, Jason Weston, Yu Su, Yifan Wu

@arXiv_csLG_bot@mastoxiv.page
2025-10-08 10:58:29

Influence Functions for Efficient Data Selection in Reasoning
Prateek Humane, Paolo Cudrano, Daniel Z. Kaplan, Matteo Matteucci, Supriyo Chakraborty, Irina Rish
arxiv.org/abs/2510.06108

@arXiv_csCL_bot@mastoxiv.page
2025-10-07 12:03:32

TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA
Chanjoo Jung, Jaehyung Kim
arxiv.org/abs/2510.04682 arxiv.o…

@arXiv_csCL_bot@mastoxiv.page
2025-09-22 10:09:21

Fine-Tuning Large Multimodal Models for Automatic Pronunciation Assessment
Ke Wang, Wenning Wei, Yan Deng, Lei He, Sheng Zhao
arxiv.org/abs/2509.15701

@arXiv_csAI_bot@mastoxiv.page
2025-09-29 10:29:27

InfiMed-Foundation: Pioneering Advanced Multimodal Medical Models with Compute-Efficient Pre-Training and Multi-Stage Fine-Tuning
Guanghao Zhu, Zhitian Hou, Zeyu Liu, Zhijie Sang, Congkai Xie, Hongxia Yang
arxiv.org/abs/2509.22261

@arXiv_csLG_bot@mastoxiv.page
2025-09-22 10:23:51

Targeted Fine-Tuning of DNN-Based Receivers via Influence Functions
Marko Tuononen, Heikki Penttinen, Ville Hautam\"aki
arxiv.org/abs/2509.15950

@arXiv_csCL_bot@mastoxiv.page
2025-10-02 10:39:21

Family Matters: Language Transfer and Merging for Adapting Small LLMs to Faroese
Jenny Kunz, Iben Nyholm Debess, Annika Simonsen
arxiv.org/abs/2510.00810

@arXiv_csLG_bot@mastoxiv.page
2025-10-10 11:18:59

Improving Reasoning for Diffusion Language Models via Group Diffusion Policy Optimization
Kevin Rojas, Jiahe Lin, Kashif Rasul, Anderson Schneider, Yuriy Nevmyvaka, Molei Tao, Wei Deng
arxiv.org/abs/2510.08554

@arXiv_csLG_bot@mastoxiv.page
2025-10-10 11:08:29

Guided Star-Shaped Masked Diffusion
Viacheslav Meshchaninov, Egor Shibaev, Artem Makoian, Ivan Klimov, Danil Sheshenya, Andrei Malinin, Nikita Balagansky, Daniil Gavrilov, Aibek Alanov, Dmitry Vetrov
arxiv.org/abs/2510.08369

@arXiv_csCL_bot@mastoxiv.page
2025-09-24 10:34:34

When Long Helps Short: How Context Length in Supervised Fine-tuning Affects Behavior of Large Language Models
Yingming Zheng, Hanqi Li, Kai Yu, Lu Chen
arxiv.org/abs/2509.18762

@arXiv_csLG_bot@mastoxiv.page
2025-09-26 10:31:11

Explaining Fine Tuned LLMs via Counterfactuals A Knowledge Graph Driven Framework
Yucheng Wang, Ziyang Chen, Md Faisal Kabir
arxiv.org/abs/2509.21241

@arXiv_csCL_bot@mastoxiv.page
2025-09-25 10:47:32

Investigating the Representation of Backchannels and Fillers in Fine-tuned Language Models
Yu Wang, Leyi Lao, Langchu Huang, Gabriel Skantze, Yang Xu, Hendrik Buschmeier
arxiv.org/abs/2509.20237

@arXiv_csCL_bot@mastoxiv.page
2025-10-03 10:55:21

AccurateRAG: A Framework for Building Accurate Retrieval-Augmented Question-Answering Applications
Linh The Nguyen, Chi Tran, Dung Ngoc Nguyen, Van-Cuong Pham, Hoang Ngo, Dat Quoc Nguyen
arxiv.org/abs/2510.02243

@arXiv_csCL_bot@mastoxiv.page
2025-09-22 09:58:31

A method for improving multilingual quality and diversity of instruction fine-tuning datasets
Chunguang Zhao, Yilun Liu, Pufan Zeng, Yuanchang Luo, Shimin Tao, Minggui He, Weibin Meng, Song Xu, Ziang Chen, Chen Liu, Hongxia Ma, Li Zhang, Boxing Chen, Daimeng Wei
arxiv.org/abs/2509.15549

@arXiv_csCL_bot@mastoxiv.page
2025-10-01 11:15:07

Finetune Once: Decoupling General & Domain Learning with Dynamic Boosted Annealing
Yang Tang, Ruijie Liu, Yifan Wang, Shiyu Li, Xi Chen
arxiv.org/abs/2509.26242

@arXiv_csCL_bot@mastoxiv.page
2025-09-30 14:03:41

Metaphor identification using large language models: A comparison of RAG, prompt engineering, and fine-tuning
Matteo Fuoli, Weihang Huang, Jeannette Littlemore, Sarah Turner, Ellen Wilding
arxiv.org/abs/2509.24866

@arXiv_csCL_bot@mastoxiv.page
2025-10-09 10:21:11

Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management
Miao Lu, Weiwei Sun, Weihua Du, Zhan Ling, Xuesong Yao, Kang Liu, Jiecao Chen
arxiv.org/abs/2510.06727