Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csCR_bot@mastoxiv.page
2025-10-14 11:46:48

Pharmacist: Safety Alignment Data Curation for Large Language Models against Harmful Fine-tuning
Guozhi Liu, Qi Mu, Tiansheng Huang, Xinhua Wang, Li Shen, Weiwei Lin, Zhang Li
arxiv.org/abs/2510.10085

@Techmeme@techhub.social
2025-08-14 18:05:49

Google announces Gemma 3 270M, a compact model designed for task-specific fine-tuning with strong capabilities in instruction following and text structuring (Google Developers Blog)
developers.googleblog.com/en/i

@arXiv_csCL_bot@mastoxiv.page
2025-10-13 10:32:50

Understanding the Effects of Domain Finetuning on LLMs
Eshaan Tanwar, Deepak Nathani, William Yang Wang, Tanmoy Chakraborty
arxiv.org/abs/2510.09359

@arXiv_csNI_bot@mastoxiv.page
2025-08-14 09:34:22

NEFMind: Parameter-Efficient Fine-Tuning of Open-Source LLMs for Telecom APIs Automation
Zainab Khan, Ahmed Hussain, Mukesh Thakur, Arto Hellas, Panos Papadimitratos
arxiv.org/abs/2508.09240

@arXiv_csCV_bot@mastoxiv.page
2025-09-12 10:14:39

PeftCD: Leveraging Vision Foundation Models with Parameter-Efficient Fine-Tuning for Remote Sensing Change Detection
Sijun Dong, Yuxuan Hu, LiBo Wang, Geng Chen, Xiaoliang Meng
arxiv.org/abs/2509.09572

@arXiv_csCL_bot@mastoxiv.page
2025-10-14 13:15:58

MeTA-LoRA: Data-Efficient Multi-Task Fine-Tuning for Large Language Models
Bo Cheng, Xu Wang, Jinda Liu, Yi Chang, Yuan Wu
arxiv.org/abs/2510.11598

@arXiv_csSE_bot@mastoxiv.page
2025-10-13 09:59:30

TIT: A Tree-Structured Instruction Tuning Approach for LLM-Based Code Translation
He Jiang, Yufu Wang, Hao Lin, Peiyu Zou, Zhide Zhou, Ang Jia, Xiaochen Li, Zhilei Ren
arxiv.org/abs/2510.09400

@arXiv_condmatmtrlsci_bot@mastoxiv.page
2025-10-14 09:32:48

Predicting Crystal Structures and Ionic Conductivity in Li$_{3}$YCl$_{6-x}$Br$_{x}$ Halide Solid Electrolytes Using a Fine-Tuned Machine Learning Interatomic Potential
Jonas B\"ohm, Aur\'elie Champagne
arxiv.org/abs/2510.09861

@arXiv_csHC_bot@mastoxiv.page
2025-08-12 10:46:53

Fine-Tuning Large Language Models Using EEG Microstate Features for Mental Workload Assessment
Bujar Raufi
arxiv.org/abs/2508.07283 arxiv.o…

@arXiv_csLG_bot@mastoxiv.page
2025-10-13 10:41:10

Safety Game: Balancing Safe and Informative Conversations with Blackbox Agentic AI using LP Solvers
Tuan Nguyen, Long Tran-Thanh
arxiv.org/abs/2510.09330

@arXiv_csAI_bot@mastoxiv.page
2025-10-13 09:45:30

MEC$^3$O: Multi-Expert Consensus for Code Time Complexity Prediction
Joonghyuk Hahn, Soohan Lim, Yo-Sub Han
arxiv.org/abs/2510.09049 arxiv.…

@arXiv_csCL_bot@mastoxiv.page
2025-10-14 13:08:18

Early Detection and Reduction of Memorisation for Domain Adaptation and Instruction Tuning
Dean L. Slack, Noura Al Moubayed
arxiv.org/abs/2510.11372

@arXiv_csCR_bot@mastoxiv.page
2025-09-12 08:44:59

DP-FedLoRA: Privacy-Enhanced Federated Fine-Tuning for On-Device Large Language Models
Honghui Xu, Shiva Shrestha, Wei Chen, Zhiyuan Li, Zhipeng Cai
arxiv.org/abs/2509.09097

@arXiv_statML_bot@mastoxiv.page
2025-10-14 09:12:08

Calibrating Generative Models
Henry D. Smith, Nathaniel L. Diamant, Brian L. Trippe
arxiv.org/abs/2510.10020 arxiv.org/pdf/2510.10020

@arXiv_eessAS_bot@mastoxiv.page
2025-08-12 10:30:33

G-IFT: A Gated Linear Unit adapter with Iterative Fine-Tuning for Low-Resource Children's Speaker Verification
Vishwas M. Shetty, Jiusi Zheng, Abeer Alwan
arxiv.org/abs/2508.07836

@arXiv_csGR_bot@mastoxiv.page
2025-08-11 07:34:19

DogFit: Domain-guided Fine-tuning for Efficient Transfer Learning of Diffusion Models
Yara Bahram, Mohammadhadi Shateri, Eric Granger
arxiv.org/abs/2508.05685

@arXiv_csSD_bot@mastoxiv.page
2025-10-14 10:45:18

Knowledge-Decoupled Functionally Invariant Path with Synthetic Personal Data for Personalized ASR
Yue Gu, Zhihao Du, Ying Shi, Jiqing Han, Yongjun He
arxiv.org/abs/2510.10401

@arXiv_csAR_bot@mastoxiv.page
2025-10-14 09:20:08

Efficient In-Memory Acceleration of Sparse Block Diagonal LLMs
Jo\~ao Paulo Cardoso de Lima, Marc Dietrich, Jeronimo Castrillon, Asif Ali Khan
arxiv.org/abs/2510.11192

@arXiv_csDB_bot@mastoxiv.page
2025-10-13 07:45:10

HES-SQL: Hybrid Reasoning for Efficient Text-to-SQL with Structural Skeleton Guidance
Suming Qiu, Jing Li, Zhicheng Zhou, Junjie Huang, Linyuan Qiu, Zhijie Sun
arxiv.org/abs/2510.08896

@arXiv_csIR_bot@mastoxiv.page
2025-10-14 10:38:48

Does LLM Focus on the Right Words? Diagnosing Language Bias in LLM-based Recommenders
Bohao Wang, Jiawei Chen, Feng Liu, Changwang Zhang, Jun Wang, Canghong Jin, Chun Chen, Can Wang
arxiv.org/abs/2510.10978

@arXiv_condmatmeshall_bot@mastoxiv.page
2025-08-14 09:06:42

Phonon interference effects in GaAs-GaP superlattice nanowires
Chaitanya Arya, Johannes Trautvetter, Jose M. Sojo-Gordillo, Yashpreet Kaur, Valentina Zannier, Fabio Beltram, Tommaso Albrigi, Alicia Ruiz-Caridad, Lucia Sorba, Riccardo Rurali, Ilaria Zardo
arxiv.org/abs/2508.09556

@arXiv_csCL_bot@mastoxiv.page
2025-10-14 13:09:08

Valid Survey Simulations with Limited Human Data: The Roles of Prompting, Fine-Tuning, and Rectification
Stefan Krsteski, Giuseppe Russo, Serina Chang, Robert West, Kristina Gligori\'c
arxiv.org/abs/2510.11408

@arXiv_csRO_bot@mastoxiv.page
2025-08-05 11:45:31

CO-RFT: Efficient Fine-Tuning of Vision-Language-Action Models through Chunked Offline Reinforcement Learning
Dongchi Huang, Zhirui Fang, Tianle Zhang, Yihang Li, Lin Zhao, Chunhe Xia
arxiv.org/abs/2508.02219

@arXiv_csAI_bot@mastoxiv.page
2025-10-13 14:49:33

Replaced article(s) found for cs.AI. arxiv.org/list/cs.AI/new
[5/7]:
- On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learn...
Zhang, Xie, Sun, Chen, Wang, Li, Ding, Zhou

@arXiv_csLG_bot@mastoxiv.page
2025-10-13 10:43:00

HINT: Helping Ineffective Rollouts Navigate Towards Effectiveness
Xinyi Wang, Jinyi Han, Zishang Jiang, Tingyun Li, Jiaqing Liang, Sihang Jiang, Zhaoqian Dai, Shuguang Ma, Fei Yu, Yanghua Xiao
arxiv.org/abs/2510.09388

@arXiv_qbioQM_bot@mastoxiv.page
2025-10-07 08:47:32

InstructPLM-mu: 1-Hour Fine-Tuning of ESM2 Beats ESM3 in Protein Mutation Predictions
Junde Xu, Yapin Shi, Lijun Lang, Taoyong Cui, Zhiming Zhang, Guangyong Chen, Jiezhong Qiu, Pheng-Ann Heng
arxiv.org/abs/2510.03370

@arXiv_eessIV_bot@mastoxiv.page
2025-10-09 08:04:51

Stacked Regression using Off-the-shelf, Stimulus-tuned and Fine-tuned Neural Networks for Predicting fMRI Brain Responses to Movies (Algonauts 2025 Report)
Robert Scholz, Kunal Bagga, Christine Ahrends, Carlo Alberto Barbano
arxiv.org/abs/2510.06235

@seeingwithsound@mas.to
2025-09-11 14:06:05

[OT] Revealed: Apple is teaching its AI to adapt to the Trump era politico.eu/article/apple-teac "updated guidelines on how the AI talks about diversity, equity and inclusion poli…

@tinoeberl@mastodon.online
2025-10-11 05:07:02

Ein Artikel in Plos One beleuchtet den politischen #Bias von #Chatbots und zeigt, dass viele KI-Modelle eher linksgerichtete Antworten geben.
Dies könnte auf das Supervised Fine-Tuning zurückzuführen sein, bei dem KI durch menschliche Beispiele lernt. Interessanterweise weisen unterschiedliche

@arXiv_csSD_bot@mastoxiv.page
2025-10-14 11:06:48

Proficiency-Aware Adaptation and Data Augmentation for Robust L2 ASR
Ling Sun, Charlotte Zhu, Shuju Shi
arxiv.org/abs/2510.10738 arxiv.org/…

@arXiv_csIR_bot@mastoxiv.page
2025-08-11 09:29:49

Fine-Tuning Vision-Language Models for Markdown Conversion of Financial Tables in Malaysian Audited Financial Reports
Jin Khye Tan (Faculty of Computer Science,Information Technology, Universiti Malaya), En Jun Choong, Ethan Jeremiah Chitty, Yan Pheng Choo, John Hsin Yang Wong, Chern Eu Cheah
arxiv.org/abs/2508.05669

@arXiv_csCR_bot@mastoxiv.page
2025-09-12 07:37:29

When FinTech Meets Privacy: Securing Financial LLMs with Differential Private Fine-Tuning
Sichen Zhu, Hoyeung Leung, Xiaoyi Wang, Jia Wei, Honghui Xu
arxiv.org/abs/2509.08995

@arXiv_csDB_bot@mastoxiv.page
2025-10-14 08:25:48

GrASP: A Generalizable Address-based Semantic Prefetcher for Scalable Transactional and Analytical Workloads
Farzaneh Zirak, Farhana Choudhury, Renata Borovica-Gajic
arxiv.org/abs/2510.11011

@arXiv_csLG_bot@mastoxiv.page
2025-10-14 13:40:48

MATH-Beyond: A Benchmark for RL to Expand Beyond the Base Model
Prasanna Mayilvahanan, Ricardo Dominguez-Olmedo, Thadd\"aus Wiedemer, Wieland Brendel
arxiv.org/abs/2510.11653

@arXiv_csCL_bot@mastoxiv.page
2025-08-14 09:51:22

A Comprehensive Evaluation framework of Alignment Techniques for LLMs
Muneeza Azmat, Momin Abbas, Maysa Malfiza Garcia de Macedo, Marcelo Carpinette Grave, Luan Soares de Souza, Tiago Machado, Rogerio A de Paula, Raya Horesh, Yixin Chen, Heloisa Caroline de Souza Pereira Candello, Rebecka Nordenlow, Aminat Adebiyi
arxiv.org/abs/250…

@arXiv_csCV_bot@mastoxiv.page
2025-08-12 12:46:03

MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision
Zhonghao Yan, Muxi Diao, Yuxuan Yang, Jiayuan Xu, Kaizhou Zhang, Ruoyan Jing, Lele Yang, Yanxi Liu, Kongming Liang, Zhanyu Ma
arxiv.org/abs/2508.08177

@arXiv_csCR_bot@mastoxiv.page
2025-10-14 11:47:28

MetaBreak: Jailbreaking Online LLM Services via Special Token Manipulation
Wentian Zhu, Zhen Xiang, Wei Niu, Le Guan
arxiv.org/abs/2510.10271

@arXiv_csRO_bot@mastoxiv.page
2025-09-12 09:37:39

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
Haozhan Li, Yuxin Zuo, Jiale Yu, Yuhao Zhang, Zhaohui Yang, Kaiyan Zhang, Xuekai Zhu, Yuchen Zhang, Tianxing Chen, Ganqu Cui, Dehui Wang, Dingxiang Luo, Yuchen Fan, Youbang Sun, Jia Zeng, Jiangmiao Pang, Shanghang Zhang, Yu Wang, Yao Mu, Bowen Zhou, Ning Ding
arxiv.org/a…

@Techmeme@techhub.social
2025-09-10 03:01:40

Memo: Apple's March update to its AI training guidelines for data annotators marked DEI as a "controversial" topic and removed intolerance as "harmful" behavior (Océane Herrero/Politico)
politico.eu/article/apple-teac

@arXiv_csIR_bot@mastoxiv.page
2025-08-11 09:36:39

Domain-Specific Fine-Tuning and Prompt-Based Learning: A Comparative Study for developing Natural Language-Based BIM Information Retrieval Systems
Han Gao, Timo Hartmann, Botao Zhong, Kai Lia, Hanbin Luo
arxiv.org/abs/2508.05676

@arXiv_eessIV_bot@mastoxiv.page
2025-08-12 17:47:26

Replaced article(s) found for eess.IV. arxiv.org/list/eess.IV/new
[1/2]:
- Accurate Measles Rash Detection via Vision Transformer Fine-Tuning
Qingguo Wang

@arXiv_csSE_bot@mastoxiv.page
2025-10-09 10:00:31

Prompt, Synthesize, Fine-Tune: A Secure Code Generation Recipe
Junjie Li, Fazle Rabbi, Bo Yang, Song Wang, Jinqiu Yang
arxiv.org/abs/2510.07189

@arXiv_csCL_bot@mastoxiv.page
2025-08-13 10:16:42

CPO: Addressing Reward Ambiguity in Role-playing Dialogue via Comparative Policy Optimization
Xinge Ye, Rui Wang, Yuchuan Wu, Victor Ma, Feiteng Fang, Fei Huang, Yongbin Li
arxiv.org/abs/2508.09074

@arXiv_csAI_bot@mastoxiv.page
2025-10-10 10:45:19

How to Teach Large Multimodal Models New Skills
Zhen Zhu, Yiming Gong, Yao Xiao, Yaoyao Liu, Derek Hoiem
arxiv.org/abs/2510.08564 arxiv.org…

@arXiv_csCV_bot@mastoxiv.page
2025-09-12 10:07:59

You Share Beliefs, I Adapt: Progressive Heterogeneous Collaborative Perception
Hao Si, Ehsan Javanmardi, Manabu Tsukada
arxiv.org/abs/2509.09310

@arXiv_csCR_bot@mastoxiv.page
2025-08-11 09:36:19

DMFI: Dual-Modality Fine-Tuning and Inference Framework for LLM-Based Insider Threat Detection
Kaichuan Kong, Dongjie Liu, Xiaobo Jin, Guanggang Geng, Zhiying Li, Jian Weng
arxiv.org/abs/2508.05694

@arXiv_csCL_bot@mastoxiv.page
2025-09-12 09:59:09

Bridging the Capability Gap: Joint Alignment Tuning for Harmonizing LLM-based Multi-Agent Systems
Minghang Zhu, Zhengliang Shi, Zhiwei Xu, Shiguang Wu, Lingjie Wang, Pengjie Ren, Zhaochun Ren, Zhumin Chen
arxiv.org/abs/2509.09629

@arXiv_csLG_bot@mastoxiv.page
2025-10-09 10:43:11

Enhancing Speech Emotion Recognition via Fine-Tuning Pre-Trained Models and Hyper-Parameter Optimisation
Aryan Golbaghi, Shuo Zhou
arxiv.org/abs/2510.07052

@arXiv_csSD_bot@mastoxiv.page
2025-09-10 08:31:41

When Fine-Tuning is Not Enough: Lessons from HSAD on Hybrid and Adversarial Audio Spoof Detection
Bin Hu, Kunyang Huang, Daehan Kwak, Meng Xu, Kuan Huang
arxiv.org/abs/2509.07323

@Techmeme@techhub.social
2025-10-01 18:19:29

Mira Murati's Thinking Machines Lab launches its first product, Tinker, which automates the creation of custom frontier AI models (Will Knight/Wired)
wired.com/story/thinking-machi

@arXiv_csCL_bot@mastoxiv.page
2025-08-13 10:12:52

A Survey on Training-free Alignment of Large Language Models
Birong Pan, Yongqi Li, Weiyu Zhang, Wenpeng Lu, Mayi Xu, Shen Zhou, Yuanyuan Zhu, Ming Zhong, Tieyun Qian
arxiv.org/abs/2508.09016

@arXiv_csCL_bot@mastoxiv.page
2025-10-13 14:30:10

Replaced article(s) found for cs.CL. arxiv.org/list/cs.CL/new
[1/4]:
- Privacy-Preserving Parameter-Efficient Fine-Tuning for Large Language Model Services
Yansong Li, Zhixing Tan, Paula Branco, Yang Liu

@arXiv_csLG_bot@mastoxiv.page
2025-09-12 10:09:49

Conditioning on PDE Parameters to Generalise Deep Learning Emulation of Stochastic and Chaotic Dynamics
Ira J. S. Shokar, Rich R. Kerswell, Peter H. Haynes
arxiv.org/abs/2509.09599

@arXiv_csRO_bot@mastoxiv.page
2025-09-11 09:37:43

Zero-Shot Metric Depth Estimation via Monocular Visual-Inertial Rescaling for Autonomous Aerial Navigation
Steven Yang, Xiaoyu Tian, Kshitij Goel, Wennie Tabib
arxiv.org/abs/2509.08159

@arXiv_csCL_bot@mastoxiv.page
2025-09-11 09:37:13

Low-Resource Fine-Tuning for Multi-Task Structured Information Extraction with a Billion-Parameter Instruction-Tuned Model
Yu Cheng Chih, Yong Hao Hou
arxiv.org/abs/2509.08381

@arXiv_csAI_bot@mastoxiv.page
2025-10-08 09:59:59

MetaVLA: Unified Meta Co-training For Efficient Embodied Adaption
Chen Li, Zhantao Yang, Han Zhang, Fangyi Chen, Chenchen Zhu, Anudeepsekhar Bolimera, Marios Savvides
arxiv.org/abs/2510.05580

@arXiv_csCL_bot@mastoxiv.page
2025-08-14 07:45:52

Leveraging Large Language Models for Rare Disease Named Entity Recognition
Nan Miles Xi, Yu Deng, Lin Wang
arxiv.org/abs/2508.09323 arxiv.o…

@arXiv_csCV_bot@mastoxiv.page
2025-08-11 10:17:39

Effective Training Data Synthesis for Improving MLLM Chart Understanding
Yuwei Yang, Zeyu Zhang, Yunzhong Hou, Zhuowan Li, Gaowen Liu, Ali Payani, Yuan-Sen Ting, Liang Zheng
arxiv.org/abs/2508.06492

@arXiv_csLG_bot@mastoxiv.page
2025-08-12 11:39:23

Surgical Knowledge Rewrite in Compact LLMs: An 'Unlearn-then-Learn' Strategy with ($IA^3$) for Localized Factual Modulation and Catastrophic Forgetting Mitigation
Stanley Ngugi
arxiv.org/abs/2508.07075

@arXiv_csCR_bot@mastoxiv.page
2025-10-06 09:47:19

Attack via Overfitting: 10-shot Benign Fine-tuning to Jailbreak LLMs
Zhixin Xie, Xurui Song, Jun Luo
arxiv.org/abs/2510.02833 arxiv.org/pdf…

@arXiv_csCL_bot@mastoxiv.page
2025-08-14 09:50:22

Assessing the Feasibility of Lightweight Whisper Models for Low-Resource Urdu Transcription
Abdul Rehman Antall, Naveed Akhtar
arxiv.org/abs/2508.09865

@arXiv_csCV_bot@mastoxiv.page
2025-09-08 09:49:50

Leveraging Transfer Learning and Mobile-enabled Convolutional Neural Networks for Improved Arabic Handwritten Character Recognition
Mohsine El Khayati, Ayyad Maafiri, Yassine Himeur, Hamzah Ali Alkhazaleh, Shadi Atalla, Wathiq Mansoor
arxiv.org/abs/2509.05019

@arXiv_csCL_bot@mastoxiv.page
2025-09-09 12:09:22

UNH at CheckThat! 2025: Fine-tuning Vs Prompting in Claim Extraction
Joe Wilder, Nikhil Kadapala, Benji Xu, Mohammed Alsaadi, Aiden Parsons, Mitchell Rogers, Palash Agarwal, Adam Hassick, Laura Dietz
arxiv.org/abs/2509.06883

@arXiv_csLG_bot@mastoxiv.page
2025-09-10 10:22:31

Systematic Optimization of Open Source Large Language Models for Mathematical Reasoning
Pranav Pawar, Dhwaj Jain, Varun Gupta, Kaustav Dedhia, Dashrath Kale, Sudhir Dhekane
arxiv.org/abs/2509.07238

@arXiv_csCL_bot@mastoxiv.page
2025-09-11 08:09:13

MVPBench: A Benchmark and Fine-Tuning Framework for Aligning Large Language Models with Diverse Human Values
Yao Liang, Dongcheng Zhao, Feifei Zhao, Guobin Shen, Yuwei Wang, Dongqi Liang, Yi Zeng
arxiv.org/abs/2509.08022

@arXiv_csCV_bot@mastoxiv.page
2025-09-01 09:43:42

Federated Fine-tuning of SAM-Med3D for MRI-based Dementia Classification
Kaouther Mouheb, Marawan Elbatel, Janne Papma, Geert Jan Biessels, Jurgen Claassen, Huub Middelkoop, Barbara van Munster, Wiesje van der Flier, Inez Ramakers, Stefan Klein, Esther E. Bron
arxiv.org/abs/2508.21458

@arXiv_csLG_bot@mastoxiv.page
2025-10-10 11:10:19

FlyLoRA: Boosting Task Decoupling and Parameter Efficiency via Implicit Rank-Wise Mixture-of-Experts
Heming Zou, Yunliang Zang, Wutong Xu, Yao Zhu, Xiangyang Ji
arxiv.org/abs/2510.08396

@arXiv_csCL_bot@mastoxiv.page
2025-09-08 10:11:00

A Study of Large Language Models for Patient Information Extraction: Model Architecture, Fine-Tuning Strategy, and Multi-task Instruction Tuning
Cheng Peng, Xinyu Dong, Mengxian Lyu, Daniel Paredes, Yaoyun Zhang, Yonghui Wu
arxiv.org/abs/2509.04753

@arXiv_csCR_bot@mastoxiv.page
2025-09-01 08:42:32

zkLoRA: Fine-Tuning Large Language Models with Verifiable Security via Zero-Knowledge Proofs
Guofu Liao, Taotao Wang, Shengli Zhang, Jiqun Zhang, Shi Long, Dacheng Tao
arxiv.org/abs/2508.21393

@arXiv_csLG_bot@mastoxiv.page
2025-10-08 10:29:39

AMAQ: Adaptive Mixed-bit Activation Quantization for Collaborative Parameter Efficient Fine-tuning
Yurun Song, Zhuoyi Yang, Ian G. Harris, Sangeetha Abdu Jyothi
arxiv.org/abs/2510.05468

@arXiv_csCL_bot@mastoxiv.page
2025-10-07 12:18:02

Resource-Efficient Fine-Tuning of LLaMA-3.2-3B for Medical Chain-of-Thought Reasoning
Imran Mansha
arxiv.org/abs/2510.05003 arxiv.org/pdf/2…

@arXiv_csCV_bot@mastoxiv.page
2025-10-06 10:05:39

Multimodal Carotid Risk Stratification with Large Vision-Language Models: Benchmarking, Fine-Tuning, and Clinical Insights
Daphne Tsolissou, Theofanis Ganitidis, Konstantinos Mitsis, Stergios CHristodoulidis, Maria Vakalopoulou, Konstantina Nikita
arxiv.org/abs/2510.02922

@arXiv_csCL_bot@mastoxiv.page
2025-09-11 07:41:33

AntiDote: Bi-level Adversarial Training for Tamper-Resistant LLMs
Debdeep Sanyal, Manodeep Ray, Murari Mandal
arxiv.org/abs/2509.08000 arxi…

@arXiv_csLG_bot@mastoxiv.page
2025-09-11 10:14:13

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing
Jeffrey Amico, Gabriel Passamani Andrade, John Donaghy, Ben Fielding, Tristin Forbus, Harry Grieve, Semih Kara, Jari Kolehmainen, Yihua Lou, Christopher Nies, Edward Phillip Flores Nu\~no, Diogo Ortega, Shikhar Rastogi, Austin Virts, Matthew J. Wright

@arXiv_csCR_bot@mastoxiv.page
2025-10-03 07:34:30

Fine-Tuning Jailbreaks under Highly Constrained Black-Box Settings: A Three-Pronged Approach
Xiangfang Li, Yu Wang, Bo Li
arxiv.org/abs/2510.01342

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 11:15:59

SliceFine: The Universal Winning-Slice Hypothesis for Pretrained Networks
Md Kowsher, Ali O. Polat, Ehsan Mohammady Ardehaly, Mehrdad Salehi, Zia Ghiasi, Prasanth Murali, Chen Chen
arxiv.org/abs/2510.08513

@arXiv_csCL_bot@mastoxiv.page
2025-09-08 10:12:50

L1RA: Dynamic Rank Assignment in LoRA Fine-Tuning
Raul Singh, Nicolo Brunello, Vincenzo Scotti, Mark James Carman
arxiv.org/abs/2509.04884

@arXiv_csLG_bot@mastoxiv.page
2025-09-11 10:15:03

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
Zhiheng Xi, Jixuan Huang, Chenyang Liao, Baodai Huang, Honglin Guo, Jiaqi Liu, Rui Zheng, Junjie Ye, Jiazheng Zhang, Wenxiang Chen, Wei He, Yiwen Ding, Guanyu Li, Zehui Chen, Zhengyin Du, Xuesong Yao, Yufei Xu, Jiecao Chen, Tao Gui, Zuxuan Wu, Qi Zhang, Xuanjing Huang, Yu-Gang Jiang

@arXiv_csCL_bot@mastoxiv.page
2025-09-12 07:41:09

Noise or Nuance: An Investigation Into Useful Information and Filtering For LLM Driven AKBC
Alex Clay, Ernesto Jim\'enez-Ruiz, Pranava Madhyastha
arxiv.org/abs/2509.08903

@arXiv_csCL_bot@mastoxiv.page
2025-09-10 10:24:31

Are LLMs Enough for Hyperpartisan, Fake, Polarized and Harmful Content Detection? Evaluating In-Context Learning vs. Fine-Tuning
Michele Joshua Maggini, Dhia Merzougui, Rabiraj Bandyopadhyay, Ga\"el Dias, Fabrice Maurel, Pablo Gamallo
arxiv.org/abs/2509.07768

@arXiv_csLG_bot@mastoxiv.page
2025-10-09 10:50:21

A Multi-Agent Framework for Stateful Inference-Time Search
Arshika Lalan, Rajat Ghosh, Aditya Kolsur, Debojyoti Dutta
arxiv.org/abs/2510.07147

@arXiv_csCL_bot@mastoxiv.page
2025-09-12 09:01:59

Improving LLM Safety and Helpfulness using SFT and DPO: A Study on OPT-350M
Piyush Pant
arxiv.org/abs/2509.09055 arxiv.org/pdf/2509.09055…

@arXiv_csLG_bot@mastoxiv.page
2025-09-05 10:25:21

RL's Razor: Why Online Reinforcement Learning Forgets Less
Idan Shenfeld, Jyothish Pari, Pulkit Agrawal
arxiv.org/abs/2509.04259 arxiv.…

@arXiv_csCL_bot@mastoxiv.page
2025-09-10 10:30:01

From Detection to Mitigation: Addressing Gender Bias in Chinese Texts via Efficient Tuning and Voting-Based Rebalancing
Chengyan Wu, Yiqiang Cai, Yufei Cheng, Yun Xue
arxiv.org/abs/2509.07889

@arXiv_csCL_bot@mastoxiv.page
2025-08-11 10:02:49

Learning the Topic, Not the Language: How LLMs Classify Online Immigration Discourse Across Languages
Andrea Nasuto, Stefano Maria Iacus, Francisco Rowe, Devika Jain
arxiv.org/abs/2508.06435

@arXiv_csLG_bot@mastoxiv.page
2025-10-02 11:10:31

Sample-Efficient Differentially Private Fine-Tuning via Gradient Matrix Denoising
Ali Dadsetan, Frank Rudzicz
arxiv.org/abs/2510.01137 arxi…

@arXiv_csCL_bot@mastoxiv.page
2025-09-10 08:51:41

Towards EnergyGPT: A Large Language Model Specialized for the Energy Sector
Amal Chebbi, Babajide Kolade
arxiv.org/abs/2509.07177 arxiv.org…

@arXiv_csLG_bot@mastoxiv.page
2025-09-30 14:45:31

TR2-D2: Tree Search Guided Trajectory-Aware Fine-Tuning for Discrete Diffusion
Sophia Tang, Yuchen Zhu, Molei Tao, Pranam Chatterjee
arxiv.org/abs/2509.25171

@arXiv_csCL_bot@mastoxiv.page
2025-09-12 09:53:49

DeMeVa at LeWiDi-2025: Modeling Perspectives with In-Context Learning and Label Distribution Learning
Daniil Ignatev, Nan Li, Hugh Mee Wong, Anh Dang, Shane Kaszefski Yaschuk
arxiv.org/abs/2509.09524

@arXiv_csLG_bot@mastoxiv.page
2025-10-08 10:58:29

Influence Functions for Efficient Data Selection in Reasoning
Prateek Humane, Paolo Cudrano, Daniel Z. Kaplan, Matteo Matteucci, Supriyo Chakraborty, Irina Rish
arxiv.org/abs/2510.06108

@arXiv_csCL_bot@mastoxiv.page
2025-09-09 12:05:52

Anchoring Refusal Direction: Mitigating Safety Risks in Tuning via Projection Constraint
Yanrui Du, Fenglei Fan, Sendong Zhao, Jiawei Cao, Qika Lin, Kai He, Ting Liu, Bing Qin, Mengling Feng
arxiv.org/abs/2509.06795

@arXiv_csCL_bot@mastoxiv.page
2025-09-10 09:53:51

Does This Look Familiar to You? Knowledge Analysis via Model Internal Representations
Sihyun Park
arxiv.org/abs/2509.07311 arxiv.org/pdf/25…

@arXiv_csCL_bot@mastoxiv.page
2025-09-01 09:48:52

Not All Parameters Are Created Equal: Smart Isolation Boosts Fine-Tuning Performance
Yao Wang, Di Liang, Minlong Peng
arxiv.org/abs/2508.21741

@arXiv_csCL_bot@mastoxiv.page
2025-09-05 10:07:41

SPFT-SQL: Enhancing Large Language Model for Text-to-SQL Parsing by Self-Play Fine-Tuning
Yuhao Zhang, Shaoming Duan, Jinhang Su, Chuanyi Liu, Peiyi Han
arxiv.org/abs/2509.03937

@arXiv_csCL_bot@mastoxiv.page
2025-09-10 10:28:31

Small Open Models Achieve Near Parity with Large Models in Low Resource Literary Translation at a Fraction of the Cost
Mihai Nadas, Laura Diosan, Andreea Tomescu, Andrei Piscoran
arxiv.org/abs/2509.07829

@arXiv_csCL_bot@mastoxiv.page
2025-09-10 10:19:11

ALLabel: Three-stage Active Learning for LLM-based Entity Recognition using Demonstration Retrieval
Zihan Chen, Lei Shi, Weize Wu, Qiji Zhou, Yue Zhang
arxiv.org/abs/2509.07512

@arXiv_csCL_bot@mastoxiv.page
2025-08-11 10:04:19

Post-training for Efficient Communication via Convention Formation
Yilun Hua, Evan Wang, Yoav Artzi
arxiv.org/abs/2508.06482 arxiv.org/pdf/…

@arXiv_csCL_bot@mastoxiv.page
2025-09-09 11:50:02

MSLEF: Multi-Segment LLM Ensemble Finetuning in Recruitment
Omar Walid, Mohamed T. Younes, Khaled Shaban, Mai Hassan, Ali Hamdi
arxiv.org/abs/2509.06200

@arXiv_csCL_bot@mastoxiv.page
2025-09-22 10:24:11

BEFT: Bias-Efficient Fine-Tuning of Language Models
Baichuan Huang, Ananth Balashankar, Amir Aminifar
arxiv.org/abs/2509.15974 arxiv.org/pd…

@arXiv_csCL_bot@mastoxiv.page
2025-10-07 12:03:32

TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA
Chanjoo Jung, Jaehyung Kim
arxiv.org/abs/2510.04682 arxiv.o…