Tootfinder

Opt-in global Mastodon full text search.

@arXiv_csCL_bot@mastoxiv.page
2025-06-10 18:54:40

This arxiv.org/abs/2505.19912 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCL_…

@arXiv_csRO_bot@mastoxiv.page
2025-06-11 08:15:45

TGRPO: Fine-tuning Vision-Language-Action Model via Trajectory-wise Group Relative Policy Optimization
Zengjue Chen, Runliang Niu, He Kong, Qi Wang
arxiv.org/abs/2506.08440

@arXiv_csCR_bot@mastoxiv.page
2025-07-11 09:11:11

May I have your Attention? Breaking Fine-Tuning based Prompt Injection Defenses using Architecture-Aware Attacks
Nishit V. Pandya, Andrey Labunets, Sicun Gao, Earlence Fernandes
arxiv.org/abs/2507.07417

@arXiv_physicscompph_bot@mastoxiv.page
2025-06-10 09:58:03

A Study on the Fine-Tuning Performance of Universal Machine-Learned Interatomic Potentials (U-MLIPs)
Xiaoqing Liu, Kehan Zeng, Yangshuai Wang, Teng Zhao
arxiv.org/abs/2506.07401

@arXiv_csLG_bot@mastoxiv.page
2025-06-10 19:21:55

This arxiv.org/abs/2506.02308 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_csGR_bot@mastoxiv.page
2025-08-11 07:34:19

DogFit: Domain-guided Fine-tuning for Efficient Transfer Learning of Diffusion Models
Yara Bahram, Mohammadhadi Shateri, Eric Granger
arxiv.org/abs/2508.05685

@arXiv_csIR_bot@mastoxiv.page
2025-06-11 07:41:03

Reinforcement Fine-Tuning for Reasoning towards Multi-Step Multi-Source Search in Large Language Models
Wentao Shi, Yiqing Shen
arxiv.org/abs/2506.08352

@arXiv_csDC_bot@mastoxiv.page
2025-07-08 07:51:00

Symbiosis: Multi-Adapter Inference and Fine-Tuning
Saransh Gupta, Umesh Deshpande, Travis Janssen, Swami Sundararaman
arxiv.org/abs/2507.03220

@arXiv_csCY_bot@mastoxiv.page
2025-07-09 08:03:32

Narrowing the Gap: Supervised Fine-Tuning of Open-Source LLMs as a Viable Alternative to Proprietary Models for Pedagogical Tools
Lorenzo Lee Solano, Charles Koutcheme, Juho Leinonen, Alexandra Vassar, Jake Renzella
arxiv.org/abs/2507.05305

@arXiv_csCR_bot@mastoxiv.page
2025-08-11 09:36:19

DMFI: Dual-Modality Fine-Tuning and Inference Framework for LLM-Based Insider Threat Detection
Kaichuan Kong, Dongjie Liu, Xiaobo Jin, Guanggang Geng, Zhiying Li, Jian Weng
arxiv.org/abs/2508.05694

@arXiv_csLG_bot@mastoxiv.page
2025-06-09 10:06:32

Mitigating Catastrophic Forgetting with Adaptive Transformer Block Expansion in Federated Fine-Tuning
Yujia Huo, Jianchun Liu, Hongli Xu, Zhenguo Ma, Shilong Wang, Liusheng Huang
arxiv.org/abs/2506.05977

@arXiv_csCL_bot@mastoxiv.page
2025-07-11 09:55:31

KeyKnowledgeRAG (K^2RAG): An Enhanced RAG method for improved LLM question-answering capabilities
Hruday Markondapatnaikuni, Basem Suleiman, Abdelkarim Erradi, Shijing Chen
arxiv.org/abs/2507.07695

@arXiv_csCV_bot@mastoxiv.page
2025-07-10 08:54:01

Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning
Ziyang Wang, Jaehong Yoon, Shoubin Yu, Md Mohaiminul Islam, Gedas Bertasius, Mohit Bansal
arxiv.org/abs/2507.06485

@arXiv_csAI_bot@mastoxiv.page
2025-07-11 08:59:31

DrugMCTS: a drug repurposing framework combining multi-agent, RAG and Monte Carlo Tree Search
Zerui Yang, Yuwei Wan, Yinqiao Li, Yudai Matsuda, Tong Xie, Linqi Song
arxiv.org/abs/2507.07426

@arXiv_csGR_bot@mastoxiv.page
2025-06-10 07:48:32

Noise Consistency Regularization for Improved Subject-Driven Image Synthesis
Yao Ni, Song Wen, Piotr Koniusz, Anoop Cherian
arxiv.org/abs/2506.06483

@arXiv_csSE_bot@mastoxiv.page
2025-06-10 16:49:19

This arxiv.org/abs/2408.09568 has been replaced.
initial toot: mastoxiv.page/@arXiv_csSE_…

@arXiv_eessIV_bot@mastoxiv.page
2025-07-11 09:03:21

Label-Efficient Chest X-ray Diagnosis via Partial CLIP Adaptation
Heet Nitinkumar Dalsania
arxiv.org/abs/2507.07254 a…

@arXiv_csLG_bot@mastoxiv.page
2025-06-10 19:23:57

This arxiv.org/abs/2506.06105 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_eessAS_bot@mastoxiv.page
2025-06-09 08:01:22

Low-Resource Domain Adaptation for Speech LLMs via Text-Only Fine-Tuning
Yangui Fang, Jing Peng, Xu Li, Yu Xi, Chengwei Zhang, Guohui Zhong, Kai Yu
arxiv.org/abs/2506.05671

@arXiv_csNE_bot@mastoxiv.page
2025-06-11 07:45:54

Efficient Fireworks Algorithm Equipped with an Explosion Mechanism based on Student's T-distribution
Cen Shipeng, Tan Ying
arxiv.org/abs/2506.08484

@arXiv_csIR_bot@mastoxiv.page
2025-06-10 16:35:59

This arxiv.org/abs/2504.04178 has been replaced.
initial toot: mastoxiv.page/@arXiv_csIR_…

@arXiv_csCV_bot@mastoxiv.page
2025-07-10 10:15:51

Integrating Pathology Foundation Models and Spatial Transcriptomics for Cellular Decomposition from Histology Images
Yutong Sun, Sichen Zhu, Peng Qiu
arxiv.org/abs/2507.07013

@arXiv_hepph_bot@mastoxiv.page
2025-07-09 08:49:42

Impact of First-order Electroweak Phase Transition on QCD Axion
Dipendu Bhandari, Soumen Kumar Manna, Arunansu Sil
arxiv.org/abs/2507.05353

@arXiv_csCL_bot@mastoxiv.page
2025-08-11 10:02:49

Learning the Topic, Not the Language: How LLMs Classify Online Immigration Discourse Across Languages
Andrea Nasuto, Stefano Maria Iacus, Francisco Rowe, Devika Jain
arxiv.org/abs/2508.06435

@arXiv_csHC_bot@mastoxiv.page
2025-08-07 07:34:03

MagicGUI: A Foundational Mobile GUI Agent with Scalable Data Pipeline and Reinforcement Fine-tuning
Liujian Tang, Shaokang Dong, Yijia Huang, Minqi Xiang, Hongtao Ruan, Bin Wang, Shuo Li, Zhihui Cao, Hailiang Pang, Heng Kong, He Yang, Mingxu Chai, Zhilin Gao, Xingyu Liu, Yingnan Fu, Jiaming Liu, Tao Gui, Xuanjing Huang, Yu-Gang Jiang, Qi Zhang, Kang Wang, Yunke Zhang, Yuran Wang

@arXiv_csCR_bot@mastoxiv.page
2025-07-09 09:28:02

TuneShield: Mitigating Toxicity in Conversational AI while Fine-tuning on Untrusted Data
Aravind Cheruvu, Shravya Kanchi, Sifat Muhammad Abdullah, Nicholas Kong, Daphne Yao, Murtuza Jadliwala, Bimal Viswanath
arxiv.org/abs/2507.05660

@bmariusz@techhub.social
2025-06-11 14:45:25

Day 7
✅ 24 test suites, 153 tests passing.
Solid coverage across service and controller layers in my modular monorepo. Strict typing (TypeScript), full DTO validation, and realistic mocks across complex relations (TypeORM).
Next: fine-tuning error handling & exploring e2e strategies.
write.tyolabs.com/?p=25

@arXiv_csDC_bot@mastoxiv.page
2025-07-10 12:40:38

Replaced article(s) found for cs.DC. arxiv.org/list/cs.DC/new
[1/1]:
- Fine-tuning Multimodal Transformers on Edge: A Parallel Split Learning Approach
Timo Fudala, Vasileios Tsouvalas, Nirvana Meratnia

@arXiv_csRO_bot@mastoxiv.page
2025-08-05 11:45:31

CO-RFT: Efficient Fine-Tuning of Vision-Language-Action Models through Chunked Offline Reinforcement Learning
Dongchi Huang, Zhirui Fang, Tianle Zhang, Yihang Li, Lin Zhao, Chunhe Xia
arxiv.org/abs/2508.02219

@arXiv_mathOC_bot@mastoxiv.page
2025-07-09 08:29:12

On the Inherent Privacy of Zeroth Order Projected Gradient Descent
Devansh Gupta, Meisam Razaviyayn, Vatsal Sharan
arxiv.org/abs/2507.05610

@arXiv_csCY_bot@mastoxiv.page
2025-06-10 07:40:12

Position: Simulating Society Requires Simulating Thought
Chance Jiajie Li, Jiayi Wu, Zhenze Mo, Ao Qu, Yuhan Tang, Kaiya Ivy Zhao, Yulu Gan, Jie Fan, Jiangbo Yu, Jinhua Zhao, Paul Liang, Luis Alonso, Kent Larson
arxiv.org/abs/2506.06958

@arXiv_eessSP_bot@mastoxiv.page
2025-06-03 07:28:42

Movable Antenna Enhanced Federated Fine-Tuning of Large Language Models via Hybrid Client Selection Optimization
Yang Zhao, Yue Xiu, Chengxiao Dai, Ning Wei, Dusit Niyato
arxiv.org/abs/2506.00011

@arXiv_csSD_bot@mastoxiv.page
2025-07-08 09:58:00

CLEP-DG: Contrastive Learning for Speech Emotion Domain Generalization via Soft Prompt Tuning
Jiacheng Shi, Yanfu Zhang, Ye Gao
arxiv.org/abs/2507.04048

@arXiv_physicsfludyn_bot@mastoxiv.page
2025-08-05 08:57:10

Fine-tuning physics-informed neural networks for cavity flows using coordinate transformation
Ryuta Takao, Satoshi Ii
arxiv.org/abs/2508.01122

@arXiv_csCE_bot@mastoxiv.page
2025-08-04 08:51:31

Online Fine-Tuning of Carbon Emission Predictions using Real-Time Recurrent Learning for State Space Models
Julian Lemmel, Manuel Kranzl, Adam Lamine, Philipp Neubauer, Radu Grosu, Sophie Neubauer
arxiv.org/abs/2508.00804

@arXiv_csCR_bot@mastoxiv.page
2025-06-06 07:17:06

Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets
Lei Hsiung, Tianyu Pang, Yung-Chen Tang, Linyue Song, Tsung-Yi Ho, Pin-Yu Chen, Yaoqing Yang
arxiv.org/abs/2506.05346

@arXiv_csLG_bot@mastoxiv.page
2025-06-10 19:23:11

This arxiv.org/abs/2506.05673 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_eessIV_bot@mastoxiv.page
2025-06-06 07:22:21

Gradient Inversion Attacks on Parameter-Efficient Fine-Tuning
Hasin Us Sami, Swapneel Sen, Amit K. Roy-Chowdhury, Srikanth V. Krishnamurthy, Basak Guler
arxiv.org/abs/2506.04453

@ErikJonker@mastodon.social
2025-07-05 16:53:00

A foundation model to predict and capture human cognition.
nature.com/articles/s41586-025
And the reaction and criticism,

@newsie@darktundra.xyz
2025-06-27 14:11:24

Fine-Tuning LLMs For ‘Good’ Behavior Makes Them More Likely To Say No 404media.co/fine-tuning-llms-c

@arXiv_csCL_bot@mastoxiv.page
2025-07-08 13:42:01

GradOT: Training-free Gradient-preserving Offsite-tuning for Large Language Models
Kai Yao, Zhaorui Tan, Penglei Gao, Lichun Li, Kaixin Wu, Yinggui Wang, Yuan Zhao, Yixin Ji, Wei Wang, Jianke Zhu
arxiv.org/abs/2507.04455

@arXiv_csCV_bot@mastoxiv.page
2025-07-08 14:29:31

Hear-Your-Click: Interactive Video-to-Audio Generation via Object-aware Contrastive Audio-Visual Fine-tuning
Yingshan Liang, Keyu Fan, Zhicheng Du, Yiran Wang, Qingyang Shi, Xinyu Zhang, Jiasheng Lu, Peiwu Qin
arxiv.org/abs/2507.04959

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 08:08:56

Understanding Overadaptation in Supervised Fine-Tuning: The Role of Ensemble Methods
Yifan Hao, Xingyuan Pan, Hanning Zhang, Chenlu Ye, Rui Pan, Tong Zhang
arxiv.org/abs/2506.01901

@arXiv_csIR_bot@mastoxiv.page
2025-06-10 16:44:49

This arxiv.org/abs/2506.02916 has been replaced.
initial toot: mastoxiv.page/@arXiv_csIR_…

@arXiv_csSE_bot@mastoxiv.page
2025-06-05 07:23:54

VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation
Yuansheng Ni, Ping Nie, Kai Zou, Xiang Yue, Wenhu Chen
arxiv.org/abs/2506.03930

@arXiv_csLG_bot@mastoxiv.page
2025-06-10 19:18:37

This arxiv.org/abs/2505.01997 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_csDC_bot@mastoxiv.page
2025-07-08 08:07:20

Analysis and Optimized CXL-Attached Memory Allocation for Long-Context LLM Fine-Tuning
Yong-Cheng Liaw, Shuo-Han Chen
arxiv.org/abs/2507.03305

@arXiv_csCL_bot@mastoxiv.page
2025-06-10 18:56:30

This arxiv.org/abs/2505.22942 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCL_…

@arXiv_csCR_bot@mastoxiv.page
2025-06-09 07:49:22

FedShield-LLM: A Secure and Scalable Federated Fine-Tuned Large Language Model
Md Jueal Mia, M. Hadi Amini
arxiv.org/abs/2506.05640

@arXiv_eessAS_bot@mastoxiv.page
2025-06-11 08:00:05

Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs
Šimon Sedláček, Bolaji Yusuf, Ján Švec, Pradyoth Hegde, Santosh Kesiraju, Oldřich Plchot, Jan Černocký
arxiv.org/abs/2506.08633

@arXiv_csLG_bot@mastoxiv.page
2025-06-10 19:21:11

This arxiv.org/abs/2505.23868 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_csSD_bot@mastoxiv.page
2025-07-08 10:45:31

TTS-CtrlNet: Time varying emotion aligned text-to-speech generation with ControlNet
Jaeseok Jeong, Yuna Lee, Mingi Kwon, Youngjung Uh
arxiv.org/abs/2507.04349

@arXiv_csCL_bot@mastoxiv.page
2025-08-11 10:04:19

Post-training for Efficient Communication via Convention Formation
Yilun Hua, Evan Wang, Yoav Artzi
arxiv.org/abs/2508.06482 arxiv.org/pdf/…

@arXiv_csLG_bot@mastoxiv.page
2025-06-10 19:17:43

This arxiv.org/abs/2505.00347 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_csCR_bot@mastoxiv.page
2025-06-11 07:30:33

Your Agent Can Defend Itself against Backdoor Attacks
Li Changjiang, Liang Jiacheng, Cao Bochuan, Chen Jinghui, Wang Ting
arxiv.org/abs/2506.08336

@arXiv_csSE_bot@mastoxiv.page
2025-07-09 09:36:02

Multi-Agent Debate Strategies to Enhance Requirements Engineering with Large Language Models
Marc Oriol, Quim Motger, Jordi Marco, Xavier Franch
arxiv.org/abs/2507.05981

@arXiv_csHC_bot@mastoxiv.page
2025-07-04 09:43:21

Are You Listening to Me? Fine-Tuning Chatbots for Empathetic Dialogue
Paulo Ricardo Knob, Leonardo Scholler, Juliano Rigatti, Soraia Raupp Musse
arxiv.org/abs/2507.02537

@arXiv_csCL_bot@mastoxiv.page
2025-07-10 09:48:31

Enhancing Food-Domain Question Answering with a Multimodal Knowledge Graph: Hybrid QA Generation and Diversity Analysis
Srihari K B, Pushpak Bhattacharyya
arxiv.org/abs/2507.06571

@arXiv_csLG_bot@mastoxiv.page
2025-06-10 19:21:44

This arxiv.org/abs/2506.01790 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_csRO_bot@mastoxiv.page
2025-08-06 10:14:50

DiWA: Diffusion Policy Adaptation with World Models
Akshay L Chandra, Iman Nematollahi, Chenguang Huang, Tim Welschehold, Wolfram Burgard, Abhinav Valada
arxiv.org/abs/2508.03645

@arXiv_csCR_bot@mastoxiv.page
2025-06-10 16:38:09

This arxiv.org/abs/2506.05394 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCR_…

@arXiv_eessAS_bot@mastoxiv.page
2025-06-10 16:49:59

This arxiv.org/abs/2410.00527 has been replaced.
initial toot: mastoxiv.page/@arXiv_ees…

@arXiv_csDC_bot@mastoxiv.page
2025-06-04 07:45:04

Memory-Efficient Split Federated Learning for LLM Fine-Tuning on Heterogeneous Mobile Devices
Xiaopei Chen, Liang Li, Fei Ji, Wen Wu
arxiv.org/abs/2506.02940

@arXiv_csLG_bot@mastoxiv.page
2025-07-10 14:09:52

Replaced article(s) found for cs.LG. arxiv.org/list/cs.LG/new
[4/5]:
- Fine-tuning Multimodal Transformers on Edge: A Parallel Split Learning Approach
Timo Fudala, Vasileios Tsouvalas, Nirvana Meratnia

@arXiv_csCL_bot@mastoxiv.page
2025-07-10 09:57:21

Checklist Engineering Empowers Multilingual LLM Judges
Mohammad Ghiasvand Mohammadkhani, Hamid Beigy
arxiv.org/abs/2507.06774

@arXiv_csCV_bot@mastoxiv.page
2025-06-04 14:59:51

This arxiv.org/abs/2505.21920 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCV_…

@arXiv_csSD_bot@mastoxiv.page
2025-08-06 08:34:30

Fine-Tuning Text-to-Speech Diffusion Models Using Reinforcement Learning with Human Feedback
Jingyi Chen, Ju Seung Byun, Micha Elsner, Pichao Wang, Andrew Perrault
arxiv.org/abs/2508.03123

@arXiv_csCY_bot@mastoxiv.page
2025-06-03 07:20:20

SafeCOMM: What about Safety Alignment in Fine-Tuned Telecom Large Language Models?
Aladin Djuhera, Swanand Ravindra Kadhe, Farhan Ahmed, Syed Zawad, Holger Boche, Walid Saad
arxiv.org/abs/2506.00062

@arXiv_csLG_bot@mastoxiv.page
2025-06-09 10:10:12

Text-to-LoRA: Instant Transformer Adaption
Rujikorn Charakorn, Edoardo Cetin, Yujin Tang, Robert Tjarko Lange
arxiv.org/abs/2506.06105

@arXiv_csCR_bot@mastoxiv.page
2025-06-11 07:35:43

WGLE: Backdoor-free and Multi-bit Black-box Watermarking for Graph Neural Networks
Tingzhi Li, Xuefeng Liu
arxiv.org/abs/2506.08602

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 07:17:29

Whispers of Many Shores: Cultural Alignment through Collaborative Cultural Expertise
Shuai Feng, Wei-Chuang Chan, Srishti Chouhan, Junior Francisco Garcia Ayala, Srujananjali Medicherla, Kyle Clark, Mingwei Shi
arxiv.org/abs/2506.00242

@arXiv_csLG_bot@mastoxiv.page
2025-07-11 10:23:41

EXPO: Stable Reinforcement Learning with Expressive Policies
Perry Dong, Qiyang Li, Dorsa Sadigh, Chelsea Finn
arxiv.org/abs/2507.07986 arxiv.org/pdf/2507.07986 arxiv.org/html/2507.07986
arXiv:2507.07986v1 Announce Type: new
Abstract: We study the problem of training and fine-tuning expressive policies with online reinforcement learning (RL) given an offline dataset. Training expressive policy classes with online RL presents a unique challenge of stable value maximization. Unlike simpler Gaussian policies commonly used in online RL, expressive policies like diffusion and flow-matching policies are parameterized by a long denoising chain, which hinders stable gradient propagation from actions to policy parameters when optimizing against some value function. Our key insight is that we can address stable value maximization by avoiding direct optimization over value with the expressive policy and instead constructing an on-the-fly RL policy to maximize Q-value. We propose Expressive Policy Optimization (EXPO), a sample-efficient online RL algorithm that utilizes an on-the-fly policy to maximize value with two parameterized policies: a larger expressive base policy trained with a stable imitation learning objective and a lightweight Gaussian edit policy that edits the actions sampled from the base policy toward a higher value distribution. The on-the-fly policy optimizes the actions from the base policy with the learned edit policy and chooses the value-maximizing action from the base and edited actions for both sampling and temporal-difference (TD) backup. Our approach yields up to 2-3x improvement in sample efficiency on average over prior methods both in the setting of fine-tuning a pretrained policy given offline data and in leveraging offline data to train online.
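The action-selection step described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the names `on_the_fly_action` and `td_target`, the callable signatures, the `num_samples` parameter, and the assumption that the edit policy outputs an additive correction are all hypothetical choices made here for clarity.

```python
from typing import Callable, List, Sequence

Action = List[float]
State = Sequence[float]


def on_the_fly_action(
    state: State,
    base_policy: Callable[[State], Action],          # hypothetical: samples an action
    edit_policy: Callable[[State, Action], Action],  # hypothetical: returns an additive edit
    q_value: Callable[[State, Action], float],       # hypothetical: learned critic Q(s, a)
    num_samples: int = 4,
) -> Action:
    """Return the highest-Q action among base samples and their edited versions."""
    candidates: List[Action] = []
    for _ in range(num_samples):
        base_action = base_policy(state)
        # Assumption: the edit is added element-wise to the base action.
        delta = edit_policy(state, base_action)
        edited_action = [a + d for a, d in zip(base_action, delta)]
        candidates.append(base_action)
        candidates.append(edited_action)
    # Choose the value-maximizing candidate; no gradient flows through the base policy.
    return max(candidates, key=lambda action: q_value(state, action))


def td_target(
    reward: float,
    next_state: State,
    base_policy: Callable[[State], Action],
    edit_policy: Callable[[State, Action], Action],
    q_value: Callable[[State, Action], float],
    gamma: float = 0.99,
    num_samples: int = 4,
) -> float:
    """One-step TD target that reuses the same on-the-fly selection for the backup."""
    next_action = on_the_fly_action(
        next_state, base_policy, edit_policy, q_value, num_samples
    )
    return reward + gamma * q_value(next_state, next_action)
```

In this sketch the expressive base policy is only ever sampled, which mirrors the abstract's point about avoiding direct value optimization through a long denoising chain; how the base and edit policies are actually trained is outside what this example shows.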

@arXiv_csIR_bot@mastoxiv.page
2025-07-09 09:27:32

RecRankerEval: A Flexible and Extensible Framework for Top-k LLM-based Recommendation
Zeyuan Meng, Zixuan Yi, Iadh Ounis
arxiv.org/abs/2507.05880

@arXiv_csDC_bot@mastoxiv.page
2025-06-04 07:22:42

EcoLoRA: Communication-Efficient Federated Fine-Tuning of Large Language Models
Han Liu, Ruoyao Wen, Srijith Nair, Jia Liu, Wenjing Lou, Chongjie Zhang, William Yeoh, Yevgeniy Vorobeychik, Ning Zhang
arxiv.org/abs/2506.02001

@arXiv_csSD_bot@mastoxiv.page
2025-07-08 11:29:50

EXPOTION: Facial Expression and Motion Control for Multimodal Music Generation
Fathinah Izzati, Xinyue Li, Gus Xia
arxiv.org/abs/2507.04955

@arXiv_csRO_bot@mastoxiv.page
2025-07-08 12:40:20

Training-free Generation of Temporally Consistent Rewards from VLMs
Yinuo Zhao, Jiale Yuan, Zhiyuan Xu, Xiaoshuai Hao, Xinyi Zhang, Kun Wu, Zhengping Che, Chi Harold Liu, Jian Tang
arxiv.org/abs/2507.04789

@arXiv_csCL_bot@mastoxiv.page
2025-07-08 13:45:01

AdS: Adapter-state Sharing Framework for Multimodal Sarcasm Detection
Soumyadeep Jana, Sahil Danayak, Sanasam Ranbir Singh
arxiv.org/abs/2507.04508

@arXiv_csIR_bot@mastoxiv.page
2025-06-05 07:19:10

GORACS: Group-level Optimal Transport-guided Coreset Selection for LLM-based Recommender Systems
Tiehua Mei, Hengrui Chen, Peng Yu, Jiaqing Liang, Deqing Yang
arxiv.org/abs/2506.04015

@arXiv_csCV_bot@mastoxiv.page
2025-07-08 14:34:11

Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning
Yana Wei, Liang Zhao, Jianjian Sun, Kangheng Lin, Jisheng Yin, Jingcheng Hu, Yinmin Zhang, En Yu, Haoran Lv, Zejia Weng, Jia Wang, Chunrui Han, Yuang Peng, Qi Han, Zheng Ge, Xiangyu Zhang, Daxin Jiang, Vishal M. Patel
arxiv.org/abs/2507…

@arXiv_csCL_bot@mastoxiv.page
2025-07-02 09:52:10

Impact of Fine-Tuning Methods on Memorization in Large Language Models
Jie Hou, Chuxiong Wu, Lannan Luo, Qiang Zeng
arxiv.org/abs/2507.00258

@arXiv_csSD_bot@mastoxiv.page
2025-06-04 13:37:15

This arxiv.org/abs/2505.24200 has been replaced.
initial toot: mastoxiv.page/@arXiv_csSD_…

@arXiv_csRO_bot@mastoxiv.page
2025-06-05 07:21:44

Enhancing Safety of Foundation Models for Visual Navigation through Collision Avoidance via Repulsive Estimation
Joonkyung Kim, Joonyeol Sim, Woojun Kim, Katia Sycara, Changjoo Nam
arxiv.org/abs/2506.03834

@arXiv_csCV_bot@mastoxiv.page
2025-08-04 10:11:31

SU-ESRGAN: Semantic and Uncertainty-Aware ESRGAN for Super-Resolution of Satellite and Drone Imagery with Fine-Tuning for Cross Domain Evaluation
Prerana Ramkumar
arxiv.org/abs/2508.00750

@arXiv_csSD_bot@mastoxiv.page
2025-06-03 07:37:27

Fine-Tuning ASR for Stuttered Speech: Personalized vs. Generalized Approaches
Dena Mujtaba, Nihar Mahapatra
arxiv.org/abs/2506.00853

@arXiv_csRO_bot@mastoxiv.page
2025-07-08 12:03:40

MLLM-Fabric: Multimodal Large Language Model-Driven Robotic Framework for Fabric Sorting and Selection
Liman Wang, Hanyang Zhong, Tianyuan Wang, Shan Luo, Jihong Zhu
arxiv.org/abs/2507.04351

@arXiv_csCR_bot@mastoxiv.page
2025-06-09 07:37:32

Sentinel: SOTA model to protect against prompt injections
Dror Ivry, Oran Nahum
arxiv.org/abs/2506.05446 arxiv.org/pd…

@arXiv_csLG_bot@mastoxiv.page
2025-07-09 09:44:42

Navigating Sparse Molecular Data with Stein Diffusion Guidance
Van Khoa Nguyen, Lionel Blondé, Alexandros Kalousis
arxiv.org/abs/2507.05482

@arXiv_csCV_bot@mastoxiv.page
2025-07-31 10:12:01

TR-PTS: Task-Relevant Parameter and Token Selection for Efficient Tuning
Siqi Luo, Haoran Yang, Yi Xin, Mingyang Yi, Guangyang Wu, Guangtao Zhai, Xiaohong Liu
arxiv.org/abs/2507.22872

@arXiv_csLG_bot@mastoxiv.page
2025-06-09 10:07:22

Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning
Yuheng Lei, Sitong Mao, Shunbo Zhou, Hongyuan Zhang, Xuelong Li, Ping Luo
arxiv.org/abs/2506.05985

@arXiv_csCR_bot@mastoxiv.page
2025-06-06 07:16:18

Learning to Diagnose Privately: DP-Powered LLMs for Radiology Report Classification
Payel Bhattacharjee, Fengwei Tian, Ravi Tandon, Joseph Lo, Heidi Hanson, Geoffrey Rubin, Nirav Merchant, John Gounley
arxiv.org/abs/2506.04450

@arXiv_csLG_bot@mastoxiv.page
2025-06-09 10:13:42

Corrector Sampling in Language Models
Itai Gat, Neta Shaul, Uriel Singer, Yaron Lipman
arxiv.org/abs/2506.06215 arxiv…

@arXiv_csCL_bot@mastoxiv.page
2025-06-23 12:12:50

Fine-Tuning Lowers Safety and Disrupts Evaluation Consistency
Kathleen C. Fraser, Hillary Dawkins, Isar Nejadgholi, Svetlana Kiritchenko
arxiv.org/abs/2506.17209

@arXiv_csCR_bot@mastoxiv.page
2025-07-30 10:08:51

SDD: Self-Degraded Defense against Malicious Fine-tuning
Zixuan Chen, Weikai Lu, Xin Lin, Ziqian Zeng
arxiv.org/abs/2507.21182 arxiv.org/pd…

@arXiv_csLG_bot@mastoxiv.page
2025-06-05 11:01:12

This arxiv.org/abs/2506.02308 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_csCL_bot@mastoxiv.page
2025-07-08 14:03:01

InfoSteer: Steering Information Utility in Language Model Post-Training
Chunyuan Deng, Ruidi Chang, Hanjie Chen
arxiv.org/abs/2507.05158

@arXiv_csLG_bot@mastoxiv.page
2025-06-03 21:37:23

This arxiv.org/abs/2505.03793 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_csCL_bot@mastoxiv.page
2025-07-08 13:57:01

CoSteer: Collaborative Decoding-Time Personalization via Local Delta Steering
Hang Lv, Sheng Liang, Hao Wang, Hongchao Gu, Yaxiong Wu, Wei Guo, Defu Lian, Yong Liu, Enhong Chen
arxiv.org/abs/2507.04756

@arXiv_csCR_bot@mastoxiv.page
2025-07-08 11:26:31

SecureT2I: No More Unauthorized Manipulation on AI Generated Images from Prompts
Xiaodong Wu, Xiangman Li, Qi Li, Jianbing Ni, Rongxing Lu
arxiv.org/abs/2507.03636

@arXiv_csCL_bot@mastoxiv.page
2025-07-04 09:52:11

MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs
Purbesh Mitra, Sennur Ulukus
arxiv.org/abs/2507.02851 a…

@arXiv_csCL_bot@mastoxiv.page
2025-07-08 13:48:31

Knowledge-Aware Self-Correction in Language Models via Structured Memory Graphs
Swayamjit Saha
arxiv.org/abs/2507.04625