Tootfinder

Opt-in global Mastodon full text search.

@arXiv_csLG_bot@mastoxiv.page
2025-09-18 10:18:21

Language models' activations linearly encode training-order recency
Dmitrii Krasheninnikov, Richard E. Turner, David Krueger
arxiv.org/abs/2509.14223

@Techmeme@techhub.social
2025-09-18 13:10:41

In a peer-reviewed Nature article, DeepSeek says it has spent $294,000 on training its R1 model and used 512 Nvidia H800 chips (Eduardo Baptista/Reuters)
reuters.com/world/china/chinas

@arXiv_csCL_bot@mastoxiv.page
2025-08-18 09:40:50

Rationalizing Transformer Predictions via End-To-End Differentiable Self-Training
Marc Brinner, Sina Zarrieß
arxiv.org/abs/2508.11393 a…

@arXiv_csCV_bot@mastoxiv.page
2025-08-18 09:55:00

Training-Free Anomaly Generation via Dual-Attention Enhancement in Diffusion Model
Zuo Zuo, Jiahao Dong, Yanyun Qu, Zongze Wu
arxiv.org/abs/2508.11550

@arXiv_csCR_bot@mastoxiv.page
2025-08-19 11:02:50

Substituting Proof of Work in Blockchain with Training-Verified Collaborative Model Computation
Mohammad Ishzaz Asif Rafid, Morsalin Sakib
arxiv.org/abs/2508.12138

@arXiv_csRO_bot@mastoxiv.page
2025-09-18 10:14:41

FlightDiffusion: Revolutionising Autonomous Drone Training with Diffusion Models Generating FPV Video
Valerii Serpiva, Artem Lykov, Faryal Batool, Vladislav Kozlovskiy, Miguel Altamirano Cabrera, Dzmitry Tsetserukou
arxiv.org/abs/2509.14082

@drgeraint@glasgow.social
2025-09-18 09:26:56

"In theory, AI model makers could eliminate hallucinations by using a dataset that contains no errors."
I think someone has fundamentally misunderstood the technology. Developing a model using a 100% correct training dataset does not mean that the resulting AI will be able to correctly answer questions that were not in the training data.
Over-fitting is a thing.

@arXiv_csCY_bot@mastoxiv.page
2025-08-19 10:17:20

SimInterview: Transforming Business Education through Large Language Model-Based Simulated Multilingual Interview Training System
Truong Thanh Hung Nguyen, Tran Diem Quynh Nguyen, Hoang Loc Cao, Thi Cam Thanh Tran, Thi Cam Mai Truong, Hung Cao
arxiv.org/abs/2508.11873

@arXiv_csAI_bot@mastoxiv.page
2025-08-19 10:09:30

Mantis: A Simulation-Grounded Foundation Model for Disease Forecasting
Carson Dudley, Reiden Magdaleno, Christopher Harding, Ananya Sharma, Emily Martin, Marisa Eisenberg
arxiv.org/abs/2508.12260

@arXiv_statML_bot@mastoxiv.page
2025-08-18 08:39:50

ADMIRE-BayesOpt: Accelerated Data MIxture RE-weighting for Language Models with Bayesian Optimization
Shengzhuang Chen, Xu Ouyang, Michael Arthur Leopold Pearce, Thomas Hartvigsen, Jonathan Richard Schwarz
arxiv.org/abs/2508.11551

@arXiv_csLG_bot@mastoxiv.page
2025-08-18 09:38:30

NeMo: A Neuron-Level Modularizing-While-Training Approach for Decomposing DNN Models
Xiaohan Bi, Binhang Qi, Hailong Sun, Xiang Gao, Yue Yu, Xiaojun Liang
arxiv.org/abs/2508.11348

@arXiv_csDC_bot@mastoxiv.page
2025-08-19 07:38:50

Breaking the Aggregation Bottleneck in Federated Recommendation: A Personalized Model Merging Approach
Jundong Chen, Honglei Zhang, Chunxu Zhang, Fangyuan Luo, Yidong Li
arxiv.org/abs/2508.12386

@Techmeme@techhub.social
2025-08-17 15:55:49

NYC-based Protege, which prepares and sells real-world datasets like lab results and sports footage for AI training, raised a $25M Series A led by Footwork (Natasha Mascarenhas/The Information)
theinformation.com/articles/on

@arXiv_csGR_bot@mastoxiv.page
2025-09-19 08:18:21

WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance
Chenxi Song, Yanming Yang, Tong Zhao, Ruibo Li, Chi Zhang
arxiv.org/abs/2509.15130

@arXiv_csIR_bot@mastoxiv.page
2025-08-19 08:21:20

A Large-Scale Web Search Dataset for Federated Online Learning to Rank
Marcel Gregoriadis, Jingwei Kang, Johan Pouwelse
arxiv.org/abs/2508.12353

@arXiv_csSD_bot@mastoxiv.page
2025-09-19 10:09:31

From Hype to Insight: Rethinking Large Language Model Integration in Visual Speech Recognition
Rishabh Jain, Naomi Harte
arxiv.org/abs/2509.14880

@arXiv_eessSP_bot@mastoxiv.page
2025-09-18 08:54:21

Efficient Quantization-Aware Neural Receivers: Beyond Post-Training Quantization
SaiKrishna Saketh Yellapragada, Esa Ollila, Mario Costa
arxiv.org/abs/2509.13786

@arXiv_csCV_bot@mastoxiv.page
2025-09-18 10:25:11

CSMoE: An Efficient Remote Sensing Foundation Model with Soft Mixture-of-Experts
Leonard Hackel, Tom Burgert, Begüm Demir
arxiv.org/abs/2509.14104

@arXiv_csCR_bot@mastoxiv.page
2025-09-19 09:41:41

Adversarial Distilled Retrieval-Augmented Guarding Model for Online Malicious Intent Detection
Yihao Guo, Haocheng Bian, Liutong Zhou, Ze Wang, Zhaoyi Zhang, Francois Kawala, Milan Dean, Ian Fischer, Yuantao Peng, Noyan Tokgozoglu, Ivan Barrientos, Riyaaz Shaik, Rachel Li, Chandru Venkataraman, Reza Shifteh Far, Moses Pawar, Venkat Sundaranatha, Michael Xu, Frank Chu

@arXiv_csRO_bot@mastoxiv.page
2025-09-18 10:14:51

GeoAware-VLA: Implicit Geometry Aware Vision-Language-Action Model
Ali Abouzeid, Malak Mansour, Zezhou Sun, Dezhen Song
arxiv.org/abs/2509.14117

@arXiv_eessAS_bot@mastoxiv.page
2025-09-19 09:58:01

From Who Said What to Who They Are: Modular Training-free Identity-Aware LLM Refinement of Speaker Diarization
Yu-Wen Chen, William Ho, Maxim Topaz, Julia Hirschberg, Zoran Kostic
arxiv.org/abs/2509.15082

@arXiv_csCL_bot@mastoxiv.page
2025-08-18 09:41:00

Survey-to-Behavior: Downstream Alignment of Human Values in LLMs via Survey Questions
Shangrui Nie, Florian Mai, David Kaczér, Charles Welch, Zhixue Zhao, Lucie Flek
arxiv.org/abs/2508.11414

@arXiv_csLG_bot@mastoxiv.page
2025-09-18 10:15:41

A Universal Banach–Bregman Framework for Stochastic Iterations: Unifying Stochastic Mirror Descent, Learning and LLM Training
Johnny R. Zhang (Independent Researcher), Xiaomei Mi (University of Manchester), Gaoyuan Du (Amazon), Qianyi Sun (Microsoft), Shiqi Wang (Meta), Jiaxuan Li (Amazon), Wenhua Zhou (Independent Researcher)
arx…

@arXiv_statME_bot@mastoxiv.page
2025-09-19 09:20:31

Semiparametric Learning from Open-Set Label Shift Data
Siyan Liu, Yukun Liu, Qinglong Tian, Pengfei Li, Jing Qin
arxiv.org/abs/2509.14522 a…

@arXiv_csAR_bot@mastoxiv.page
2025-08-19 08:15:30

AutoPower: Automated Few-Shot Architecture-Level Power Modeling by Power Group Decoupling
Qijun Zhang, Yao Lu, Mengming Li, Zhiyao Xie
arxiv.org/abs/2508.12294

@arXiv_physicsfludyn_bot@mastoxiv.page
2025-09-18 09:18:21

A proposal for automated turbulence modelling
Marco Castelletti, Maurizio Quadrio
arxiv.org/abs/2509.14140 arxiv.org/pdf/2509.14140

@arXiv_eessIV_bot@mastoxiv.page
2025-09-17 08:49:50

MEGAN: Mixture of Experts for Robust Uncertainty Estimation in Endoscopy Videos
Damola Agbelese, Krishna Chaitanya, Pushpak Pati, Chaitanya Parmar, Pooya Mobadersany, Shreyas Fadnavis, Lindsey Surace, Shadi Yarandi, Louis R. Ghanem, Molly Lucas, Tommaso Mansi, Oana Gabriela Cula, Pablo F. Damasceno, Kristopher Standish
arxiv.org/ab…

@arXiv_csSD_bot@mastoxiv.page
2025-09-18 08:24:01

Noise Supervised Contrastive Learning and Feature-Perturbed for Anomalous Sound Detection
Shun Huang, Zhihua Fang, Liang He
arxiv.org/abs/2509.13853

@arXiv_csCV_bot@mastoxiv.page
2025-09-19 10:25:41

Synthetic-to-Real Object Detection using YOLOv11 and Domain Randomization Strategies
Luisa Torquato Niño, Hamza A. A. Gardi
arxiv.org/abs/2509.15045

@arXiv_csAI_bot@mastoxiv.page
2025-09-18 07:40:11

FRIT: Using Causal Importance to Improve Chain-of-Thought Faithfulness
Anand Swaroop, Akshat Nallani, Saksham Uboweja, Adiliia Uzdenova, Michael Nguyen, Kevin Zhu, Sunishchal Dev, Ashwinee Panda, Vasu Sharma, Maheep Chaudhary
arxiv.org/abs/2509.13334

@arXiv_csLG_bot@mastoxiv.page
2025-08-18 09:45:50

Physics-Informed Diffusion Models for Unsupervised Anomaly Detection in Multivariate Time Series
Juhi Soni, Markus Lange-Hegermann, Stefan Windmann
arxiv.org/abs/2508.11528

@arXiv_csCR_bot@mastoxiv.page
2025-08-19 11:18:00

Where to Start Alignment? Diffusion Large Language Model May Demand a Distinct Position
Zhixin Xie, Xurui Song, Jun Luo
arxiv.org/abs/2508.12398

@arXiv_csCL_bot@mastoxiv.page
2025-09-18 10:16:51

Canary-1B-v2 & Parakeet-TDT-0.6B-v3: Efficient and High-Performance Models for Multilingual ASR and AST
Monica Sekoyan, Nithin Rao Koluguri, Nune Tadevosyan, Piotr Zelasko, Travis Bartley, Nick Karpov, Jagadeesh Balam, Boris Ginsburg
arxiv.org/abs/2509.14128

@arXiv_csRO_bot@mastoxiv.page
2025-09-19 10:11:11

CollabVLA: Self-Reflective Vision-Language-Action Model Dreaming Together with Human
Nan Sun, Yongchang Li, Chenxu Wang, Huiying Li, Huaping Liu
arxiv.org/abs/2509.14889

@arXiv_physicschemph_bot@mastoxiv.page
2025-08-18 11:18:45

Replaced article(s) found for physics.chem-ph. arxiv.org/list/physics.chem-ph
[1/1]:
- Machine Learning Interatomic Potentials: library for efficient training, model development and si...
Christoph Brunken, et al.

@arXiv_csHC_bot@mastoxiv.page
2025-08-15 07:47:02

Pre-trained Transformer-models using chronic invasive electrophysiology for symptom decoding without patient-individual training
Timon Merk, Saeed Salehi, Richard M. Koehler, Qiming Cui, Maria Olaru, Amelia Hahn, Nicole R. Provenza, Simon Little, Reza Abbasi-Asl, Phil A. Starr, Wolf-Julian Neumann
arxiv.org/abs/2508.10160

@arXiv_physicsspaceph_bot@mastoxiv.page
2025-08-19 08:20:10

A Neural-Network Framework for Tracking and Identification of Cosmic-Ray Nuclei in the RadMap Telescope
Luise Meyer-Hetling, Martin J. Losekamm, Stephan Paul, Thomas Pöschl
arxiv.org/abs/2508.12708

@arXiv_csSE_bot@mastoxiv.page
2025-09-17 08:49:39

Ensembling Large Language Models for Code Vulnerability Detection: An Empirical Evaluation
Zhihong Sun, Jia Li, Yao Wan, Chuanyi Li, Hongyu Zhang, Zhi Jin, Ge Li, Hong Liu, Chen Lyu, Songlin Hu
arxiv.org/abs/2509.12629

@arXiv_csDC_bot@mastoxiv.page
2025-08-19 09:00:30

Accelerating Edge Inference for Distributed MoE Models with Latency-Optimized Expert Placement
Tian Wu, Liming Wang, Zijian Wen, Xiaoxi Zhang, Jingpu Duan, Xianwei Zhang, Jinhang Zuo
arxiv.org/abs/2508.12851

@arXiv_physicsaoph_bot@mastoxiv.page
2025-09-16 08:49:36

How does an AI Weather Model Learn to Forecast Extreme Weather?
Rebecca Baiman, Elizabeth A. Barnes, Ankur Mahesh
arxiv.org/abs/2509.10639

@arXiv_csCL_bot@mastoxiv.page
2025-08-18 09:08:00

Beyond the Rosetta Stone: Unification Forces in Generalization Dynamics
Carter Blum, Katja Filipova, Ann Yuan, Asma Ghandeharioun, Julian Zimmert, Fred Zhang, Jessica Hoffmann, Tal Linzen, Martin Wattenberg, Lucas Dixon, Mor Geva
arxiv.org/abs/2508.11017

@arXiv_csCV_bot@mastoxiv.page
2025-09-16 12:43:17

FS-SAM2: Adapting Segment Anything Model 2 for Few-Shot Semantic Segmentation via Low-Rank Adaptation
Bernardo Forni, Gabriele Lombardi, Federico Pozzi, Mirco Planamente
arxiv.org/abs/2509.12105

@arXiv_csCR_bot@mastoxiv.page
2025-08-19 11:38:10

Unlearning Comparator: A Visual Analytics System for Comparative Evaluation of Machine Unlearning Methods
Jaeung Lee, Suhyeon Yu, Yurim Jang, Simon S. Woo, Jaemin Jo
arxiv.org/abs/2508.12730

@arXiv_csSD_bot@mastoxiv.page
2025-09-19 10:12:21

Back to Ear: Perceptually Driven High Fidelity Music Reconstruction
Kangdi Wang, Zhiyue Wu, Dinghao Zhou, Rui Lin, Junyu Dai, Tao Jiang
arxiv.org/abs/2509.14912

@arXiv_eessSP_bot@mastoxiv.page
2025-08-19 11:07:10

ATLAS: AI-Native Receiver Test-and-Measurement by Leveraging AI-Guided Search
Mauro Belgiovine, Suyash Pradhan, Johannes Lange, Michael Löhning, Kaushik Chowdhury
arxiv.org/abs/2508.12204

@arXiv_csAI_bot@mastoxiv.page
2025-08-19 10:19:50

Wisdom of the Crowd: Reinforcement Learning from Coevolutionary Collective Feedback
Wenzhen Yuan, Shengji Tang, Weihao Lin, Jiacheng Ruan, Ganqu Cui, Bo Zhang, Tao Chen, Ting Liu, Yuzhuo Fu, Peng Ye, Lei Bai
arxiv.org/abs/2508.12338

@arXiv_csCL_bot@mastoxiv.page
2025-09-17 10:34:40

Multi-Model Synthetic Training for Mission-Critical Small Language Models
Nolan Platt, Pragyansmita Nayak
arxiv.org/abs/2509.13047 arxiv.or…

@arXiv_csCR_bot@mastoxiv.page
2025-09-18 09:31:31

Differential Privacy in Federated Learning: Mitigating Inference Attacks with Randomized Response
Ozer Ozturk, Busra Buyuktanir, Gozde Karatas Baydogmus, Kazim Yildiz
arxiv.org/abs/2509.13987

@arXiv_csLG_bot@mastoxiv.page
2025-09-16 12:40:07

Low-rank Orthogonalization for Large-scale Matrix Optimization with Applications to Foundation Model Training
Chuan He, Zhanwang Deng, Zhaosong Lu
arxiv.org/abs/2509.11983

@arXiv_csDC_bot@mastoxiv.page
2025-10-14 09:05:28

FLAMMABLE: A Multi-Model Federated Learning Framework with Multi-Model Engagement and Adaptive Batch Sizes
Shouxu Lin, Zimeng Pan, Yuhang Yao, Haeyoung Noh, Pei Zhang, Carlee Joe-Wong
arxiv.org/abs/2510.10380

@arXiv_csRO_bot@mastoxiv.page
2025-09-19 09:40:21

Toward Embodiment Equivariant Vision-Language-Action Policy
Anzhe Chen, Yifei Yang, Zhenjie Zhu, Kechun Xu, Zhongxiang Zhou, Rong Xiong, Yue Wang
arxiv.org/abs/2509.14630

@arXiv_csCV_bot@mastoxiv.page
2025-08-18 12:06:15

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/4]:
- IRL-VLA: Training an Vision-Language-Action Policy via Reward World Model
Jiang, Gao, Wang, Sun, Wang, Heng, Sun, Tang, Zhu, Chai, Wang, Gu, Jiang, Sun

@arXiv_eessAS_bot@mastoxiv.page
2025-09-19 09:46:01

Mitigating data replication in text-to-audio generative diffusion models through anti-memorization guidance
Francisco Messina, Francesca Ronchini, Luca Comanducci, Paolo Bestagini, Fabio Antonacci
arxiv.org/abs/2509.14934

@arXiv_csIR_bot@mastoxiv.page
2025-08-15 08:42:32

Proxy Model-Guided Reinforcement Learning for Client Selection in Federated Recommendation
Liang Qu, Jianxin Li, Wei Yuan, Penghui Ruan, Yuhui Shi, Hongzhi Yin
arxiv.org/abs/2508.10401

@arXiv_csLG_bot@mastoxiv.page
2025-09-18 10:20:21

Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision
Dulhan Jayalath, Shashwat Goel, Thomas Foster, Parag Jain, Suchin Gururangan, Cheng Zhang, Anirudh Goyal, Alan Schelten
arxiv.org/abs/2509.14234

@arXiv_csCL_bot@mastoxiv.page
2025-09-19 10:36:51

LLM-OREF: An Open Relation Extraction Framework Based on Large Language Models
Hongyao Tu, Liang Zhang, Yujie Lin, Xin Lin, Haibo Zhang, Long Zhang, Jinsong Su
arxiv.org/abs/2509.15089

@arXiv_csRO_bot@mastoxiv.page
2025-09-19 10:01:51

RealMirror: A Comprehensive, Open-Source Vision-Language-Action Platform for Embodied AI
Cong Tai, Zhaoyu Zheng, Haixu Long, Hansheng Wu, Haodong Xiang, Zhengbin Long, Jun Xiong, Rong Shi, Shizhuang Zhang, Gang Qiu, He Wang, Ruifeng Li, Jun Huang, Bin Chang, Shuai Feng, Tao Shen
arxiv.org/abs/2509.14687

@arXiv_csSD_bot@mastoxiv.page
2025-09-19 10:06:11

MeanFlowSE: one-step generative speech enhancement via conditional mean flow
Duojia Li, Shenghui Lu, Hongchen Pan, Zongyi Zhan, Qingyang Hong, Lin Li
arxiv.org/abs/2509.14858

@arXiv_eessSP_bot@mastoxiv.page
2025-09-17 09:40:00

Bayesian Signal Separation via Plug-and-Play Diffusion-Within-Gibbs Sampling
Yi Zhang, Rui Guo, Yonina C. Eldar
arxiv.org/abs/2509.12857 ar…

@arXiv_csCV_bot@mastoxiv.page
2025-09-18 10:19:01

Morphology-optimized Multi-Scale Fusion: Combining Local Artifacts and Mesoscopic Semantics for Deepfake Detection and Localization
Chao Shuai, Gaojian Wang, Kun Pan, Tong Wu, Fanli Jin, Haohan Tan, Mengxiang Li, Zhenguang Liu, Feng Lin, Kui Ren
arxiv.org/abs/2509.13776

@arXiv_csLG_bot@mastoxiv.page
2025-08-18 09:43:10

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting
Wenhao Zhang, Yuexiang Xie, Yuchang Sun, Yanxi Chen, Guoyin Wang, Yaliang Li, Bolin Ding, Jingren Zhou
arxiv.org/abs/2508.11408

@arXiv_csCR_bot@mastoxiv.page
2025-09-19 08:50:11

Beyond Data Privacy: New Privacy Risks for Large Language Models
Yuntao Du, Zitao Li, Ninghui Li, Bolin Ding
arxiv.org/abs/2509.14278 arxiv…

@arXiv_csCL_bot@mastoxiv.page
2025-08-18 09:48:00

Dataset Creation for Visual Entailment using Generative AI
Rob Reijtenbach, Suzan Verberne, Gijs Wijnholds
arxiv.org/abs/2508.11605 arxiv.o…

@arXiv_csAI_bot@mastoxiv.page
2025-09-16 08:29:56

LLM Enhancement with Domain Expert Mental Model to Reduce LLM Hallucination with Causal Prompt Engineering
Boris Kovalerchuk, Brent D. Fegley
arxiv.org/abs/2509.10818

@arXiv_csSD_bot@mastoxiv.page
2025-08-18 07:39:40

Benchmarking Prosody Encoding in Discrete Speech Tokens
Kentaro Onda, Satoru Fukayama, Daisuke Saito, Nobuaki Minematsu
arxiv.org/abs/2508.11224

@arXiv_csIR_bot@mastoxiv.page
2025-08-15 09:23:42

FuXi-β: Towards a Lightweight and Fast Large-Scale Generative Recommendation Model
Yufei Ye, Wei Guo, Hao Wang, Hong Zhu, Yuyang Ye, Yong Liu, Huifeng Guo, Ruiming Tang, Defu Lian, Enhong Chen
arxiv.org/abs/2508.10615

@arXiv_csLG_bot@mastoxiv.page
2025-09-18 10:07:21

Differentially private federated learning for localized control of infectious disease dynamics
Raouf Kerkouche, Henrik Zunker, Mario Fritz, Martin J. Kühn
arxiv.org/abs/2509.14024

@arXiv_eessAS_bot@mastoxiv.page
2025-08-18 08:46:20

MoE-TTS: Enhancing Out-of-Domain Text Understanding for Description-based TTS via Mixture-of-Experts
Heyang Xue, Xuchen Song, Yu Tang, Jianyu Chen, Yanru Chen, Yang Li, Yahui Zhou
arxiv.org/abs/2508.11326

@arXiv_csDC_bot@mastoxiv.page
2025-08-14 07:48:22

Verify Distributed Deep Learning Model Implementation Refinement with Iterative Relation Inference
Zhanghan Wang, Ding Ding, Hang Zhu, Haibin Lin, Aurojit Panda
arxiv.org/abs/2508.09505

@arXiv_csCR_bot@mastoxiv.page
2025-08-19 10:22:40

Optimizing Token Choice for Code Watermarking: A RL Approach
Zhimeng Guo, Huaisheng Zhu, Siyuan Xu, Hangfan Zhang, Teng Xiao, Minhao Cheng
arxiv.org/abs/2508.11925

@arXiv_csCL_bot@mastoxiv.page
2025-09-19 10:39:41

LNE-Blocking: An Efficient Framework for Contamination Mitigation Evaluation on Large Language Models
Ruijie Hou, Yueyang Jiao, Hanxu Hu, Yingming Li, Wai Lam, Huajian Zhang, Hongyuan Lu
arxiv.org/abs/2509.15218

@arXiv_csLG_bot@mastoxiv.page
2025-09-18 10:13:01

From Distributional to Quantile Neural Basis Models: the case of Electricity Price Forecasting
Alessandro Brusaferri, Danial Ramin, Andrea Ballarino
arxiv.org/abs/2509.14113

@arXiv_csAI_bot@mastoxiv.page
2025-09-12 07:31:39

ForTIFAI: Fending Off Recursive Training Induced Failure for AI Models
Soheil Zibakhsh Shabgahi, Pedram Aghazadeh, Azalia Mirhosseini, Farinaz Koushanfar
arxiv.org/abs/2509.08972

@arXiv_csCL_bot@mastoxiv.page
2025-09-18 09:43:51

DSPC: Dual-Stage Progressive Compression Framework for Efficient Long-Context Reasoning
Yaxin Gao, Yao Lu, Zongfei Zhang, Jiaqi Nie, Shanqing Yu, Qi Xuan
arxiv.org/abs/2509.13723

@arXiv_csRO_bot@mastoxiv.page
2025-10-15 10:12:01

Residual MPC: Blending Reinforcement Learning with GPU-Parallelized Model Predictive Control
Se Hwan Jeon, Ho Jae Lee, Seungwoo Hong, Sangbae Kim
arxiv.org/abs/2510.12717

@arXiv_csCV_bot@mastoxiv.page
2025-08-15 10:23:12

An Efficient Model-Driven Groupwise Approach for Atlas Construction
Ziwei Zou, Bei Zou, Xiaoyan Kui, Wenqi Lu, Haoran Dou, Arezoo Zakeri, Timothy Cootes, Alejandro F Frangi, Jinming Duan
arxiv.org/abs/2508.10743

@arXiv_csSD_bot@mastoxiv.page
2025-09-17 10:00:10

UTI-LLM: A Personalized Articulatory-Speech Therapy Assistance System Based on Multimodal Large Language Model
Yudong Yang, Xiaokang Liu, Shaofeng Zhao, Rongfeng Su, Nan Yan, Lan Wang
arxiv.org/abs/2509.13145

@arXiv_csCV_bot@mastoxiv.page
2025-08-15 10:26:32

Human-in-Context: Unified Cross-Domain 3D Human Motion Modeling via In-Context Learning
Mengyuan Liu, Xinshun Wang, Zhongbin Fang, Deheng Ye, Xia Li, Tao Tang, Songtao Wu, Xiangtai Li, Ming-Hsuan Yang
arxiv.org/abs/2508.10897

@arXiv_csRO_bot@mastoxiv.page
2025-08-15 09:31:42

CorrectNav: Self-Correction Flywheel Empowers Vision-Language-Action Navigation Model
Zhuoyuan Yu, Yuxing Long, Zihan Yang, Chengyan Zeng, Hongwei Fan, Jiyao Zhang, Hao Dong
arxiv.org/abs/2508.10416

@arXiv_csCL_bot@mastoxiv.page
2025-09-17 10:26:50

All Roads Lead to Rome: Graph-Based Confidence Estimation for Large Language Model Reasoning
Caiqi Zhang, Chang Shu, Ehsan Shareghi, Nigel Collier
arxiv.org/abs/2509.12908

@arXiv_csCR_bot@mastoxiv.page
2025-09-16 11:58:07

MAUI: Reconstructing Private Client Data in Federated Transfer Learning
Ahaan Dabholkar, Atul Sharma, Z. Berkay Celik, Saurabh Bagchi
arxiv.org/abs/2509.11451

@arXiv_csAI_bot@mastoxiv.page
2025-08-15 09:37:12

Diversity First, Quality Later: A Two-Stage Assumption for Language Model Alignment
Zetian Sun, Dongfang Li, Baotian Hu
arxiv.org/abs/2508.10530

@arXiv_csSD_bot@mastoxiv.page
2025-09-15 08:21:21

DiTReducio: A Training-Free Acceleration for DiT-Based TTS via Progressive Calibration
Yanru Huo, Ziyue Jiang, Zuoli Tang, Qingyang Hong, Zhou Zhao
arxiv.org/abs/2509.09748

@arXiv_csLG_bot@mastoxiv.page
2025-09-15 09:48:01

FedBiF: Communication-Efficient Federated Learning via Bits Freezing
Shiwei Li, Qunwei Li, Haozhao Wang, Ruixuan Li, Jianbin Lin, Wenliang Zhong
arxiv.org/abs/2509.10161

@arXiv_csCL_bot@mastoxiv.page
2025-09-17 09:16:00

MAGIC-Enhanced Keyword Prompting for Zero-Shot Audio Captioning with CLIP Models
Vijay Govindarajan, Pratik Patel, Sahil Tripathi, Md Azizul Hoque, Gautam Siddharth Kashyap
arxiv.org/abs/2509.12591

@arXiv_csCV_bot@mastoxiv.page
2025-10-15 10:54:31

ViCO: A Training Strategy towards Semantic Aware Dynamic High-Resolution
Long Cui, Weiyun Wang, Jie Shao, Zichen Wen, Gen Luo, Linfeng Zhang, Yanting Zhang, Yu Qiao, Wenhai Wang
arxiv.org/abs/2510.12793

@arXiv_csCV_bot@mastoxiv.page
2025-10-15 07:59:31

Task-Specific Dual-Model Framework for Comprehensive Traffic Safety Video Description and Analysis
Blessing Agyei Kyem, Neema Jakisa Owor, Andrews Danyo, Joshua Kofi Asamoah, Eugene Denteh, Tanner Muturi, Anthony Dontoh, Yaw Adu-Gyamfi, Armstrong Aboah
arxiv.org/abs/2510.11907

@arXiv_csLG_bot@mastoxiv.page
2025-10-15 10:46:41

Laminar: A Scalable Asynchronous RL Post-Training Framework
Guangming Sheng, Yuxuan Tong, Borui Wan, Wang Zhang, Chaobo Jia, Xibin Wu, Yuqi Wu, Xiang Li, Chi Zhang, Yanghua Peng, Haibin Lin, Xin Liu, Chuan Wu
arxiv.org/abs/2510.12633

@arXiv_csCR_bot@mastoxiv.page
2025-08-15 09:12:32

FIDELIS: Blockchain-Enabled Protection Against Poisoning Attacks in Federated Learning
Jane Carney, Kushal Upreti, Gaby G. Dagher, Tim Andersen
arxiv.org/abs/2508.10042

@arXiv_csCV_bot@mastoxiv.page
2025-09-10 10:43:41

Active Membership Inference Test (aMINT): Enhancing Model Auditability with Multi-Task Learning
Daniel DeAlcala, Aythami Morales, Julian Fierrez, Gonzalo Mancera, Ruben Tolosana, Javier Ortega-Garcia
arxiv.org/abs/2509.07879

@arXiv_csLG_bot@mastoxiv.page
2025-08-12 12:00:23

Revisiting Data Attribution for Influence Functions
Hongbo Zhu, Angelo Cangelosi
arxiv.org/abs/2508.07297 arxiv.org/pdf/2508.07297

@arXiv_csCL_bot@mastoxiv.page
2025-10-13 10:32:00

Logit Arithmetic Elicits Long Reasoning Capabilities Without Training
Yunxiang Zhang, Muhammad Khalifa, Lechen Zhang, Xin Liu, Ayoung Lee, Xinliang Frederick Zhang, Farima Fatahi Bayat, Lu Wang
arxiv.org/abs/2510.09354

@arXiv_csLG_bot@mastoxiv.page
2025-10-08 11:00:59

Training Dynamics Impact Post-Training Quantization Robustness
Albert Catalan-Tatjer, Niccolò Ajroldi, Jonas Geiping
arxiv.org/abs/2510.06213

@arXiv_csCV_bot@mastoxiv.page
2025-10-14 13:47:38

DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training
Haoran Feng, Dizhe Zhang, Xiangtai Li, Bo Du, Lu Qi
arxiv.org/abs/2510.11712

@arXiv_csLG_bot@mastoxiv.page
2025-09-16 12:46:17

Event2Vec: A Geometric Approach to Learning Composable Representations of Event Sequences
Antonin Sulc
arxiv.org/abs/2509.12188 arxiv.org/p…

@arXiv_csCV_bot@mastoxiv.page
2025-10-15 10:46:21

LayerSync: Self-aligning Intermediate Layers
Yasaman Haghighi, Bastien van Delft, Mariam Hassan, Alexandre Alahi
arxiv.org/abs/2510.12581 a…

@arXiv_csLG_bot@mastoxiv.page
2025-10-14 13:41:38

Representation-Based Exploration for Language Models: From Test-Time to Post-Training
Jens Tuyls, Dylan J. Foster, Akshay Krishnamurthy, Jordan T. Ash
arxiv.org/abs/2510.11686

@arXiv_csCV_bot@mastoxiv.page
2025-09-12 10:09:09

Exploring Pre-training Across Domains for Few-Shot Surgical Skill Assessment
Dimitrios Anastasiou, Razvan Caramalau, Nazir Sirajudeen, Matthew Boal, Philip Edwards, Justin Collins, John Kelly, Ashwin Sridhar, Maxine Tran, Faiz Mumtaz, Nevil Pavithran, Nader Francis, Danail Stoyanov, Evangelos B. Mazomenos
arxiv.org/abs/2509.09327…

@arXiv_csLG_bot@mastoxiv.page
2025-09-15 09:42:11

Exploring Expert Specialization through Unsupervised Training in Sparse Mixture of Experts
Strahinja Nikolic, Ilker Oguz, Demetri Psaltis
arxiv.org/abs/2509.10025

@arXiv_csLG_bot@mastoxiv.page
2025-09-12 09:22:29

Open-sci-ref-0.01: open and reproducible reference baselines for language model and dataset comparison
Marianna Nezhurina, Taishi Nakamura, Timur Carstensen, Niccolò Ajroldi, Ville Komulainen, David Salinas, Jenia Jitsev
arxiv.org/abs/2509.09009