Tootfinder

Opt-in global Mastodon full text search. Join the index!

@Techmeme@techhub.social
2025-10-10 20:26:02

SemiAnalysis launches InferenceMAX, an open-source benchmark that automatically tracks LLM inference performance across AI models and frameworks every night (Kimbo Chen/SemiAnalysis)
newsletter.semianalysis.com/p/

@arXiv_csAI_bot@mastoxiv.page
2025-08-13 07:32:22

LLM-BI: Towards Fully Automated Bayesian Inference with Large Language Models
Yongchao Huang
arxiv.org/abs/2508.08300 arxiv.org/pdf/2508.08…

@arXiv_csCR_bot@mastoxiv.page
2025-09-12 08:40:49

Towards Confidential and Efficient LLM Inference with Dual Privacy Protection
Honglan Yu, Yibin Wang, Feifei Dai, Dong Liu, Haihui Fan, Xiaoyan Gu
arxiv.org/abs/2509.09091

@arXiv_csRO_bot@mastoxiv.page
2025-10-13 10:14:30

Zero-shot Structure Learning and Planning for Autonomous Robot Navigation using Active Inference
Daria de tinguy, Tim Verbelen, Emilio Gamba, Bart Dhoedt
arxiv.org/abs/2510.09574

@arXiv_csLG_bot@mastoxiv.page
2025-09-12 10:05:29

Fused Lasso Improves Accuracy of Co-occurrence Network Inference in Grouped Samples
Daniel Agyapong, Briana H. Beatty, Peter G. Kennedy, Toby D. Hocking
arxiv.org/abs/2509.09413

@arXiv_csCL_bot@mastoxiv.page
2025-08-13 10:16:32

READER: Retrieval-Assisted Drafter for Efficient LLM Inference
Maxim Divilkovskiy, Vitaly Malygin, Sergey Zlobin, Sultan Isali, Vasily Kalugin, Stanislav Ilyushin, Nuriza Aitassova, Yi Fei, Zeng Weidi
arxiv.org/abs/2508.09072

@arXiv_statML_bot@mastoxiv.page
2025-10-13 09:03:40

Efficient Autoregressive Inference for Transformer Probabilistic Models
Conor Hassan, Nasrulloh Loka, Cen-You Li, Daolang Huang, Paul E. Chang, Yang Yang, Francesco Silvestrin, Samuel Kaski, Luigi Acerbi
arxiv.org/abs/2510.09477

@arXiv_statME_bot@mastoxiv.page
2025-10-13 08:57:40

Robust and Efficient Semiparametric Inference for the Stepped Wedge Design
Fan Xia, K. C. Gary Chan, Emily Voldal, Avi Kenny, Patrick J. Heagerty, James P. Hughes
arxiv.org/abs/2510.08972

@arXiv_csAR_bot@mastoxiv.page
2025-09-12 07:32:59

Combating the Memory Walls: Optimization Pathways for Long-Context Agentic LLM Inference
Haoran Wu, Can Xiao, Jiayi Nie, Xuan Guo, Binglei Lou, Jeffrey T. H. Wong, Zhiwen Mo, Cheng Zhang, Przemyslaw Forys, Wayne Luk, Hongxiang Fan, Jianyi Cheng, Timothy M. Jones, Rika Antonova, Robert Mullins, Aaron Zhao
arxiv.org/abs/2509.09505

@arXiv_csCV_bot@mastoxiv.page
2025-09-11 09:32:43

Boosted Training of Lightweight Early Exits for Optimizing CNN Image Classification Inference
Yehudit Aperstein, Alexander Apartsin
arxiv.org/abs/2509.08318

@arXiv_csDC_bot@mastoxiv.page
2025-08-13 07:34:52

Profiling Concurrent Vision Inference Workloads on NVIDIA Jetson -- Extended
Abhinaba Chakraborty, Wouter Tavernier, Akis Kourtis, Mario Pickavet, Andreas Oikonomakis, Didier Colle
arxiv.org/abs/2508.08430

@arXiv_csPF_bot@mastoxiv.page
2025-08-13 08:06:12

Profiling Large Language Model Inference on Apple Silicon: A Quantization Perspective
Afsara Benazir, Felix Xiaozhu Lin
arxiv.org/abs/2508.08531

@arXiv_csDB_bot@mastoxiv.page
2025-08-13 07:39:42

FIER: Fine-Grained and Efficient KV Cache Retrieval for Long-context LLM Inference
Dongwei Wang, Zijie Liu, Song Wang, Yuxin Ren, Jianing Deng, Jingtong Hu, Tianlong Chen, Huanrui Yang
arxiv.org/abs/2508.08256

@arXiv_astrophCO_bot@mastoxiv.page
2025-09-12 09:28:29

Cosmology inference with perturbative forward modeling at the field level: a comparison with joint power spectrum and bispectrum analyses
Kazuyuki Akitsu, Marko Simonovi\'c, Shi-Fan Chen, Giovanni Cabass, Matias Zaldarriaga
arxiv.org/abs/2509.09673

@arXiv_mathNA_bot@mastoxiv.page
2025-09-11 08:12:43

Tensor-Train Operator Inference
Engin Danis, Duc Truong, Kim {\O}. Rasmussen{\S}, Boian S. Alexandrov
arxiv.org/abs/2509.08071 arxiv.org/pd…

@arXiv_mathST_bot@mastoxiv.page
2025-08-13 08:07:42

Toward Optimal Statistical Inference in Noisy Linear Quadratic Reinforcement Learning over a Finite Horizon
Bo Pan, Jianya Lu, Yafei Wang, Hao Li, Bei Jiang, Linglong Kong
arxiv.org/abs/2508.08436

@arXiv_csIT_bot@mastoxiv.page
2025-09-12 07:35:39

Improved Receiver Chain Performance via Error Location Inference
Michael Greenwood, Robert Hunter
arxiv.org/abs/2509.08869 arxiv.org/pdf/25…

@arXiv_csSD_bot@mastoxiv.page
2025-09-12 07:37:59

In situ estimation of the acoustic surface impedance using simulation-based inference
Jonas M. Schmid, Johannes D. Schmid, Martin Eser, Steffen Marburg
arxiv.org/abs/2509.08873

@arXiv_csCR_bot@mastoxiv.page
2025-09-12 09:09:09

CryptGNN: Enabling Secure Inference for Graph Neural Networks
Pritam Sen, Yao Ma, Cristian Borcea
arxiv.org/abs/2509.09107 arxiv.org/pdf/25…

@arXiv_csLG_bot@mastoxiv.page
2025-10-13 10:41:20

Efficient Bayesian Inference from Noisy Pairwise Comparisons
Till Aczel, Lucas Theis, Wattenhofer Roger
arxiv.org/abs/2510.09333 arxiv.org/…

@arXiv_csAI_bot@mastoxiv.page
2025-09-12 09:05:49

Jupiter: Enhancing LLM Data Analysis Capabilities via Notebook and Inference-Time Value-Guided Search
Shuocheng Li, Yihao Liu, Silin Du, Wenxuan Zeng, Zhe Xu, Mengyu Zhou, Yeye He, Haoyu Dong, Shi Han, Dongmei Zhang
arxiv.org/abs/2509.09245

@arXiv_csCL_bot@mastoxiv.page
2025-10-13 10:30:20

FLRC: Fine-grained Low-Rank Compressor for Efficient LLM Inference
Yu-Chen Lu, Chong-Yan Chen, Chi-Chih Chang, Yu-Fang Hu, Kai-Chiang Wu
arxiv.org/abs/2510.09332

@arXiv_csET_bot@mastoxiv.page
2025-10-13 07:33:30

When to Reason: Semantic Router for vLLM
Chen Wang, Xunzhuo Liu, Yuhan Liu, Yue Zhu, Xiangxi Mo, Junchen Jiang, Huamin Chen
arxiv.org/abs/2510.08731

@arXiv_csCR_bot@mastoxiv.page
2025-09-12 09:43:49

ENSI: Efficient Non-Interactive Secure Inference for Large Language Models
Zhiyu He, Maojiang Wang, Xinwen Gao, Yuchuan Luo, Lin Liu, Shaojing Fu
arxiv.org/abs/2509.09424

@arXiv_csRO_bot@mastoxiv.page
2025-10-13 08:27:50

Adaptive Motion Planning via Contact-Based Intent Inference for Human-Robot Collaboration
Jiurun Song, Xiao Liang, Minghui Zheng
arxiv.org/abs/2510.08811

@arXiv_statML_bot@mastoxiv.page
2025-10-13 09:11:30

Conditional Flow Matching for Bayesian Posterior Inference
So Won Jeong, Percy S. Zhai, Veronika Ro\v{c}ov\'a
arxiv.org/abs/2510.09534

@arXiv_mathLO_bot@mastoxiv.page
2025-10-13 07:53:00

The Fractal Logic of $\Phi$-adic Recursion
Milan Rosko
arxiv.org/abs/2510.08934 arxiv.org/pdf/2510.08934

@arXiv_csSE_bot@mastoxiv.page
2025-09-12 09:05:19

CLARA: A Developer's Companion for Code Comprehension and Analysis
Ahmed Adnan, Mushfiqur Rahman, Saad Sakib Noor, Kazi Sakib
arxiv.org/abs/2509.09072

@arXiv_csHC_bot@mastoxiv.page
2025-09-12 09:49:49

The Impact of Device Type, Data Practices, and Use Case Scenarios on Privacy Concerns about Eye-tracked Augmented Reality in the United States and Germany
Efe Bozkir, Babette B\"uhler, Xiaoyuan Wu, Enkelejda Kasneci, Lujo Bauer, Lorrie Faith Cranor
arxiv.org/abs/2509.09285

@arXiv_csLO_bot@mastoxiv.page
2025-09-09 08:27:02

Compositional Inductive Invariant Inference via Assume-Guarantee Reasoning
Ian Dardik, Eunsuk Kang
arxiv.org/abs/2509.06250 arxiv.org/pdf/2…

@arXiv_statME_bot@mastoxiv.page
2025-10-13 07:59:40

A Design-based Solution for Causal Inference with Text: Can a Language Model Be Too Large?
Graham Tierney, Srikar Katta, Christopher Bail, Sunshine Hillygus, Alexander Volfovsky
arxiv.org/abs/2510.08758

@arXiv_astrophGA_bot@mastoxiv.page
2025-09-10 08:36:51

LIMFAST. IV. Learning High-Redshift Galaxy Formation from Multiline Intensity Mapping with Implicit Likelihood Inference
Guochao Sun, Tri Nguyen, Claude-Andr\'e Faucher-Gigu\`ere, Adam Lidz, Tjitske Starkenburg, Bryan R. Scott, Tzu-Ching Chang, Steven R. Furlanetto
arxiv.org/abs/2509.07060

@arXiv_qfinRM_bot@mastoxiv.page
2025-09-11 08:14:03

Chaotic Bayesian Inference: Strange Attractors as Risk Models for Black Swan Events
Crystal Rust
arxiv.org/abs/2509.08183 arxiv.org/pdf/250…

@arXiv_qbioTO_bot@mastoxiv.page
2025-10-13 08:27:40

Unsupervised full-field Bayesian inference of orthotropic hyperelasticity from a single biaxial test: a myocardial case study
Rogier P. Krijnen, Akshay Joshi, Siddhant Kumar, Mathias Peirlinck
arxiv.org/abs/2510.09498

@arXiv_csCL_bot@mastoxiv.page
2025-10-13 10:29:50

Mask Tokens as Prophet: Fine-Grained Cache Eviction for Efficient dLLM Inference
Jianuo Huang, Yaojie Zhang, Yicun Yang, Benhao Huang, Biqing Qi, Dongrui Liu, Linfeng Zhang
arxiv.org/abs/2510.09309

@arXiv_csCR_bot@mastoxiv.page
2025-08-13 07:57:52

Selective KV-Cache Sharing to Mitigate Timing Side-Channels in LLM Inference
Kexin Chu, Zecheng Lin, Dawei Xiang, Zixu Shen, Jianchang Su, Cheng Chu, Yiwei Yang, Wenhui Zhang, Wenfei Wu, Wei Zhang
arxiv.org/abs/2508.08438

@arXiv_csIR_bot@mastoxiv.page
2025-08-13 09:24:32

SPARC: Soft Probabilistic Adaptive multi-interest Retrieval Model via Codebooks for recommender system
Jialiang Shi, Yaguang Dou, Tian Qi
arxiv.org/abs/2508.09090

@arXiv_statCO_bot@mastoxiv.page
2025-10-13 08:57:20

Bayesian Model Inference using Bayesian Quadrature: the Art of Acquisition Functions and Beyond
Jingwen Song, Pengfei Wei
arxiv.org/abs/2510.08974

@arXiv_astrophHE_bot@mastoxiv.page
2025-09-10 09:00:31

When (not) to trust Monte Carlo approximations for hierarchical Bayesian inference
Jack Heinzel, Salvatore Vitale
arxiv.org/abs/2509.07221

@arXiv_csLG_bot@mastoxiv.page
2025-09-12 09:53:19

Adaptive Pareto-Optimal Token Merging for Edge Transformer Models in Semantic Communication
Omar Erak, Omar Alhussein, Hatem Abou-Zeid, Mehdi Bennis
arxiv.org/abs/2509.09168

@arXiv_econEM_bot@mastoxiv.page
2025-09-11 09:00:03

Posterior inference of attitude-behaviour relationships using latent class choice models
Akshay Vij, Stephane Hess
arxiv.org/abs/2509.08373

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-10-13 08:57:30

Restoring detailed balance in non-Hermitian Markov processes
Tim Van Wesemael, Gilberto Nakamura, Jan Baetens, Odemir M. Bruno, Alexandre S. Martinez, Christophe Deroulers
arxiv.org/abs/2510.09467

@arXiv_csAI_bot@mastoxiv.page
2025-09-12 09:38:09

Boosting Embodied AI Agents through Perception-Generation Disaggregation and Asynchronous Pipeline Execution
Shulai Zhang, Ao Xu, Quan Chen, Han Zhao, Weihao Cui, Ningxin Zheng, Haibin Lin, Xin Liu, Minyi Guo
arxiv.org/abs/2509.09560

@fanf@mendeddrum.org
2025-08-26 17:42:03

from my link log —
Type inference for plain data.
haskellforall.com/2025/08/type
saved 2025-08-13

@arXiv_eessSP_bot@mastoxiv.page
2025-09-08 08:12:00

Communication-Efficient Collaborative LLM Inference via Distributed Speculative Decoding
Ce Zheng, Tingting Yang
arxiv.org/abs/2509.04576 a…

@arXiv_csCV_bot@mastoxiv.page
2025-09-10 10:43:41

Active Membership Inference Test (aMINT): Enhancing Model Auditability with Multi-Task Learning
Daniel DeAlcala, Aythami Morales, Julian Fierrez, Gonzalo Mancera, Ruben Tolosana, Javier Ortega-Garcia
arxiv.org/abs/2509.07879

@arXiv_csDC_bot@mastoxiv.page
2025-09-10 08:34:31

DuoServe-MoE: Dual-Phase Expert Prefetch and Cache Scheduling for Efficient MoE LLM Inference
Yuning Zhang, Grant Pinkert, Nan Yang, Yanli Li, Dong Yuan
arxiv.org/abs/2509.07379

@arXiv_statME_bot@mastoxiv.page
2025-10-13 09:39:00

Defensive Model Expansion for Robust Bayesian Inference
Antonio R. Linero
arxiv.org/abs/2510.09598 arxiv.org/pdf/2510.09598

@arXiv_csAR_bot@mastoxiv.page
2025-10-10 07:36:49

SPAD: Specialized Prefill and Decode Hardware for Disaggregated LLM Inference
Hengrui Zhang, Pratyush Patel, August Ning, David Wentzlaff
arxiv.org/abs/2510.08544

@Techmeme@techhub.social
2025-09-05 13:40:43

Baseten, which helps companies launch open-source or custom AI models, raised a $150M Series D led by Bond at a $2.15B valuation, up from $825M in February (Allie Garfinkle/Fortune)
fortune.com/2025/09/05/exclusi

@arXiv_csSE_bot@mastoxiv.page
2025-09-11 09:07:33

Handling Open-Vocabulary Constructs in Formalizing Specifications: Retrieval-Augmented Parsing with Expert Knowledge
Mohammad Saqib Hasan, Sayontan Ghosh, Dhruv Verma, Geoff Kuenning, Erez Zadok, Scott A. Smolka, Niranjan Balasubramanian
arxiv.org/abs/2509.08808

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 10:32:30

Utilizing dynamic sparsity on pretrained DETR
Reza Sedghi, Anand Subramoney, David Kappel
arxiv.org/abs/2510.09380 arxiv.org/pdf/2510.09380…

@arXiv_statME_bot@mastoxiv.page
2025-10-13 09:37:40

Uncertainty Quantification for Multi-level Models Using the Survey-Weighted Pseudo-Posterior
Matthew R. Williams, F. Hunter McGuire, Terrance D. Savitsky
arxiv.org/abs/2510.09401

@arXiv_csSD_bot@mastoxiv.page
2025-09-12 08:45:59

DiFlow-TTS: Discrete Flow Matching with Factorized Speech Tokens for Low-Latency Zero-Shot Text-To-Speech
Ngoc-Son Nguyen, Hieu-Nghia Huynh-Nguyen, Thanh V. T. Tran, Truong-Son Hy, Van Nguyen
arxiv.org/abs/2509.09631

@arXiv_csLO_bot@mastoxiv.page
2025-10-10 08:12:48

Dynamic Automated Deduction by Contradiction Separation: The Standard Extension Algorithm
Yang Xu, Xingxing He, Shuwei Chen, Jun Liu, Xiaomei Zhong
arxiv.org/abs/2510.08468

@arXiv_statML_bot@mastoxiv.page
2025-10-10 09:22:29

Stick-Breaking Mixture Normalizing Flows with Component-Wise Tail Adaptation for Variational Inference
Seungsu Han, Juyoung Hwang, Won Chang
arxiv.org/abs/2510.07965

@arXiv_astrophCO_bot@mastoxiv.page
2025-09-11 07:57:52

Taking the Weight Off: Mitigating Parameter Bias from Catastrophic Outliers in 3$\times$2pt Analysis
Carolyn McDonald Mill, C. Danielle Leonard, Markus Michael Rau, Cora Uhlemann, Shahab Joudaki
arxiv.org/abs/2509.08052

@arXiv_csAR_bot@mastoxiv.page
2025-09-09 07:31:41

High Utilization Energy-Aware Real-Time Inference Deep Convolutional Neural Network Accelerator
Kuan-Ting Lin, Ching-Te Chiu, Jheng-Yi Chang, Shi-Zong Huang, Yu-Ting Li
arxiv.org/abs/2509.05688

@arXiv_csLG_bot@mastoxiv.page
2025-10-09 10:50:21

A Multi-Agent Framework for Stateful Inference-Time Search
Arshika Lalan, Rajat Ghosh, Aditya Kolsur, Debojyoti Dutta
arxiv.org/abs/2510.07147

@arXiv_csRO_bot@mastoxiv.page
2025-09-11 08:50:03

SVN-ICP: Uncertainty Estimation of ICP-based LiDAR Odometry using Stein Variational Newton
Shiping Ma, Haoming Zhang, Marc Toussaint
arxiv.org/abs/2509.08069

@arXiv_csAI_bot@mastoxiv.page
2025-09-11 07:41:02

Automatic Failure Attribution and Critical Step Prediction Method for Multi-Agent Systems Based on Causal Inference
Guoqing Ma, Jia Zhu, Hanghui Guo, Weijie Shi, Jiawei Shen, Jingjiang Liu, Yidan Liang
arxiv.org/abs/2509.08682

@arXiv_csCV_bot@mastoxiv.page
2025-09-12 10:08:09

Image Recognition with Vision and Language Embeddings of VLMs
Illia Volkov, Nikita Kisel, Klara Janouskova, Jiri Matas
arxiv.org/abs/2509.09311

@arXiv_csDC_bot@mastoxiv.page
2025-08-13 12:09:43

Replaced article(s) found for cs.DC. arxiv.org/list/cs.DC/new
[1/1]:
- Keep Your Friends Close: Leveraging Affinity Groups to Accelerate AI Inference Workflows
Thiago Garrett, Weijia Song, Roman Vitenberg, Ken Birman

@arXiv_csCL_bot@mastoxiv.page
2025-08-13 10:10:42

Retrospective Sparse Attention for Efficient Long-Context Generation
Seonghwan Choi, Beomseok Kang, Dongwon Jo, Jae-Joon Kim
arxiv.org/abs/2508.09001

@Techmeme@techhub.social
2025-08-29 15:10:48

FriendliAI, which aims to help companies run AI model inference faster and cheaper, raised a $20M extension to its $6M seed fund from late 2021 (Mary Ann Azevedo/Crunchbase News)
news.crunchbase.com/ai/inferen

@arXiv_csAI_bot@mastoxiv.page
2025-09-12 07:30:19

An Interval Type-2 Version of Bayes Theorem Derived from Interval Probability Range Estimates Provided by Subject Matter Experts
John T. Rickard, William A. Dembski, James Rickards
arxiv.org/abs/2509.08834

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 10:33:00

Dynamic Weight-based Temporal Aggregation for Low-light Video Enhancement
Ruirui Lin, Guoxi Huang, Nantheera Anantrasirichai
arxiv.org/abs/2510.09450

@arXiv_astrophCO_bot@mastoxiv.page
2025-10-13 09:21:50

Cosmology Likelihood for Observables in \Euclid (CLOE). 1. Theoretical recipe
Collaboration, Cardone, Joudaki, Blot, Bonici, Camera, Ca\~nas-Herrera, Carrilho, Casas, Davini, Di Domizio, Farrens, Goh, Beauchamps, Ili\'c, Keil, Le Brun, Martinelli, Moretti, Pettorino, Pezzotta, S\'anchez, Sakr, Sciotti, Tanidis, Tutusaus, Ajani, Crocce, Giocoli, Legrand, Lembo, Lesci, Girones, Nouri-Zonoz, Pamuk, Tsedrik, Bel, Carbone, Duncan, Kilbinger, Lacasa, Lattanzi, Sapone, Sellentin, Tayl…

@arXiv_csCR_bot@mastoxiv.page
2025-09-09 12:06:22

Imitative Membership Inference Attack
Yuntao Du, Yuetian Chen, Hanshen Xiao, Bruno Ribeiro, Ninghui Li
arxiv.org/abs/2509.06796 arxiv.org/p…

@arXiv_csLG_bot@mastoxiv.page
2025-10-13 10:41:10

Safety Game: Balancing Safe and Informative Conversations with Blackbox Agentic AI using LP Solvers
Tuan Nguyen, Long Tran-Thanh
arxiv.org/abs/2510.09330

@arXiv_csCL_bot@mastoxiv.page
2025-09-12 09:40:19

CCF: A Context Compression Framework for Efficient Long-Sequence Language Modeling
Wenhao Li, Bangcheng Sun, Weihao Ye, Tianyi Zhang, Daohai Yu, Fei Chao, Rongrong Ji
arxiv.org/abs/2509.09199

@arXiv_csAR_bot@mastoxiv.page
2025-09-11 08:37:53

BitROM: Weight Reload-Free CiROM Architecture Towards Billion-Parameter 1.58-bit LLM Inference
Wenlun Zhang, Xinyu Li, Shimpei Ando, Kentaro Yoshioka
arxiv.org/abs/2509.08542

@arXiv_statML_bot@mastoxiv.page
2025-09-09 09:06:22

MOSAIC: Minimax-Optimal Sparsity-Adaptive Inference for Change Points in Dynamic Networks
Yingying Fan, Jingyuan Liu, Jinchi Lv, Ao Sun
arxiv.org/abs/2509.06303

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 10:27:10

RadioFlow: Efficient Radio Map Construction Framework with Flow Matching
Haozhe Jia, Wenshuo Chen, Xiucheng Wang, Nan Cheng, Hongbo Zhang, Kuimou Yu, Songning Lai, Nanjian Jia, Bowen Tian, Hongru Xiao, Yutao Yue
arxiv.org/abs/2510.09314

@arXiv_csAI_bot@mastoxiv.page
2025-10-13 09:47:40

OSCAR: Orthogonal Stochastic Control for Alignment-Respecting Diversity in Flow Matching
Jingxuan Wu, Zhenglin Wan, Xingrui Yu, Yuzhe Yang, Bo An, Ivor Tsang
arxiv.org/abs/2510.09060

@arXiv_csCL_bot@mastoxiv.page
2025-10-13 10:43:40

Prompting Test-Time Scaling Is A Strong LLM Reasoning Data Augmentation
Sondos Mahmoud Bsharat, Zhiqiang Shen
arxiv.org/abs/2510.09599 arxi…

@arXiv_csCR_bot@mastoxiv.page
2025-10-10 08:34:39

Comparison of Fully Homomorphic Encryption and Garbled Circuit Techniques in Privacy-Preserving Machine Learning Inference
Kalyan Cheerla (University of North Texas), Lotfi Ben Othmane (University of North Texas), Kirill Morozov (University of North Texas)
arxiv.org/abs/2510.07457

@arXiv_csRO_bot@mastoxiv.page
2025-10-07 11:45:42

HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks
Zheng Xiong, Kang Li, Zilin Wang, Matthew Jackson, Jakob Foerster, Shimon Whiteson
arxiv.org/abs/2510.04898

@arXiv_statME_bot@mastoxiv.page
2025-08-13 08:42:02

Doubly robust pointwise confidence intervals for a monotonic continuous treatment effect curve
Charles R. Doss
arxiv.org/abs/2508.08415 arx…

@arXiv_csAI_bot@mastoxiv.page
2025-10-13 09:54:30

PAC Reasoning: Controlling the Performance Loss for Efficient Reasoning
Hao Zeng, Jianguo Huang, Bingyi Jing, Hongxin Wei, Bo An
arxiv.org/abs/2510.09133

@arXiv_csCV_bot@mastoxiv.page
2025-08-13 10:21:12

Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices
Ya Zou, Jingfeng Yao, Siyuan Yu, Shuai Zhang, Wenyu Liu, Xinggang Wang
arxiv.org/abs/2508.09136

@arXiv_statML_bot@mastoxiv.page
2025-09-09 08:54:51

Fisher Random Walk: Automatic Debiasing Contextual Preference Inference for Large Language Model Evaluation
Yichi Zhang, Alexander Belloni, Ethan X. Fang, Junwei Lu, Xiaoan Xu
arxiv.org/abs/2509.05852

@arXiv_csLG_bot@mastoxiv.page
2025-10-10 11:05:29

Mix- and MoE-DPO: A Variational Inference Approach to Direct Preference Optimization
Jason Bohne, Pawel Polak, David Rosenberg, Brian Bloniarz, Gary Kazantsev
arxiv.org/abs/2510.08256

@arXiv_csCL_bot@mastoxiv.page
2025-10-13 10:36:50

Hybrid Models for Natural Language Reasoning: The Case of Syllogistic Logic
Manuel Vargas Guzm\'an, Jakub Szymanik, Maciej Malicki
arxiv.org/abs/2510.09472

@arXiv_csCV_bot@mastoxiv.page
2025-09-11 07:58:33

Video Parallel Scaling: Aggregating Diverse Frame Subsets for VideoLLMs
Hyungjin Chung, Hyelin Nam, Jiyeon Kim, Hyojun Go, Byeongjun Park, Junho Kim, Joonseok Lee, Seongsu Ha, Byung-Hoon Kim
arxiv.org/abs/2509.08016

@arXiv_csAI_bot@mastoxiv.page
2025-10-13 10:02:20

Localist LLMs -- A Mathematical Framework for Dynamic Locality Control
Joachim Diederich
arxiv.org/abs/2510.09338 arxiv.org/pdf/2510.09338

@arXiv_statME_bot@mastoxiv.page
2025-08-13 09:37:32

Sensitivity Analysis to Unobserved Confounding with Copula-based Normalizing Flows
Sourabh Balgi, Marc Braun, Jose M. Pe\~na, Adel Daoud
arxiv.org/abs/2508.08752

@arXiv_csCL_bot@mastoxiv.page
2025-09-12 10:00:49

Steering MoE LLMs via Expert (De)Activation
Mohsen Fayyaz, Ali Modarressi, Hanieh Deilamsalehy, Franck Dernoncourt, Ryan Rossi, Trung Bui, Hinrich Sch\"utze, Nanyun Peng
arxiv.org/abs/2509.09660

@arXiv_csCR_bot@mastoxiv.page
2025-10-08 09:39:59

Membership Inference Attacks on Tokenizers of Large Language Models
Meng Tong, Yuntao Du, Kejiang Chen, Weiming Zhang, Ninghui Li
arxiv.org/abs/2510.05699

@arXiv_csLG_bot@mastoxiv.page
2025-10-06 10:27:19

Best-of-Majority: Minimax-Optimal Strategy for Pass@$k$ Inference Scaling
Qiwei Di, Kaixuan Ji, Xuheng Li, Heyang Zhao, Quanquan Gu
arxiv.org/abs/2510.03199

@arXiv_csCV_bot@mastoxiv.page
2025-09-12 10:06:19

Modality-Agnostic Input Channels Enable Segmentation of Brain lesions in Multimodal MRI with Sequences Unavailable During Training
Anthony P. Addison, Felix Wagner, Wentian Xu, Natalie Voets, Konstantinos Kamnitsas
arxiv.org/abs/2509.09290

@arXiv_csAI_bot@mastoxiv.page
2025-10-13 09:31:10

Tiny-R1V: Lightweight Multimodal Unified Reasoning Model via Model Merging
Qixiang Yin, Huanjin Yao, Jianghao Chen, Jiaxing Huang, Zhicheng Zhao, Fei Su
arxiv.org/abs/2510.08987

@arXiv_statME_bot@mastoxiv.page
2025-09-11 08:45:33

Doubly robust average treatment effect estimation for survival data
Byeonghee Lee, Joonsung Kang
arxiv.org/abs/2509.08788 arxiv.org/pdf/250…

@arXiv_csLG_bot@mastoxiv.page
2025-09-10 10:39:51

MoE-Compression: How the Compression Error of Experts Affects the Inference Accuracy of MoE Model?
Songkai Ma, Zhaorui Zhang, Sheng Di, Benben Liu, Xiaodong Yu, Xiaoyi Lu, Dan Wang
arxiv.org/abs/2509.07727

@arXiv_csCR_bot@mastoxiv.page
2025-09-09 11:48:22

DCMI: A Differential Calibration Membership Inference Attack Against Retrieval-Augmented Generation
Xinyu Gao, Xiangtao Meng, Yingkai Dong, Zheng Li, Shanqing Guo
arxiv.org/abs/2509.06026

@arXiv_csLG_bot@mastoxiv.page
2025-10-10 11:06:29

Dynamic Features Adaptation in Networking: Toward Flexible training and Explainable inference
Yannis Belkhiter, Seshu Tirupathi, Giulio Zizzo, Merim Dzaferagic, John D. Kelleher
arxiv.org/abs/2510.08303

@arXiv_statME_bot@mastoxiv.page
2025-09-09 10:21:32

Bayesian Inference for Confounding Variables and Limited Information
Ellis Scharfenaker, Duncan K. Foley
arxiv.org/abs/2509.05520 arxiv.org…

@arXiv_csAI_bot@mastoxiv.page
2025-10-07 12:16:52

Staircase Streaming for Low-Latency Multi-Agent Inference
Junlin Wang (Zach), Jue Wang (Zach), Zhen (Zach), Xu, Ben Athiwaratkun, Bhuwan Dhingra, Ce Zhang, James Zou
arxiv.org/abs/2510.05059

@arXiv_csLG_bot@mastoxiv.page
2025-10-08 10:49:49

Empirical Comparison of Membership Inference Attacks in Deep Transfer Learning
Yuxuan Bai, Gauri Pradhan, Marlon Tobaben, Antti Honkela
arxiv.org/abs/2510.05753

@arXiv_csLG_bot@mastoxiv.page
2025-10-08 10:43:39

(Token-Level) \textbf{InfoRMIA}: Stronger Membership Inference and Memorization Assessment for LLMs
Jiashu Tao, Reza Shokri
arxiv.org/abs/2510.05582